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METHODS OF DIAGNOSIS OF ANGIOGENESIS, COMPOSITIONS 
AND METHODS OF SCREENING FOR ANGIOGENESIS 
5 MODULATORS 



CROSS-REFERENCES TO RELATED APPLICATIONS 
This application claims priority to USSN 09/784,356, filed February 14 2001; 
10 USSN 09/791,390, filed February 22, 2001; USSN 60/285,475, filed April 19, 2001, USSN 
60/310,025, filed August 3, 2001, and USSN 60/334,244, filed November 29, 2001, each of 
which is herein incorporated by reference in its entirety. 

FIELD OF THE INVENTION 
1 5 The invention relates to the identification of nucleic acid and protein 

expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
angiogenesis; and to the use of such expression profiles and compositions in diagnosis and 
therapy of angiogenesis. The invention further relates to methods for identifying and using 
agents and/or targets that modulate angiogenesis. 

20 

BACKGROUND OF THE INVENTION 
Both vasculogenesis, the development of an interactive vascular system 
comprising arteries and veins, and angiogenesis, the generation of new blood vessels, play a 
role in embryonic development. In contrast, angiogenesis is limited in a normal adult to the 

25 placenta, ovary, endometrium and sites of wound healing. However, angiogenesis, or its 

absence, plays an important role in the maintenance of a variety of pathological states. Some 
of these states are characterized by neovascularization, e.g., cancer, diabetic retinopathy, 
glaucoma, and age related macular degeneration. Others, e.g., stroke, infertility, heart 
disease, ulcers, and scleroderma, are diseases of angiogenic insufficiency. 

30 Angiogenesis has a number of stages (see, e.g., Folkman, J.Natl Cancer Inst 

82:4-6, 1990; Firestein, J Clin Invest. 103:3-4, 1999; Koch, Arthritis Rheum AVBS 1-62, 1998; 
Carter, Oncologist 5(Suppl l):51-4, 2000; Browder et al., Cancer Res. 60:1878-86, 2000; and 
Zhu and Witte, Invest New Drugs 17:195-212, 1999). The early stages of angiogenesis 
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include endothelial cell protease production, migration of cells, and proliferation. The early 
stages also appear to require some growth factors, with VEGF, TGF-ct, angiostatin, and 
selected chemokines all putatively playing a role. Later stages of angiogenesis include 
population of the vessels with mural cells (pericytes or smooth muscle cells), basement 
membrane production, and the induction of vessel bed specializations. The final stages of 
vessel formation include what is known as "remodeling", wherein a forming vasculature 
becomes a stable, mature vessel bed. Thus, the process is highly dynamic, often requiring 
coordinated spatial and temporal waves of gene expression. 

Conversely, the complex process may be subject to disruption by interfering 
with one or more critical steps. Thus, the lack of understanding of the dynamics of 
angiogenesis prevents therapeutic intervention in serious diseases such as those indicated. It 
is an object of the invention to provide methods that can be used to screen compounds for the 
ability to modulate angiogenesis. Additionally, it is an object to provide molecular targets for 
therapeutic intervention in disease states which either have an undesirable excess or a deficit 
in angiogenesis. The present invention provides solutions to both. 

SUMMARY OF THE INVENTION 
The present invention provides compositions and methods for detecting or 
modulating angiogenesis associated sequences. 
20 In one aspect, the invention provides a method of detecting an angiogenesis- 

associated transcript in a cell in a patient, the method comprising contacting a biological 
sample from the patient with a polynucleotide that selectively hybridized to a sequence at 
least 80% identical to a sequence as shown in Tables 1-8. In one embodiment, the biological 
sample is a tissue sample. In another embodiment, the biological sample comprises isolated 
25 nucleic acids, which are often mRNA. 

In another embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 
Often, the polynucleotide comprises a sequence as shown in Tables 1-8. The polynucleotide 
can be labeled, for example, with a fluorescent label and can be immobilized on a solid 
30 surface. 

In other embodiments the patient is undergoing a therapeutic regimen to treat a 
disease associated with angiogenesis or the patient is suspected of having an angiogenesis- 
associated disorder. 



10 



2 



WO 02/079492 PCT/US02/04915 

In another aspect, the invention comprises an isolated nucleic acid molecule 

consisting of a polynucleotide sequence as shown in Tables 1-8. The nucleic acid molecule 

can be labeled, for example, with a fluorescent label, 

In other aspects, the invention provides an expression vector comprising an 
5 isolated nucleic acid molecule consisting of a polynucleotide sequence as shown in Tables 1- 

8 or a host cell comprising the expression vector- 
In another embodiment, the isolated nucleic acid molecule encodes a 

polypeptide having an amino acid sequence as shown in Table 8. 

In another aspect, the invention provides an isolated polypeptide which is 
10 encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1-8. 

In one embodiment, the isolated polypeptide has an amino acid sequence as shown in Table 

8. 

In another embodiment, the invention provides an antibody that specifically 

binds a polypeptide that has an amino acid sequence as shown in Table 8 or which is encoded 
15 by a nucleotide sequence of Tables 1 -8 . The antibody can be conjugated or fused to an 

effector component such as a fluorescent label, a toxin, or a radioisotope. In some . 

embodiments, the antibody is an antibody fragment or a humanized antibody. 

In another aspect, the invention provides a method of detecting a cell 

undergoing angiogenesis in a biological sample from a patient, the method comprising 
20 contacting the biological sample with an antibody that specifically binds to a polypeptide that 

has an amino acid sequence as shown in Table 8 or which is encoded by a nucleotide 

sequence of Tables 1-8 . In some embodiments, the antibody is further conjugated or fused to 

an effector component, for example, a fluorescent label. 

In another embodiment, the invention provides a method of detecting 
25 antibodies specific to angiogenesis in a patient, the method comprising contacting a 

biological sample from the patient with a polypeptide which is encoded by a nucleotide 

sequence of Tables 1-8. 

The invention also provides a method of identifying a compound that 

modulates the activity of an angiogenesis-associated polypeptide, the method comprising the 
30 steps of: (i) contacting the compound with a polypeptide that comprises at least 80% identity 

to an amino acid sequence as shown in Table 8 or which is encoded by a nucleotide sequence 

of Tables 1-8; and (ii) detecting an increase or a decrease in the activity of the polypeptide. 

In one embodiment, the polypeptide has an amino acid sequence as shown in Table 8 or is a 
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polypeptide encoded by a nucleotide sequence of Tables 1-8. Li another embodiment, the 
polypeptide is expressed in a cell. 

The invention also provides a method of identifying a compound that 
modulates angiogenesis, the method comprising steps of: (i) contacting the compound with a 
5 cell undergoing angiogenesis; and (ii) detecting an increase or a decrease in the expression of 
a polypeptide sequence as shown in Table 8 or a polypeptide which is encoded by a 
nucleotide sequence of Tables 1-8. In one embodiment, the detecting step comprises 
hybridizing a nucleic acid sample from the cell with a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-8. In 

1 0 another embodiment, the method further comprises detecting an increase or decrease in the 
expression of a second sequence as shown in Table 8 or a polypeptide which is encoded by a 
nucleotide sequence of Tables 1-8 . 

In another embodiment, the invention provides a method of inhibiting 
angiogenesis in a cell that expresses a polypeptide at least 80% identical to a sequence as 

15 shown in Table 8 or which is 80% identical to a polypeptide encoded by a nucleotide 
sequence of Tables 1-8 , the method comprising the step of contacting the cell with a 
therapeutically effective amount of an inhibitor of the polypeptide. In one embodiment, the 
polypeptide has an amino acid sequence shown in Table 8 or is a polypeptide which is 
encoded by a nucleotide sequence of Tables 1-8 . In another embodiment, the inhibitor is an 

20 antibody. 

In other embodiments, the invention provides a method of activating 
angiogenesis in a cell that expresses a polypeptide at least 80% identical to a sequence as 
shown in Table 8 or at least 80% identical to a polypeptide which is encoded by a nucleotide 
sequence of Tables 1-8 , the method comprising the step of contacting the cell with a 
25 therapeutically effective amount of an activator of the polypeptide. In one embodiment, the 
polypeptide has an amino acid sequence shown in Table 8 or is a polypeptide which is 
encoded by a nucleotide sequence of Tables 1-8. 

Other aspects of the invention will become apparent to the skilled artisan by 
the following description of the invention. 
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Tables 1-8 provide nucleotide sequence of genes that exhibit changes in 
expression levels as a function of time in tissue undergoing angiogenesis compared to tissue 
that is not. 
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DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and treatment of disorders associated with angiogenesis 
(sometimes referred to herein as angiogenesis disorders or AD), as well as methods for 
5 screening for compositions which modulate angiogenesis. By "disorder associated with 

angiogenesis" or "disease associated with angiogenesis" herein is meant a disease state which 
is marked by either an excess or a deficit of blood vessel development. Angiogenesis 
disorders asociated with increased angiogenesis include, but are not limited to, cancer and 
proliferative diabetic retinopathy. Pathological states for which it may be desirable to 

10 increase angiogenesis include stroke, heart disease, infertility, ulcers, wound healing, 

ischemia, and scleradoma. Solid tumors typically require angiogenesis to support or sustain 
growth, e.g., breast, colon, lung, brain, bladder, and prostate tumors. Other AD include, e.g., 
arthritis, inflammatory bowel disease, diabetis retinopathy, macular degeneration, 
atherosclerosis, and psoriasis. Also provided are methods for treating AD. 

15 Definitions 

The term "angiogenesis protein" or "angiogenesis polynucleotide" refers to 
nucleic acid and polypeptide polymorphic variants, alleles, mutants, and interspecies 
homologs that: (1) have an amino acid sequence that has greater than about 60% amino acid 
sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 

20 96%, 97%, 98% or 99% or greater amino acid sequence identity, preferably over a region of 
over a region of at least about 25, 50, 100, 200, 500, 1000, or more amino acids, to an 
angiogenesis protein sequence of Table 8; (2) bind to antibodies, e.g., polyclonal antibodies, 
raised against an immunogen comprising an amino acid sequence of Table 8, and 
conservatively modified variants thereof; (3) specifically hybridize under stringent 

25 hybridization conditions to an anti-sense strand corresponding to a nucleic acid sequence of 
Tables 1-8 and conservatively modified variants thereof; (4) have a nucleic acid sequence 
that has greater than about 95%, preferably greater than about 96%, 97%, 98%, 99%, or 
higher nucleotide sequence identity, preferably over a region of at least about 25, 50, 100, 
200, 500, 1000, or more nucleotides, to a sense sequence corresponding to one set out in 

30 Tables 1-8 . A polynucleotide or polypeptide sequence is typically from a mammal 

including, but not limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, 
horse, sheep, or any mammal. An "angiogenesis polypeptide" and an "angiogenesis 
polynucleotide," include both naturally occurring or recombinant. 
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A "full length" angiogenesis protein or nucleic acid refers to an agiogenesis 
polypeptide or polynucleotide sequence, or a variant thereof; that contains all of the elements 
normally contained in one or more naturally occurring, wild type angiogenesis polynucleotide 
or polypeptide sequences. The "fall length" may be prior to, or after, various stages of post- 
5 translation processing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of an angiogenic protein. Such samples include, 
but are not limited to, tissue isolated from primates, e.g., humans, or rodents, e.g., mice, and 
rats. Biological samples may also include sections of tissues such as biopsy and autopsy 
1 0 samples, and frozen sections taken for histologic purposes. A biological sample is typically 
obtained from a eukaryotic organism, most preferably a mammal such as a primate e.g., 
chimpanzee or human; cow; dog; cat; a rodent, e.g. 9 guinea pig, rat, mouse; rabbit; or a bird; 
reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
15 methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention in vivo. Archival tissues, having treatment or outcome histroy, will 
be particularly useful. 

20 The terms "identical" or percent "identity," in the context of two or more 

nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
same (i.e., about 70% identity, preferably 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 99%, or higher identity over a specified region (e.g., SEQ ID NOS: 1-229), 

25 when compared and aligned for maximum correspondence over a comparison window or 
designated region) as measured using a BLAST or BLAST 2.0 sequence comparison 
algorithms with default parameters described below, or by manual alignment and visual 
inspection (see, e.g. 9 NCBI web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such 
sequences are then said to be "substantially identical." This definition also refers to, or may 

30 be applied to, the compliment of a test sequence. The definition also includes sequences that 
have deletions and/or additions, as well as those that have substitutions. As described below, 
the preferred algorithms can account for gaps and the like. Preferably, identity exists over a 
region that is at least about 25 amino acids or nucleotides in length, or more preferably over a 
region that is 50-100 amino acids or nucleotides in length. 
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For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
5 program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
any one of the number of contiguous positions selected from the group consisting of from 20 

10 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a 
sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl 

15 Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol 
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Natl 
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 

20 visual inspection (see, e.g., Current Protocols in Molecular Biology (Ausubel et al, eds. 
1995 supplement)). 

A preferred example of algorithm that is suitable for determining percent 
sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which 
are described in Altschul et al, Nuc. Acids Res. 25:3389-3402 (1977) and Altschul et al t J. 

25 Mol Biol 215:403-410 (1990), respectively. BLAST and BLAST 2.0 are used, with the 

parameters described herein, to determine percent sequence identity for the nucleic acids and 
proteins of the invention. Software for performing BLAST analyses is publicly available 
through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). 
This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 

30 short words of length W in the query sequence, which either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a database 
sequence. T is referred to as the neighborhood word score threshold (Altschul et a/., supra). 
These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs 
containing them. The word hits are extended in both directions along each sequence for as 
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far as the cumulative alignment score can be increased. Cumulative scores are calculated 
using, for nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino 
acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the 
5 word hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 

10 uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1989)) alignments (B) of 
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

15 The BLAST algorithm also performs a statistical analysis of the similarity 

between two sequences (see, e.g. 9 Karlin & Altschul, Proc. Natl Acad Sci. USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N))> which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 

20 nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001. 

An indication that two nucleic acid sequences or polypeptides are substantially 
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 

25 reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, for example, where the two peptides differ only by conservative substitutions. 
Another indication that two nucleic acid sequences are substantially identical is that the two 
molecules or their complements hybridize to each other under stringent conditions, as 

30 described below. Yet another indication that two nucleic acid sequences are substantially 
identical is that the same primers can be used to amplify the sequences. 

A 4< host cell" is a naturally occurring cell or a transformed cell that contains an 
expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 
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prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

The terms "polypeptide," peptide" and "protein" are used interchangeably 
5 herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers and 
non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 

10 as well as amino acid analogs and amino acid mimetics that function in a manner similar to 
the naturally occurring amino acids. Naturally occurring amino acids are those encoded by 
the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is 

15 bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 
norieucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified 
R groups (e.g., norieucine) or modified peptide backbones, but retain the same basic chemical 
structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical 
compounds that have a structure that is different from the general chemical structure of an 

20 amino acid, but that functions in a manner similar to a naturally occurring amino acid. 

Amino acids may be referred to herein by either their commonly known three 
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

25 "Conservatively modified variants" applies to both amino acid and nucleic 

acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical sequences. Because of the degeneracy of the genetic code, a large 

30 number of functionally identical nucleic acids encode any given protein. For instance, the 
codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every 
. position where an alanine is specified by a codon, the codon can be altered to any of the 
corresponding codons described without altering the encoded polypeptide. Such nucleic acid 
variations are "silent variations," which are one species of conservatively modified 



9 



WO 02/079492 



PCT/US02/04915 



variations. Every nucleic acid sequence herein which encodes a polypeptide also describes 
every possible silent variation of the nucleic acid. One of skill will recognize that each codon 
in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, 
which is ordinarily the only codon for tryptophan) can be modified to yield a functionally 
5 identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a 
polypeptide is implicit in each described sequence with respect to the expression product, but 
not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 

10 sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant* ' where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing functionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 

15 polymorphic variants, interspecies homologs, and alleles of the invention. 

The following eight groups each contain amino acids that are conservative 
substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid 
(E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), 
Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan 

20 (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, eg., Creighton, 
Proteins (1984)). 

Macromolecular structures such as polypeptide structures can be described in 
terms of various levels of organization. For a general discussion of this organization, see, 
e.g, Alberts et at., Molecular Biology of the Cell (3 rd ed., 1994) and Cantor and Schimmel, 

25 Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980). 
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 
structures are commonly known as domains. Domains are portions of a polypeptide that 
form a compact unit of the polypeptide and are typically 25 to approximately 500 amino 

30 acids long. Typical domains are made up of sections of lesser organization such as stretches 
of p-sheet and a-helices. 'Tertiary structure" refers to the complete three dimensional 
structure of a polypeptide monomer. "Quaternary structure" refers to the three dimensional 
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structure formed, usually by the noncovalent association of independent tertiary units. 
Anisotropic terms are also known as energy terms. 

A "label" or a "detectable moiety'' is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
5 means. For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 
which can be made detectable, e.g., by incorporating a radiolabel into the peptide or used to 
detect antibodies specifically reactive with the peptide. 

An "effector" or "effector moiety" or "effector component" is a molecule that 

10 is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
The "effector" can be a variety of molecules including, for example, detection moieties 
including radioactive compounds, fluroescent compounds, an enzyme or substrate, tags such 
as epitope tags, a toxin; a chemotherapeutic agent; a lipase; an antibiotic; or a radioisotope 

15 emitting "hard" e.g. , beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 

20 using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g., biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 

25 through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere 
with hybridization. Thus, for example, probes may be peptide nucleic acids in which the 
constituent bases are joined by peptide bonds rather than phosphodiester linkages. It will be 

30 understood by one of skill in the art that probes may bind target sequences lacking complete 
complementarity with the probe sequence depending upon the stringency of the hybridization 
conditions. The probes are preferably directly labeled as with isotopes, chromOphores, 
lumiphores, chromogens, or indirectly labeled such as with biotin to which a streptavidin 
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complex may later bind. By assaying for the presence or absence of the probe, one can detect 
the presence or absence of the select sequence or subsequence. 

The term "recombinant" when used with reference, e.g 9 to a cell, or nucleic 
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
5 modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, for 
example, recombinant cells express genes that are not found within the native (non- 
recombinant) form of the cell or express native genes that are otherwise abnormally 
expressed, under expressed or not expressed at all. 

10 The term "heterologous" when used with reference to portions of a nucleic 

acid indicates that the nucleic acid comprises two or more subsequences that are not found in 
the same relationship to each other in nature. For instance, the nucleic acid is typically 
recombinantly produced, having two or more sequences from unrelated genes arranged to 
make a new functional nucleic acid, e.g,, a promoter from one source and a coding region 

15 from another source. Similarly, a heterologous protein indicates that the protein comprises 
two or more subsequences that are not found in the same relationship to each other in nature 
(e.g., a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 

20 acid sequences near the start site of transcription, such as, in the case of a polymerase II type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 

25 active under environmental or developmental regulation. The term "operably linked" refers 
to a functional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

30 An "expression vector" is a nucleic acid construct, generated recombinantly or 

synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 
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The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
5 The phrase "stringent hybridization conditions" refers to conditions under 

which a probe will hybridize to its target subsequence, typically in a complex mixture of 
nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 

10 Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic 
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 
(1993). Generally, stringent conditions are selected to be about 5-10°C lower than the 
thermal melting point (T„0 for the specific sequence at a defined ionic strength pH. The T m is 
the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% 

15 of the probes complementary to the target hybridize to the target sequence at equilibrium (as 
the target sequences are present in excess, at T m , 50% of the probes are occupied at 
equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other 
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 

20 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 
formamide. For selective or specific hybridization, a positive signal is at least two times 
background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following; 50% formamide, 5x SSC, and 1% SDS, 

25 incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency 
amplification, although annealing temperatures may vary between about 32°C and 48°C 
depending on primer length. For high stringency PCR amplification, a temperature of about 
62°C is typical, although high stringency annealing temperatures can range from about 50°C 

30 to about 65°C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90°C - 95°C for 
30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about 
72°C for 1 - 2 rnin. Protocols and guidelines for low and high stringency amplification 
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reactions are provided, e.g., in Innis et al (1990) PCR Protocols, A Guide to Methods and 
Applications, Academic Press, Inc. N.Y.). 

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially identical if the polypeptides which they encode are substantially identical. 
5 This occurs, for example, when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 37°C, and a wash in IX SSC at 45°C A positive hybridization is at least twice 

10 background. Those of ordinary skill will readily recognize that alternative hybridization and * 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for determining hybridization parameters are provided in numerous reference, e.g., 
and Current Protocols in Molecular Biology, ed. Ausubel, et al 

The phrase "functional effects" in the context of assays for testing compounds 

1 5 that modulate activity of an angiogenesis protein includes the determination of a parameter 
that is indirectly or directly under the influence of the angiogenesis protein, e.g., a functional, 
physical, or chemical effect, such as the ability to increase or decrease angiogenesis. It 
includes binding activity, the ability of cells to proliferate, expression in cells undergoing 
angiogenesis, and other characteristics of angiogenic cells. "Functional effects" include in 

20 vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of an 
angiogenesis protein sequence, e.g., functional, physical and chemical effects. Such 
functional effects can be measured by any means known to those skilled in the art, e.g., 

25 changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), 
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the angiogenesis protein; 
measuring binding activity or binding assays, e.g. binding to antibodies, and measuring 
cellular proliferation, particularly endothelial cell proliferation, cell viability, cell division 

30 especially of endothelial cells, lumen formation and capillary or vessel growth or formation. 
Determination of the functional effect of a compound on angiogenesis can also be performed 
using angiogenesis assays known to those of skill in the art such as an in vitro assays, e.g., in 
vitro endothelial cell tube formation assays, and other assays such as the chick CAM assay, 
the mouse corneal assay, and assays that assess vascularization of an implanted tumor. The 
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functional effects can be evaluated by many means known to those skilled in the art, e.g., 
microscopy for quantitative or qualitative measures of alterations in morphological features, 
e.g. y tube or blood vessel formation, measurement of changes in RNA or protein levels for 
angiogenesis-associated sequences, measurement of RNA stability, identification of 
5 downstream or reporter gene expression (CAT, luciferase, fj-gal, GFP and the like), e.g., via 
chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible 
markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of angiogenic polynucleotide and 
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 

10 identified using in vitro and in vivo assays of angiogenic polynucleotide and polypeptide 
sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block activity, 
decrease, prevent, delay activation, inactivate, desensitize, or down regulate the activity or 
expression of angiogenesis proteins, e.g., antagonists. "Activators" are compounds that 
increase, open, activate, facilitate, enhance activation, sensitize, agonize, or up regulate 

15 angiogenesis protein activity. Inhibitors, activators, or modulators also include genetically 
modified versions of angiogenesis proteins, e.g, versions with altered activity, as well as 
naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small chemical 
molecules and the like. Such assays for inhibitors and activators include, e.g., expressing the 
angiogenic protein in vitro, in cells, or cell membranes, applying putative modulator 

20 compounds, and then determining the functional effects on activity, as described above. 
Activators and inhibitors of angiogenesis can also be identified by incubating angiogenic 
cells with the test compound and determining increases or decreases in the expression of 1 or 
more angiogenesis proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or more angiogenesis 
proteins, such as angiogenesis proteins comprising the sequences set out in Table 8. 

25 Samples or assays comprising angiogenesis proteins that are treated with a 

potential activator, inhibitor, or modulator are compared to control samples without the 
inhibitor, activator, or modulator to examine the extent of inhibition. Control samples 
(untreated with inhibitors) are assigned a relative protein activity value of 100%. Inhibition 
of a polypeptide is achieved when the activity value relative to the control is about 80%, 

30 preferably 50%, more preferably 25-0%. Activation of an angiogenesis polypeptide is 

achieved when the activity value relative to the control (untreated with activators) is 110%, 
more preferably 150%, more preferably 200-500% (i.e., two to five fold higher relative to the 
control), more preferably 1000-3000% higher. 
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"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes* as well as the myriad immunoglobulin variable region 
5 genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody 
will be most critical in specificity and affinity of binding. 

An exemplary immunoglobulin (antibody) structural unit comprises a 

10 . tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-teiminus 
of each chain defines a variable region of about 100 to 110 or more amino acids primarily 
responsible for antigen recognition. The terms variable light chain (Vl) and variable heavy 
chain (Vh) refer to these light and heavy chains respectively. 

15 Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 

characterized fragments produced by digestion with various peptidases. Thus, for example, 
pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, 
a dimer of Fab which itself is a light chain joined to V H -C H 1 by a disulfide bond. The F(ab)' 2 
may be reduced under mild conditions to break the disulfide linkage in the hinge region, 

20 thereby converting the F(ab)' 2 dimer into an Fab' monomer. The Fab' monomer is 

essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed. 
1993). While various antibody fragments are defined in terms of the digestion of an intact 
antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 

25 herein, also includes antibody fragments either produced by the modification of whole 

antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 
chain Fv) or those identified using phage display libraries (see, e.g., McCafferty et aL 3 Nature 
348:552-554 (1990)) 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 

30 antibodies, many technique known in the art can be used (see, e.g. , Kohler & Milstein, 

Nature 256:495-497 (1975); Kozbor et al, Immunology Today 4: 72 (1983); Cole et al, pp. 
77-96 in Monoclonal Antibodies and Cancer Therapy, AlanR. Liss, Inc. (1985); Coligan, 
Current Protocols in Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual 
(1988); and Goding, Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). 
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Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can be 
adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or 
other organisms such as other mammals, may be used to express humanized antibodies. 
Alternatively, phage display technology can be used to identify antibodies and heteromeric 
5 Fab fragments that specifically bind to selected antigens {see, e.g. , McCafferty et al , Nature 
348:552-554 (1990); Marks et al., Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector function 
10 and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, e.g. y an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

The detailed description of the invention includes discussion of the following 
1 5 aspects of the invention: Expression of angiogenesis-associated sequences 

Informatics 

Angiogenesis-associated sequences 

Detection of angiogenesis sequence for diagnostic and 
therapeutic applications 

20 Modulators of angiogenesis 

Methods of identifying variant angiogenesis-associated 
sequences 

Administration of pharmaceutical and vaccine compositions 
Kits for use in diagnostic and/or prognostic applications. 

25 Expression of angiogenesis-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 

30 a number of genes simultaneously allows the generation of a gene expression profile that is 
unique to the state of the cell. That is, normal tissue may be distinguished from AD tissue. 
By comparing expression profiles of tissue in known different angiogenesis states, 
information regarding which genes are important (including both up- and down-regulation of 
genes) in each of these states is obtained. The identification of sequences that are 
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differentially expressed in angiogenic versus non-angiogenic tissue allows the use of this 
information in a number of ways. For example, a particular treatment regime may be 
evaluated: does a chemotherapeutic drug act to down-regulate angiogenesis, and thus tumor 
growth or recurrence, in a particular patient Similarly, diagnosis and treatment outcomes 
5 may be done or confirmed by comparing patient samples with the known expression profiles. 
Angiogenic tissue can also be analyzed to determine the stage of angiogenesis in the tissue. 
Furthermore, these gene expression profiles (or individual genes) allow screening of drug 
candidates with an eye to mimicking or altering a particular expression profile; for example, 
screening can be done for drugs that suppress the angiogenic expression profile. This may be 

10 done by making biochips comprising sets of the important angiogenesis genes, which can 
then be used in these screens. These methods can also be done on the protein basis; that is, 
protein expression levels of the angiogenic proteins can be evaluated for diagnostic purposes 
or to screen candidate agents. In addition, the angiogenic nucleic acid sequences can be 
administered for gene therapy purposes, including the administration of antisense nucleic 

1 5 acids, or the angiogenic proteins (including antibodies and other modulators thereof) 
administered as therapeutic drugs. 

Thus the present invention provides nucleic acid and protein sequences that 
are differentially expressed in angiogenesis, herein termed "angiogenesis sequences". As 
outlined below, angiogenesis sequences include those that are up-regulated (i.e. expressed at 

20 a higher level) in disorders associated with angiogenesis, as well as those that are down- 
regulated (i.e. expressed at a lower level). In a preferred embodiment, the angiogenesis 
sequences are from humans; however, as will be appreciated by those in the art, angiogenesis 
sequences from other organisms may be useful in animal models of disease and drug 
evaluation; thus, other angiogenesis sequences are provided, from vertebrates, including 

25 mammals, including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, farm animals 
(including sheep, goats, pigs, cows, horses, etc). Angiogenesis sequences from other 
organisms may be obtained using the techniques outlined below. 

Angiogenesis sequences can include both nucleic acid and amino acid 
sequences. In a preferred embodiment, the angiogenesis sequences are recombinant nucleic 

30 acids. By the term "recombinant nucleic acid" herein is meant nucleic acid, originally formed 
in vitro, in general, by the manipulation of nucleic acid e.g., using polymerases and 
endonucleases, in a form not normally found in nature. Thus an isolated nucleic acid, in a 
linear form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 
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understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, z.e. using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinantly, although subsequently replicated non-recombinantly, are still considered 
5 recombinant for the purposes of the invention. 

Similarly, a "recombinant protein" is a protein made using recombinant 
techniques, i.e. through die expression of a recombinant nucleic acid as depicted above. A 
recombinant protein is distinguished from naturally occurring protein by at least one or more 
characteristics. For example, the protein may be isolated or purified away from some or all 

10 of the proteins and compounds with which it is normally associated in its wild type host, and 
thus may be substantially pure. For example, an isolated protein is unaccompanied by at least 
some of the material with which it is normally associated in its natural state, preferably 
constituting at least about 0.5%, more preferably at least about 5% by weight of the total 
protein in a given sample. A substantially pure protein comprises at least about 75% by 

1 5 weight of the total protein, with at least about 80% being preferred, and at least about 90% 
being particularly preferred. The definition includes the production of an angiogenesis protein 
from one organism in a different organism or host cell. Alternatively, the protein may be 
made at a significantly higher Concentration than is normally seen, through the use of an 
inducible promoter or high expression promoter, such that the protein is made at increased 

20 concentration levels. Alternatively, the protein may be in a form not normally found in 
nature, as in the addition of an epitope tag or amino acid substitutions, insertions and 
deletions, as discussed below. 

In a preferred embodiment, the angiogenesis sequences are nucleic acids. As 
will be appreciated by those in the art and is more fully outlined below, angiogenesis 

25 sequences are useful in a variety of applications, including diagnostic applications, which will 
detect naturally occurring nucleic acids, as well as screening applications; for example, 
biochips comprising nucleic acid probes to the angiogenesis sequences can be generated. In 
the broadest sense, then, by "nucleic acid 5 ' or "oligonucleotide" or grammatical equivalents 
herein means at least two nucleotides covalently linked together. A nucleic acid of the 

30 present invention will generally contain phosphodiester bonds, although in some cases, 
nucleic acid analogs are included that may have alternate backbones, comprising, for 
example, phosphoramidate, phosphorothioate, phosphorodithioate, or O- 
methylphophoroamidite linkages, (see Eckstein, Oligonucleotides and Analogues: A Practical 
Approach, Oxford University Press); and peptide nucleic acid backbones and linkages. Other 
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analog nucleic acids include those with positive backbones; non-ionic backbones, and non- 
ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, 
and Chapters 6 and 7, ASC Symposium Series 580, "Caibohydrate Modifications in 
Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids containing one or 
more carbocyclic sugars are also included within one definition of nucleic acids. 
Modifications of the ribose-phosphate backbone may be done for a variety of reasons, for 
example to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. 

As will be appreciated by those in the art, nucleic acid analogs may find use in 
the present invention. In addition, mixtures of naturally occurring nucleic acids and analogs 
can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of 
naturally occurring nucleic acids and analogs may be made. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly, 
due to their non-ionic nature, hybridization of the bases attached to these backbones is 
relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 
enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded or single stranded sequence. As will be appreciated 
by those in the art, the depiction of a single strand also defines the sequence of the 
complementary strand; thus the sequences described herein also provide the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 
xanthine hypoxanthine, isocytosine, isoguanine, etc. As used herein, the term "nucleoside" 
includes nucleotides and nucleoside and nucleotide analogs, and modified nucleosides such 
as amino modified nucleosides. In addition, "nucleoside" includes non-naturally occurring 
analog structures. Thus for example the individual units of a peptide nucleic acid, each 
containing a base, are referred to herein as a nucleoside. 
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An angiogenesis sequence can be initially identified by substantial nucleic 
acid and/or amino acid sequence homology to the angiogenesis sequences outlined herein. 
Such homology can be based upon the overall nucleic acid or amino acid sequence, and is 
generally determined as outlined below, using either homology programs or hybridization 
5 conditions. 

For identifying angiogenesis-associated sequences, the angiogenesis screen 
typically includes comparing genes identified in a modification of an in vitro model of 
angiogenesis as described in Hiraoka, Cell 95:365 (1998) with genes identified in controls. 
Samples of normal tissue and tissue undergoing angiogenesis are applied to biochips 

10 comprising nucleic acid probes. The samples are first microdissected, if applicable, and 
treated as is known in the art for the preparation of mRNA. Suitable biochips are 
commercially available, for example from Affymetrix. Gene expression profiles as described 
herein are generated and the data analyzed. 

In a preferred embodiment, the genes showing changes in expression as 

15 between normal and disease states are compared to genes expressed in other normal tissues, 
including, but not limited to lung, heart, brain, liver, breast, kidney, muscle, prostate, small 
intestine, large intestine, spleen, bone and placenta. In a preferred embodiment, those genes 
identified during the angiogenesis screen that are expressed in any significant amount in other 
tissues are removed from the profile, although in some embodiments, this is not necessary. 

20 That is, when screening for drugs, it is usually preferable that the target be disease specific, to 
minimize possible side effects. 

In a preferred embodiment, angiogenesis sequences are those that are up- 
regulated in angiogenesis disorders; that is, the expression of these genes is higher in the 
disease tissue as compared to normal tissue. 'Tip-regulation" as used herein means at least 

25 about a two-fold change, preferably at least about a three fold change, with at least about 
five-fold or higher being preferred. All accession numbers herein are for the GenBank 
sequence database and the sequences of the accession numbers are hereby expressly 
incorporated by reference. GenBank is known in the art, see, Benson, DA, et al., 
Nucleic Acids Research 26: 1-7 (1998) and http://www.ncbi.nlm.nih.gov/. Sequences are also 

30 avialable in other databases, e.g., European Molecular Biology Laboratory (EMBL) and 
DNA Database of Japan (DDB J). In addition, most preferred genes were found to be 
expressed in a limited amount or not at all in heart, brain, lung, liver, breast, kidney, prostate, 
small intestine and spleen 
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In another preferred embodiment, angiogenesis sequences are those that are 
down-regulated in the angiogenesis disorder; that is, the expression of these genes is lower in 
angiogenic tissue as compared to normal tissue. "Down-regulation" as used herein means at 
least about a two-fold change, preferably at least about a three fold change, with at least about 
5 five-fold or higher being preferred. 

Angiogenesis sequences according to the invention may be classified into 
discrete clusters of sequences based on common expression profiles of the sequences. 
Expression levels of angiogenesis sequences may increase or decrease as a function of time in 
a manner that correlates with the induction of angiogenesis. Alternatively, expression levels 

10 of angiogenesis sequences may both increase and decrease as a function of time. For 
example, expression levels of some angiogenesis sequences are temporarily induced or 
diminished during the switch to the angiogenesis phenotype, followed by a return to baseline 
expression levels. Tables 1-8 provides genes, the mRNA expression of which varies as a 
function of time in angiogenesis tissue when compared to normal tissue. 

15 In a particularly preferred embodiment, angiogenesis sequences are those that 

are induced for a period of time, typically by positive angiogenic factors, followed by a return 
to the baseline levels. Sequences that are temporarily induced provide a means to target 
angiogenesis tissue, for example neovascularized tumors, at a particular stage of 
angiogenesis, while avoiding rapidly growing tissue that require perpetual vascularization. 

20 Such positive angiogenic factors include aFGF, (JFGF, VEGF, angiogenin and the like. 

Induced angiogenesis sequences also are further categorized with respect to 
the timing of induction. For example, some angiogenesis genes may be induced at an early 
time period, such as within 10 minutes of the induction of angiogenesis. Others may be 
induced later, such as between 5 and 60 minutes, while yet others may be induced for a time 

25 period of about two hours or more followed by a return to baseline expression levels. 

In another preferred embodiment are angiogenesis sequences that are inhibited 
or reduced as a function of time followed by a return to ''normal" expression levels. 
Inhibitors of angiogenesis are examples of molecules that have this expression profile. These 
sequences also can be further divided into groups depending on the timing of diminished 

30 expression. For example, some molecules may display reduced expression within 10 minutes 
of the induction of angiogenesis. Others may be diminished later, such as between 5 and 60 
minutes, while others may be diminished for a time period of about two hours or more 
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followed by a return to baseline. Examples of such negative angiogenic factors include 
thrombospondin and endostatin to name a few. 

In yet another preferred embodiment are angiogenesis sequences that are 
induced for prolonged periods. These sequences are typically associated with induction of 
5 angiogenesis and may participate in induction and/or maintenance of the angiogenesis 
phenotype. 

In another preferred embodiment are angiogenesis sequences, the expression 
of which is reduced or diminished for prolonged periods in angiogenic tissue. These 
sequences are typically angiogenesis inhibitors and their diminution is correlated with an 
10 increase in angiogenesis. 

Informatics 

The ability to identify genes that undergo changes in expression with time 
during angiogenesis can additionally provide high-resolution, high-sensitivity datasets which 

1 5 can be used in the areas of diagnostics, therapeutics, drug development, biosensor 

development, and other related areas. For example, the expression profiles can be used in 
diagnostic or prognostic evaluation of patients with angiogenesis-associated disease. Or as 
another example, subcellular toxicological information can be generated to better direct drug 
structure and activity correlation {see, Anderson, L., "Pharmaceutical Proteomics: Targets, 

20 Mechanism, and Function," paper presented at the BC Proteomics conference, Coronado, 
CA (June 1 1-12, 1998)). Subcellular toxicological information can also be utilized in a 
biological sensor device to predict the likely toxicological effect of chemical exposures and 
likely tolerable exposure thresholds (see 9 U.S. Patent No. 5,81 1,231). Similar advantages 
accrue from datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, 

25 saccharides, lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of data assay data. The data contained in the database is acquired , 
e.g., using array analysis either singly or in a library format. The database can be in 
substantially any form in which data can be maintained and transmitted, but is preferably an 

30 electronic database. The electronic database of the invention can be maintained on any 
electronic device allowing for the storage of and access to the database, such as a personal 
computer, but is preferably distributed on a wide area network, such as the World Wide Web. 
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The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases can be assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative 
5 and/or absolute abundance of a variety of molecular and macromolecular species from a 
biological sample undergoing angiogenesis, i.e., the identification of angiogenesis-associated 
sequences described herein, provide an abundance of information, which can be correlated 
with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, 
gene-disease causal linkages, identification of correlates of immunity and physiological 

10 status, among others. Although the data generated from the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is 
known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 

1 5 database system for storing biomolecular sequence information in a manner that allows 
sequences to be catalogued and searched according to one or more protein function 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 
containing information in a format that allows a collection of partial-length DNA sequences 
to be catalogued and searched according to association with one or more sequencing projects 

20 for obtaining full-length sequences from the collection of partial length sequences. U.S. 
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 

25 in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. U.S. 

30 Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 
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The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 
5 In an exemplary embodiment, at least one of the sources of target-containing 

sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for angiogenesis. In another 
variation, the assay records cross-tabulate one or more of the following parameters for each 

10 target species in a sample: (1) a unique identification code, which can include, e.g., a target 
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 
present in the sample. 

The invention also provides for the storage and retrieval of a collection of 

1 5 target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 

20 transistor gate states, such as an array of cells in a DRAM device (e.g. , each cell comprised of 
a transistor and a charge storage area, which may be on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 

25 When the target is a peptide or nucleic acid, the invention preferably provides 

a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 

30 embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM- 
compatible (DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format 
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1 

(e.g., Linux, SunOS, Solaris, ADC, SCO Unix, VMS, MV, Macintosh, etc.). floppy diskette or 
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 
5 The invention also provides a network, comprising a plurality of computing 

devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal tranmission medium, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 

1 0 cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 

15 database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 

20 initialized to load and execute the computer program for alignment and/or comparison of the 
assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to 

25 secondary memory, which is typically random access memory (e.g., DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 

30 PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem, 
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an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
5 collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer, (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 



1 0 Angiogenesis-associated sequences 

Angiogenesis proteins of the present invention may be classified as secreted 
proteins, transmembrane proteins or intracellular proteins. In one embodiment, the 
angiogenesis protein is an intracellular protein. Intracellular proteins may be found in the 
cytoplasm and/or in the nucleus or associated with the intracellular side of the plasma 

15 membrane. Intracellular proteins are involved in all aspects of cellular function and 

replication (including, e.g., signaling pathways); aberrant expression of such proteins often 
results in unregulated or disregulated cellular processes (see, e.g., Molecular Biology of the 
Cell, 3rd Edition, Alberts, Ed., Garland Pub., 1994). For example, many intracellular 
proteins have enzymatic activity such as protein kinase activity, protein phosphatase activity, 

20 protease activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular 
proteins also serve as docking proteins that are involved in organizing complexes of proteins, 
or targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 

25 in the proteins of one or more motifs for which defined functions have been attributed. In 
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 

30 domains, also bind tyrosine phosphorylated targets. SID domains bind to proline-rich 

targets. In addition, PH domains, tetratricbpeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
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sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 

In another embodiment, the angiogenesis sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. 
5 They may have an intracellular domain, an extracellular domain, or both. The intracellular 
domains of such proteins may have a number of functions including those already described 
for intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 
domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 

10 kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 

1 5 guanylyl cyclases and receptor serine/threonine protein kinases contain a single 

transmembrane domain. However, various other proteins including channels and adenylyl 
cyclases contain numerous transmembrane domains. Many important cell surface receptors 
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 

20 transmembrane domains include approximately 20 consecutive hydrophobic amino acids that 
may be followed or flanked by charged amino acids. Therefore, upon analysis of the amino 
acid sequence of a particular protein, the localization and number of transmembrane domains 
within the protein may be predicted (see, PSORT web site http://psort.nibb.ac.jp/). 

The extracellular domains of transmembrane proteins are diverse; however, 

25 conserved motifs are found repeatedly among various extracellular domains. Conserved 
structure and/or junctions have been ascribed to different extracellular motifs. Many 
extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 

30 For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions. Cell- 
associated ligands can be tethered to the cell for example via a glycosylphosphatidylinositol 
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(GPI) anchor, or may themselves be transmembrane proteins. Extracellular domains also 
associate with the extracellular matrix and contribute to the maintenance of the cell structure. 

Angiogenesis proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are 
5 described herein. In addition, as outlined below, transmembrane proteins can be also useful 
in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeablized to provide acess to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can 
10 be made soluble by removing transmembrane sequences, for example through recombinant 
methods. Furthermore, transmembrane proteins that have been made soluble can be made to 
be secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the angiogenesis proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins have a signal 
1 5 peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 

proteins are involved in numerous physiological events; by virtue of their circulating nature, 
they serve to transmit signals to various other cell types. The secreted protein may function in 
an autocrine maimer (acting on the cell that secreted the factor), a paracrine manner (acting 
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting 
20 on cells at a distance). Thus secreted molecules find use in modulating or altering numerous 
aspects of physiology. Angiogenesis proteins that are secreted proteins are particularly 
preferred in the present invention as they serve as good targets for diagnostic markers, e.g., 
for blood or serum tests. 

An angiogenesis sequence is typically initially identified by substantial nucleic 
25 acid and/or amino acid sequence homology or linkage to the angiogenesis sequences outlined 
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence, 
and is generally determined as outlined below, using either homology programs or 
hybridization conditions. Typically, linked sequences on a mRNA are found on the same 
molecule. 

30 As detailed in the definitions, percent identity can be determined using an 

algorithm such as BLAST. A preferred method utilizes the BLASTN module ofWU- 
BLAST-2 set to the default parameters, with overlap span and overlap fraction set to 1 and 
0.125, respectively. The alignment may include the introduction of gaps in the sequences to 
be aligned. In addition, for sequences which contain either more or fewer nucleotides than 



29 



WO 02/079492 PCTAJS02/04915 

those of the nucleic acids of the figures, it is understood that the percentage of homology will 
be determined based on the number of homologous nucleosides in relation to the total number 
of nucleosides. Thus, for example, homology of sequences shorter than those of the 
sequences identified herein and as discussed below, will be determined using the number of 
5 nucleosides in the shorter sequence. 

In one embodiment, the nucleic acid homology is determined through 
hybridization studies. Thus, e.g,, nucleic acids which hybridize under high stringency to a 
nucleic acid of Tables 1-8 , or its complement, or is also found on naturally occurring 
mRNAs is considered an angiogenesis sequence. In another embodiment, less stringent 

10 hybridization conditions are used; for example, moderate or low stringency conditions may 
be used, as are known in the art; see Ausubel, supra, and Tijssen, supra. 

In addition, the angiogenesis nucleic acid sequences of the invention, e.g, the 
sequence in Tables 1-8 , are fragments of larger genes, i.e. they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 

1 5 and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 
sequences provided herein, extended sequences, in either direction, of the angiogenesis genes 
can be obtained, using techniques well known in the art for cloning either longer sequences or 
the full length sequences; see Ausubel, et al, supra. Much can be done by informatics and 
many sequences can be clustered to include multiple sequences, e.g., systems such as 

20 UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/). 

Once the angiogenesis nucleic acid is identified, it can be cloned and, if 
necessary, its constituent parts recombined to form the entire angiogenesis nucleic acid 
coding regions or the entire mRNA sequence. Once isolated from its natural source, e.g., 
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid 

25 segment, the recombinant angiogenesis nucleic acid can be further-used as a probe to identify 
and isolate other angiogenesis nucleic acids, for example extended coding regions. It can 
also be used as a "precursor" nucleic acid to make modified or variant angiogenesis nucleic 
acids and proteins. 

The angiogenesis nucleic acids of the present invention are used in several 
30 ways. In a first embodiment, nucleic acid probes to the angiogenesis nucleic acids are made 
and attached to biochips to be used in screening and diagnostic methods, as outlined below, 
or for administration, for example for gene therapy, vaccine, and/or antisense applications. 
Alternatively, the angiogenesis nucleic acids that include coding regions of angiogenesis 
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proteins can be put into expression vectors for the expression of angiogenesis proteins, again 
for screening purposes or for administration to a patient 

In a preferred embodiment, nucleic acid probes to angiogenesis nucleic acids 
(both the nucleic acid sequences outlined in the figures and/or the complements thereof) are 
5 made. The nucleic acid probes attached to the biochip are designed to be substantially 
complementary to the angiogenesis nucleic acids, ue. the target sequence (either the target 
sequence of the sample or to other probe sequences, for example in sandwich assays), such 
that hybridization of the target sequence and the probes of the present invention occurs. As 
outlined below, this complementarity need not be perfect; there may be any number of base 

10 pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 

1 5 sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 

20 from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with 
25 either overlapping probes or probes to different sections of the target being used. That is, 
two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (i.e. have some sequence in common), 
or separate. In some cases, PGR primers may be used to amplify signal for higher sensitivity. 
As will be appreciated by those in the art, nucleic acids can be attached or 
30 immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
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hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 
equivalents herein is meant that the two moieties, the solid support and the probe, are 
5 attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 
10 In general, the probes are attached to the biochip in a wide variety of ways, as 

will be appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid 

15 support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acid probes and is amenable to at least one detection method. As will be appreciated by those 
in the art, the number of possible substrates are very large, and include, but are not ' limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and 

20 copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 

polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluorescese. A preferred substrate is described in copending application entitled Reusable 

25 Low Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 
1999, herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art, other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 

30 sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 
closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, for 
example, the biochip is derivatized with a chemical functional group including, but not 
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limited to, amino groups, carboxy groups, oxo groups and thiol groups, with amino groups 
being particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, for example using linkers as are known in the 
5 art; for example, homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce 
Chemical Company catalog, technical section on cross-linkers, pages 155-200, incorporated 
herein by reference). In addition, in some cases, additional linkers, such as alkyl groups 
(including substituted and heteroalkyl groups) may be used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, 
10 and then attached to the surface of the solid support. As will be appreciated by those skilled 
in the art, either the 5' or 3' terminus may be attached to the solid support, or attachment may 
be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 
1 5 bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is 
known in the art For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 
20 in WO 95/251 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of angiogenesis-associated sequences. These assays are typically performed in 
25 conjunction with reverse transcription. In such assays, an angiogenesis-associated nucleic 
acid sequence acts as a template in an amplification reaction (e.g., Polymerase Chain 
Reaction, or PCR). In a quantitative amplification, the amount of amplification product will 
be proportional to the amount of template in the original sample. Comparison to appropriate 
controls provides a measure of the amount of angiogenesis-associated RNA Methods of 
30 quantitative amplification are well known to those of skill in the art Detailed protocols for 
quantitative PCR are provided, e.g., in Innis et al (1990) PCR Protocols,' A Guide to Methods 
and Applications, Academic Press, Inc. N.Y.). 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
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dye and a 3 s quenching agent The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3' end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 
5 quenching agent, thereby resulting in an increase in fluorescence as a function of 

amplification {see, for example, literature provided by Pertrin-Elmer, e.g., www2.peridn- 
elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase 
chain reaction (LCR) (see, Wu and Wallace (1989) Genomics 4: 560, Landegren et al (1988) 
10 Science 241 : 1077, and Barringer et al (1990) Gene 89: 117), transcription amplification 

(Kwoh et al. (1989) Proc. Natl Acad Set USA 86: 1 173), self-sustained sequence replication 
(Guatelli et al (1990) Proc. Nat. Acad ScL USA 87: 1874), dot PCR, and linker adapter PCR, 
etc. 

In a preferred embodiment, angiogenesis nucleic acids, e.g., encoding 

15 angiogenesis proteins are used to make a variety of expression vectors to express 

angiogenesis proteins which can then be used in screening assays, as described below. 
Expression vectors and recombinant DNA technology are well known to those of skill in the 
art (see, e.g, Ausubel, supra, and Gene Expression Systems, Fernandez & Hoeffler, Eds, 
Academic Press, 1999) and are used to express proteins. The expression vectors may be 

20 either self-replicating extrachromosomal vectors or vectors which integrate into a host 
genome. Generally, these expression vectors include transcriptional and translational 
regulatory nucleic acid operably linked to the nucleic acid encoding the angiogenesis protein. 
The term "control sequences" refers to DNA sequences used for the expression of an 
operably linked coding sequence in a particular host organism. Control sequences that are 

25 suitable for prokaiyotes, for example, include a promoter, optionally an operator sequence, 
and a ribosome binding site. Eukaryotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 

30 secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
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and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 
do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
5 nucleic acid will generally be appropriate to the host cell used to express the angiogenesis 
protein; for example, transcriptional and translational regulatory nucleic acid sequences from 
Bacillus are preferably used to express the angiogenesis protein in Bacillus. Numerous types 
of appropriate expression vectors, and suitable regulatory sequences are known in the art for 
a variety of host cells. 

10 In general, transcriptional and translational regulatory sequences may include, 

but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 
start and stop sequences. 

15 Promoter sequences encode either constitutive or inducible promoters. The 

promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention. 

In addition, an expression vector may comprise additional elements. For 

20 example, the expression vector may have two replication systems, thus allowing it to be 

maintained in two organisms, for example in mammalian or insect cells for expression and in 
a procaryotic host for cloning and amplification. Furthermore, for integrating expression 
vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank the expression construct. 

25 The integrating vector may be directed to a specific locus in the host cell by selecting the 
appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra). See also Kitamura, et 
al. (1995) PNAS 92:9146-9150. 

In addition, in a preferred embodiment, the expression vector contains a 

30 selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The angiogenesis proteins of the present invention are produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding an 
angiogenesis protein, under the appropriate conditions to induce or cause expression of the 
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angiogenesis protein. Conditions appropriate for angiogenesis protein expression will vary 
with the choice of the expression vector and the host cell, and will be easily ascertained by 
one skilled in the art through routine experimentation or optimization. For example, the use 
of constitutive promoters in the expression vector will require optimizing the growth and 
5 proliferation of the host cell, while the use of an inducible promoter requires the appropriate 
growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculoviral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 

10 and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, CI 29 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THP1 cells (a macrophage cell line) and various other human cells and ceil lines. 

In a preferred embodiment, the angiogenesis proteins are expressed in 

1 5 mammalian cells. Mammalian expression systems are also known in the art, and include 
retroviral and adenoviral systems. Of particular use as mammalian promoters are the 
promoters from mammalian viral genes, since the viral genes are often highly expressed and 
have a broad host range. Examples include the SV40 early promoter, mouse mammary tumor 
virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the 

20 CMV promoter (see, eg., Fernandez & Hoeffler, supra). Typically, transcription termination 
. and polyadenylation sequences recognized by mammalian cells are regulatory regions located 
3' to the translation stop codon and thus, together with the promoter elements, flank the 
coding sequence. Examples of transcription terminator and polyadenlytion signals include 
those derived form S V40. 

25 The methods of introducing exogenous nucleic acid into mammalian hosts, as 

well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate precipitation, 
polybrene mediated transfection, protoplast fusion, electroporation, viral infection, 
encapsulation of the polynucleotide^) in liposomes, and direct microinjection of the DNA 

30 into nuclei. 

In a preferred embodiment, angiogenesis proteins are expressed in bacterial 
systems. Bacterial expression systems are well known in the art. Promoters from 
bacteriophage may also be used and are known in the art. In addition, synthetic promoters 
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and hybrid promoters are also useful; for example, the tac promoter is a hybrid of the tap and 
lac promoter sequences. Furthermore, a bacterial promoter can include naturally occurring 
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and 
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome 
5 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the angiogenesis protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may also include a selectable marker gene to allow for the selection of 

10 bacterial strains that have been transformed. Suitable selection genes include genes which 
render the bacteria resistant to drugs such as ampiciilin, chloramphenicol, erythromycin, 
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes, 
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These 
components are assembled into expression vectors. Expression vectors for bacteria are well 

15 known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, 
and Streptococcus lividans, among others (e.g., Fernandez & Hoeffler, supra). The bacterial 
expression vectors are transformed into bacterial host cells using techniques well known in 
the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, angiogenesis proteins are produced in insect cells. 

20 Expression vectors for the transformation of insect cells, and in particular, baculovirus-based 
expression vectors, are well known in the art. 

In a preferred embodiment, angiogenesis protein is produced in yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C maltosa, Hansenula polymorpha, 

25 Kluyveromyces fragilis and K. lactis, Pichia guillerimondii and P. pastoris, 
Schizosaccharomyces pombe, and Yarroma lipolytica. 

The angiogenesis protein may also be made as a fusion protein, using 
techniques well known in the art. Thus, for example, for the creation of monoclonal 
antibodies, if the desired epitope is small, the angiogenesis protein may be fused to a carrier 

30 protein to form an immunogen. Alternatively, the angiogenesis protein may be made as a 
fusion protein to increase expression, or for other reasons. For example, when the 
angiogenesis protein is an angiogenesis peptide, the nucleic acid encoding the peptide may be 
linked to another nucleic acid for expression purposes. Fusion with detection epitope tags 
can be made, e.g., with FLAG, His 6, myc, HA, etc. 
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In one embodiment, the angiogenesis nucleic acids, proteins and antibodies of 
the invention are labeled By "labeled" herein is meant that a compound has at least one 
element, isotope or chemical compound attached to enable the detection of the compound In 
general, labels fall into three classes: a) isotopic labels, which may be radioactive or heavy 
5 isotopes; b) immune labels, which may be antibodies, antigens, or epitope tags and c) colored 
or fluorescent dyes. The labels may be incorporated into the angiogenesis nucleic acids, 
proteins and antibodies at any position. For example, the label should be capable of 
producing, either directly or indirectly, a detectable signal. The detectable moiety may be a 
radioisotope, such as 3 H, l4 C, 32 P, 35 S, or l2 \ a fluorescent or chemiiuminescent compound, 

1 0 such as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline 
phosphatase, beta-galactosidase or horseradish peroxidase. Any method known in the art for 
conjugating the antibody to the label may be employed, including those methods described by 
Hunter et al., Nature, 144:945 (1962); David et al, Biochemistry, 13:1014 (1974); Pain et 
al., J. Immunol. Meth., 40:219 (1981); and Nygren, J. Histochem. and Cytochem., 30:407 

15 (1982). 

Accordingly, the present invention also provides angiogenesis protein 
sequences. An angiogenesis protein of the present invention may be identified in several 
ways. "Protein" in this sense includes proteins, polypeptides, and peptides. As will be 
appreciated by those in the art, the nucleic acid sequences of the invention can be used to 

20 generate protein sequences. There are a variety of ways to do this, including cloning the 
entire gene and verifying its frame and amino acid sequence, or by comparing it to known 
sequences to search for homology to provide a frame, assuming the angiogenesis protein has 
an identifiable motif or homology to some protein in the database being used. Generally, the 
nucleic acid sequences are input into a program that will search all three frames for 

25 homology. This is done in a preferred embodiment using the following NCBI Advanced 

BLAST parameters. The program is blastx or blastn. The database is nr. The input data is as 
"Sequence in FASTA format". The organism list is "none". The "expect" is 10; the filter is 
default. The "descriptions" is 500, the "alignments" is 500, and the "alignment view" is 
pairwise. The "Query Genetic Codes" is standard (1). The matrix is BLOSUM62; gap 

30 existence cost is 1 1 , per residue gap cost is 1 ; and the lambda ratio is .85 default. This 
results in the generation of a putative protein sequence. 

Also included within one embodiment of angiogenesis proteins are amino acid 
variants of the naturally occurring sequences, as determined herein- Preferably, the variants 
are preferably greater than about 75% homologous to the wild-type sequence, more 
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preferably greater than about 80%, even more preferably greater than about 85% and most 
preferably greater than 90%. In some embodiments the homology will be as high as about 93 
to 95 or 98%. As for nucleic acids, homology in this context means sequence similarity or 
identity, with identity being preferred. This homology will be determined using standard 
techniques well known in the art as are outlined above for the nucleic acid homologies. 

Angiogenesis proteins of the present invention may be shorter or longer than 
the wild type amino acid sequences. Thus, in a preferred embodiment, included within the 
definition of angiogenesis proteins are portions or fragments of the wild type sequences, 
herein. In addition, as outlined above, the angiogenesis nucleic acids of the invention may be 
used to obtain additional coding regions, and thus additional protein sequence, using 
techniques known in the art. 

In a preferred embodiment, the angiogenesis proteins are derivative or variant 
angiogenesis proteins as compared to the wild-type sequence. That is, as outlined more fully 
below, the derivative angiogenesis peptide will often contain at least one amino acid 
substitution, deletion or insertion, with amino acid substitutions being particularly preferred. 
The amino acid substitution, insertion or deletion may occur at any residue within the 
angiogenesis peptide. 

Also included within one embodiment of angiogenesis proteins of the present 
invention are amino acid sequence variants. These variants typically fail into one or more of 
three classes: substitutional, insertional or deletional variants. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the angiogenesis 
protein, using cassette or PCR mutagenesis or other techniques well known in the art, to 
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture as outlined above. However, variant angiogenesis protein fragments having up to 
about 100-150 residues may be prepared by in vitro synthesis using established techniques. 
Amino acid sequence variants are characterized by the predetermined nature of the variation, 
a feature that sets them apart from naturally occurring allelic or interspecies variation of the 
angiogenesis protein amino acid sequence. The variants typically exhibit the same qualitative 
biological activity as the naturally occurring analogue, although variants can also be selected 
which have modified characteristics as will be more fully outlined below. 

While the site or region for introducing an amino acid sequence variation is 
predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed angiogenesis variants screened for 
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the optimal combination of desired activity. Techniques for making substitution mutations at 
predetermined sites in DNA having a known sequence are well known, for example, M13 
primer mutagenesis and PCR mutagenesis. Screening of the mutants is done using assays of 
angiogenesis protein activities. 
5 Amino acid substitutions are typically of single residues; insertions usually 

will be on the order of from about 1 to 20 amino acids, although considerably larger 
insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in 
some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 

10 arrive at a final derivative. Generally these changes are done on a few arnino acids to 

minimize the alteration of the molecule. However, larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of the angiogenesis protein are 
desired, substitutions are generally made in accordance with the amino acid substitution chart 
provided in the definition section. 

1 5 Substantial changes in function or immunological identity are made by 

selecting substitutions that are less conservative than those provided in the definition of 
"conservative substitution". For example, substitutions may be made which more 
significantly affect: the structure of the polypeptide backbone in the area of the alteration, for 
example the alpha-helical or beta-sheet structure; the charge or hydrophobicity of the 

20 molecule at the target site; or the bulk of the side chain. The substitutions which in general 
are expected to produce the greatest changes in the polypeptide's properties are those in 
which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic 
residue, e.g. leucyl, isoleucyl, phenylalanyl, valyi or alanyl; (b) a cysteine or proline is 
substituted for (or by) any other residue; (c) a residue having an electropositive side chain, 

25 e.g, lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g. 
glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g. phenylalanine, is 
substituted for (or by) one not having a side chain, e.g. glycine. 

The variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occiirring analog, although variants also are 

30 selected to modify the characteristics of the angiogenesis proteins as needed. Alternatively, 
the variant may be designed such that the biological activity of the angiogenesis protein is 
altered. For example, glycosylation sites may be altered or removed. 

Covalent modifications of angiogenesis polypeptides are included within the 
scope of this invention. One type of covalent modification includes reacting targeted amino 
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acid residues of an angiogenesis polypeptide with an organic derivatizing agent that is 
capable of reacting with selected side chains or the N-or C-terminal residues of an 
angiogenesis. polypeptide. Derivatization with bifiinctional agents is useful, for instance, for 
crosslinking angiogenesis polypeptides to a water-insoluble support matrix or surface for use 
5 in the method for purifying anti-angiogenesis polypeptide antibodies or screening assays, as 
is more fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, 
esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl 
esters such as 3,3-dithiobis(succmimidylpropionate), bifiinctional maleimides such as bis-N- 

10 maleimido-l,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the y-amino groups of lysine, arginine, and histidine side chains [T.E. 

15 Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San 

Francisco, pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C- 
terminal carboxyl group. 

Another type of covaient modification of the angiogenesis polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattern 

20 of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein 
to mean deleting one or more carbohydrate moieties found in native sequence angiogenesis 
polypeptide, and/or adding one or more glycosylation sites that are not present in the native 
sequence angiogenesis polypeptide. Glycosylation patterns can be altered in many ways. For 
example the use of different cell types to express angiogenesis-associated sequences can 

25 result in different glycosylation patterns. 

Addition of glycosylation sites to angiogenesis polypeptides may also be 
accomplished by altering the amino acid sequence thereof. The alteration may be made, for 
example, by the addition of, or substitution by, one or more serine or threonine residues to the 
native sequence angiogenesis polypeptide (for O-linked glycosylation sites). The 

30 angiogenesis amino acid sequence may optionally be altered through changes at the DNA 
level, particularly by mutating the DNA encoding the angiogenesis polypeptide at preselected 
bases such that codons are generated that will translate into the desired amino acids. 
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Another means of increasing the number of carbohydrate moieties on the 
angiogenesis polypeptide is by chemical or enzymatic coupling of glycosides to the 
polypeptide. Such methods are described in the art, e.g., in WO 87/05330 published 1 1 
September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem-, pp. 259-306 (1981). 
5 Removal of carbohydrate moieties present on the angiogenesis polypeptide 

may be accomplished chemically or enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for glycosylation. Chemical 
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et aL, Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al, Anal. Biochem., 118:131 

10 (1 98 1 ). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al., Meth. 
Enzymol., 138:350 (1987). 

Another type of covalent modification of angiogenesis comprises linking the 
angiogenesis polypeptide to one of a variety of nonproteinaceous polymers, e.g., 

15 polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Angiogenesis polypeptides of the present invention may also be modified in a 
way to form chimeric molecules comprising an angiogenesis polypeptide fused to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 

20 molecule comprises a fusion of an angiogenesis polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-terminus of the angiogenesis polypeptide. The 
presence of such epitope-tagged forms of an angiogenesis polypeptide can be detected using 
an antibody against the tag polypeptide. Also, provision of the epitope tag enables the 

25 angiogenesis polypeptide to be readily purified by affinity purification using an anti-tag 
antibody or another type of affinity matrix that binds to the epitope tag. In an alternative 
embodiment, the chimeric molecule may comprise a fusion of an angiogenesis polypeptide 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of 
the chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

30 Various tag polypeptides and their respective antibodies are well known in the 

art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 [Field et 
al, Mol Cell Biol, 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto [Evan et al, Molecular and Cellular Biology, 5:3610-3616 (1985)]; 
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and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al, 
Protein Engineering, 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide 
[Hopp et al, BioTechnology, 6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et al t 
Science, 255:192-194 (1992)]; tubulin epitope peptide [Skinner et al, J. Biol Chem., 
5 266:1516345166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz-Freyennuth et al, 
Proc. Natl Acad Sci. USA, 87:6393-6397 (1990)]. 

Also included with an embodiment of angiogenesis protein are other 
angiogenesis proteins of the angiogenesis family, and angiogenesis proteins from other 
organisms, which are cloned and expressed as outlined below. Thus, probe or degenerate 

10 polymerase chain reaction (PCR) primer sequences may be used to find other related 

angiogenesis proteins from humans or other organisms. As will be appreciated by those in 
the art, particularly useful probe and/or PCR primer sequences include the unique areas of the 
angiogenesis nucleic acid sequence. As is generally known in the art, preferred PCR primers 
are from about 15 to about 35 nucleotides in length, with from about 20 to about 30 being 

15 preferred, and may contain inosine as needed. The conditions for the PCR reaction are well 
known in the art (e.g. , Innis, PCR Protocols, supra). 

In addition, as is outlined herein, angiogenesis proteins can be made that are 
longer than those encoded by the nucleic acids of the figures, e.g., by the elucidation of 
extended sequences, the addition of epitope or purification tags, the addition of other fusion 

20 sequences, etc. 

Angiogenesis proteins may also be identified as being encoded by 
angiogenesis nucleic acids. Thus, angiogenesis proteins are encoded by nucleic acids that 
will hybridize to the sequences of the sequence listings, or their complements, as outlined 
herein. 

25 In a preferred embodiment, when the angiogenesis protein is to be used to 

generate antibodies, e.g., for immunotherapy or immunodiagnosis, the angiogenesis protein 
should share at least one epitope or determinant with the full length protein. By "epitope" or 
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 

30 made to a smaller angiogenesis protein will be able to bind to the full-length protein, 
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. In a preferred 
embodiment, the epitope is selected from a protein sequence set out in Table 8. 
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Methods of preparing polyclonal antibodies are known to the skilled artisan 
(e.g., Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a 
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant. 
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple 
5 subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It 
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in 
the mammal being immunized. Examples of such immunogenic proteins include but are not 
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean 

10 trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete 
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). The immunization protocol may be selected by one skilled in the art 
without undue experimentation. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 

1 5 antibodies may be prepared using hybridoma methods, such as those described by Kohler and 
Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 
the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The 

20 immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
8 , or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood 
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 

25 glycol, to form a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, 
Academic Press, (1986) pp. 59-103], Immortalized cell lines are usually transformed 
mammalian cells, particularly myeloma cells of rodent, bovine and human origin. Usually, 
rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a 
suitable culture medium that preferably contains one or more substances that inhibit the 

30 growth or survival of the unfused, immortalized cells. For example, if the parental cells lack 
the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture 
medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine 
("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 
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In one embodiment, the antibodies are bispecific antibodies. Bispecific 
antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
5 protein encoded by a nucleic acid Tables 1-8 or a fragment thereof, the other one is for any 
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 

In a preferred embodiment, the antibodies to angiogenesis protein are capable 

10 of reducing or eliminating a biological function of an angiogenesis protein, as is described 
below. That is, the addition of anti-angiogenesis protein antibodies (either polyclonal or 
preferably monoclonal) to angiogenic tissue (or cells containing angiogenesis) may reduce or 
eliminate the angiogenesis activity. Generally, at least a 25% decrease in activity is 
preferred, with at least about 50% being particularly preferred and about a 95-100% decrease 

1 5 being especially preferred. 

In a preferred embodiment the antibodies to the angiogenesis proteins are 
humanized antibodies (e.g., Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., Protein 
Design Labsjnc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 

20 Fab, Fab 1 , F(ab')2 or other antigen-binding subsequences of antibodies) which contain 

minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
human immunoglobulins (recipient antibody) in which residues form a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 

25 affinity and capacity. In some instances, Fv framework residues of the human 

immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 

30 substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
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immunoglobulin [Jones et al., Nature, 321:522-525 (1986); Riechmann et aL, Nature, 
332:323-329 (1988); and Presta, Curr. Op. Struct. Biol, 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the ait 
Generally, a humanized antibody has one or more amino acid residues introduced into it from 
5 a source which is non-human. These non-human amino acid residues are often referred to as 
import residues, which are typically taken from an import variable domain. Humanization 
can be essentially performed following the method of Winter and co-workers [Jones et al., 
Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Veitioeyen et 
al., Science, 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the 

10 corresponding sequences of a human antibody* Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 
human variable domain has been substituted by the corresponding sequence from a non- 
human species. In practice, humanized antibodies are typically human antibodies in which 
some CDR residues and possibly some FR residues are substituted by residues from 

1 5 analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the 
art, including phage display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 
(1991); Marks et al, J. Mol. Biol., 222:581 (1991)]. The techniques of Cole et al. and 
Boerner et al. are also available for the preparation of human monoclonal antibodies (Cole et 

20 al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) and Boerner et 
al, J. Immunol., l47(l):86-95 (1991)]. Similarly, human antibodies can be made by 
introducing of human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 

25 humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 
This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 
5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following scientific publications: 
Marks et al., Bio/Technology 10, 779-783 (1992); Lonberg et al., Nature 368 856-859 (1994); 
Morrison, Nature 368, 812-13 (1994); Fishwild et al., Nature Biotechnology 14, 845-51 

30 (1996); Neuberger, Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, Intern. Rev. 
Immunol. 13 65-93 (1995). 

By immunotherapy is meant treatment of angiogenesis with an antibody raised 
against angiogenesis proteins. As used herein, immunotherapy can be passive or active. 
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 
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(patient). Active immunization is the induction of antibody and/or T-cell responses in a 
recipient (patient). Induction of an immune response is the result of providing the recipient 
with an antigen to which antibodies are raised. As appreciated by one of ordinaiy skill in the 
art, the antigen may be provided by injecting a polypeptide against which antibodies are 
5 desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 
expressing the antigen and under conditions for expression of the antigen, leading to an 
immune response. 

In a preferred embodiment the angiogenesis proteins against which antibodies 
are raised are secreted proteins as described above. Without being bound by theory, 

10 antibodies used for treatment, bind and prevent the secreted protein from binding to its 
receptor, thereby inactivating the secreted angiogenesis protein. 

In another preferred embodiment, the angiogenesis protein to which antibodies 
are raised is a transmembrane protein. Without being bound by theory, antibodies used for 
treatment, bind the extracellular domain of the angiogenesis protein and prevent it from 

15 binding to other proteins, such as circulating ligands or cell-associated molecules. The 

antibody may cause down-regulation of the transmembrane angiogenesis protein. As will be 
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
angiogenesis protein. The antibody is also an antagonist of the angiogenesis protein. 

20 Further, the antibody prevents activation of the transmembrane angiogenesis protein. In one 
aspect, when the antibody prevents the binding of other molecules to the angiogenesis 
protein, the antibody prevents growth of the cell. The antibody may also be used to target or 
sensitize the cell to cytotoxic agents, including, but not limited to TNF-a, TNF-p, EL-1, INF-y 
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 

25 methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, angiogenesis is 
treated by administering to a patient antibodies directed against the transmembrane 
angiogenesis protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 

30 otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated or fused to an 
effector moiety. The effector moiety can be any number of molecules, including labelling 
moieties such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In 
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one aspect the therapeutic moiety is a small molecule that modulates the activity of the 
angiogenesis protein. In another aspect the therapeutic moiety modulates the activity of 
molecules associated with or in close proximity to the angiogenesis protein. The therapeutic 
moiety may inhibit enzymatic activity such as protease or collagenase activity associated with 
5 angiogenesis, or be an attractant of other cells, such as NK cells. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent. In this method, targeting the cytotoxic agent to angiogenesis tissue or cells, results in a 
reduction in the number of afflicted cells, thereby reducing symptoms associated with 
angiogenesis. Cytotoxic agents are numerous and varied and include, but are not limited to, 

10 cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 

corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include 
radiochemicals made by conjugating radioisotopes to antibodies raised against angiogenesis 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 

15 the antibody. Targeting the therapeutic moiety to transmembrane angiogenesis proteins not 
only serves to increase the local concentration of therapeutic moiety in the angiogenesis 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
the therapeutic moiety. 

In another preferred embodiment, the angiogenesis protein against which the 

20 antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
or fused to a protein which facilitates entry into the cell. In one case, the antibody enters the 
cell by endocytosis. In another embodiment, a nucleic acid encoding the antibody is 
administered to the individual or cell. Moreover, wherein the angiogenesis protein can be 
targeted within a cell, i.e., the nucleus, an antibody thereto contains a signal for that target 

25 localization, i.e., a nuclear localization signal. 

The angiogenesis antibodies of the invention specifically bind to angiogenesis 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
K<j of at least about 0. 1 mM, more usually at least about 1 jiM, preferably at least about 0. 1 
pM or better, and most preferably, 0.01 \M or better. Selectivity of binding is also 

30 important. 

In a preferred embodiment, the angiogenesis protein is purified or isolated 
after expression. Angiogenesis proteins may be isolated or purified in a variety of ways 
known to those skilled in the art depending on what other components are present in the 
sample. Standard purification methods include electrophoretic, molecular, immunological 
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and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromatofocusing. For example, the angiogenesis protein 
may be purified using a standard anti-angiogenesis protein antibody column. Ultrafiltration 
and diafiltration techniques, in conjunction with protein concentration, are also useful. For 
5 general guidance in suitable purification techniques, see Scopes, IL, Protein Purification, 
Springer- Verlag, NY (1982). The degree of purification necessary will vary depending on 
the use of the angiogenesis protein. In some instances no purification will be necessary. 

Once expressed and purified if necessary, the angiogenesis proteins and 
nucleic acids are useful in a number of applications. They may be used as immunoselection 
10 reagents, as vaccine reagents, as screening agents, etc. 

Detection of angiogenesis sequence for diagnostic and therapeutic applications 

In one aspect, the RNAexpression levels of genes are determined for different 
cellular states in the angiogenesis phenotype. Expression levels of genes in normal tissue 

15 (i'.e., not undergoing angiogenesis) and in angiogenesis tissue (and in some cases, for varying 
severities of angiogenesis that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 
essentially a "fingerprint" of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 

20 generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 
obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or angiogenesic tissue. This will provide 

25 for molecular diagnosis of related conditions. 

'Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g., 

30 normal versus angiogenic tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more statese. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
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expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
5 GeneChip™ expression arrays, Lockhart, Nature Biotechnology, 14: 1675-1680 (1996), 
hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, Northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 
about 50%, more preferably at least about 100%, more preferably at least about 150%, more 

10 preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 
final gene product itself (protein) can be monitored, e.g., with antibodies to the angiogenesis 

1 5 protein and standard immunoassays (ELIS As, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to 
angiogenesis genes, Le., those identified as being important in an angiogenesis phenotype, 
can be evaluated in an angiogenesis diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed 

20 simultaneously on a number of genes. Multiple protein expression monitoring can be 

performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the angiogenesis nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of angiogenesis sequences in a 
particular cell. The assays are further described below in the example. PCR techniques can 

25 be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the angiogenesis protein are 
detected. Although DNA or RNA encoding the angiogenesis protein may be detected, of 
particular interest are methods wherein an mRNA encoding an angiogenesis protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 

30. complementary to and hybridizes with the mRNA and includes, but is not limited to, 

oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
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detected In another method detection of the mRNA is performed in situ. In this method 
penneabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 
washing to remove the non-specifically bound probe, the label is detected. For example a 
5 digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding 
an angiogenesis protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins 
1 0 as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The angiogenesis proteins, antibodies, nucleic acids, modified proteins and cells 
containing angiogenesis sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
1 5 techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, angiogenesis proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of angiogenesis. 
Detection of these proteins in putative angiogenesis tissue allows for detection or diagnosis of 

20 angiogenesis. In one embodiment, antibodies are used to detect angiogenesis proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the angiogenesis protein is 
detected, e.g., by immunoblotting with antibodies raised against the angiogenesis protein. 

25 Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the angiogenesis protein find use in 
in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 
to many antibodies to the angiogenesis protein(s). Following washing to remove non-specific 

30 antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 
the antibody is detected by incubating with a secondary antibody that contains a detectable 
label. In another method the primary antibody to the angiogenesis protein(s) contains a 
detectable label, for example an enzyme marker that can act on a substrate. In another 
preferred embodiment each one of multiple primary antibodies contains a distinct and 
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detectable label. This method finds particular use in simultaneous screening for a plurality of 
angiogenesis proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are alsoprovided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
5 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing 
angiogenesis from biological samples, such as blood, urine, sputum, or other bodily fluids. 
As previously described, certain angiogenesis proteins are secreted/circulating molecules. 

10 Blood samples, therefore, are useful as samples to be probed or tested for the presence of 
secreted angiogenesis proteins. Antibodies can be used to detect an angiogenesis protein by 
previously described immunoassay techniques including ELISA, immunoblotting (Western 
blotting), irnmunoprecipitation, BIACORE technology and the like. Conversely, the presence 
of antibodies may indicate an immune response against an endogenous angiogenesis protein. 

15 In a preferred embodiment, in situ hybridization of labeled angiogenesis 

nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
angiogenesis tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 

20 findings. It is further understood that the genes which indicate the diagnosis may differ from 
those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic 
25 acids, modified proteins and cells containing angiogenesis sequences are used in prognosis 
assays. As above, gene expression profiles can be generated that correlate to angiogenesis 
severity, in terms of long term prognosis. Again, this may be done on either a protein or gene 
level, with the use of genes being preferred. As above, angiogenesis probes may be attached 
to biochips for the detection and quantification of angiogenesis sequences in a tissue or 
30 patient. The assays proceed as outlined above for diagnosis. PCR method may provide more 
sensitive and accurate quantification. 

In a preferred embodiment members of the three classes of proteins as 
described herein are used in drug screening assays. The angiogenesis proteins, antibodies, 
nucleic acids, modified proteins and cells containing angiogenesis sequences are used in drug 
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screening assays or by evaluating the effect of drug candidates on a "gene expression profile" 
or expression profile of polypeptides. In a preferred embodiment, the expression profiles are 
used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent (e.g., 
5 Zlokarnik, et al., Science 279, 84-8 (1998); Heid, Genome Res 6:986-94, 1996). 

In a preferred embodiment, the angiogenesis proteins, antibodies', nucleic 
acids, modified proteins and cells containing the native or modified angiogenesis proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the angiogenesis phenotype or an identified physiological 

10 function of an angiogenesis protein. As above, this can be done on an individual gene level 
or by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra. 

15 Having identified the differentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level. That is, having identified a particular gene as up regulated in angiogenesis, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
angiogenic protein. 'Modulation" thus includes both an increase and a decrease in gene 

20 expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing angiogenesis, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4- fold increase in angiogenic tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in 

25 angiogenic tissue compared to normal tissue often provides a target value of a 1 0-fold 
increase in expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the angiogenesis protein and standard 

30 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entitites, i e. , an expression profile, is monitored simultaneously. Such profiles will 
typically invove a plurality of those entitites described herein.. 
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In this embodiment, the angiogenesis nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of angiogenesis sequences in a 
particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microliter plate, may 
be used with dispensed primers in desired wells. A PCR reaction can then be performed and 
5 analyzed for each well. 

Modulators of angiogenesis 

Expression monitoring can be performed to identify compounds that modify 
the expression of one or more angiogenesis-associated sequences, e.g., a polynucleotide 
10 sequence set out in Tables 1-8 . Generally, in a preferred embodiment, a test modulator is 
added to the cells prior to analysis. Moreover, screens are also provided to identify agents 
that modulate angiogenesis, modulate angiogenesis proteins, bind to an angiogenesis protein, 
or interfere with the binding of an angiogenesis protein and an antibody or other binding 
partner. 

15 The term "test compound" or "drug candidate" or "modulator" or grammatical 

equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the angiogenesis phenotype or the expression of an angiogenesis sequence, 
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter 

20 expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses an angiogenesis phenotype, for example to a normal 
tissue fingerprint. In another embodiment, a modulator induced an angiogenesis phenotype. 
Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 

25 concentrations serves as a negative control, le., at zero concentration or below the level of 
detection. 

In one aspect, a modulator will neutralize the effect of an angiogenesis protein. 
By Neutralize" is meant that activity of a protein is inhibited or blocked and thereby has 
substantially no effect on a cell. 
30 In certain embodiments, combinatorial libraries of potential modulators will be 

screened for an ability to bind to an angiogenesis polypeptide or to modulate activity. 
Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
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and activity of those variant compounds. . Often, high throughput screening (HTS) methods 
are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
5 compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 
A combinatorial chemical library is a collection of diverse chemical 

1 0 compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical '*building blocks" such as reagents. For example, a linear combinatorial 
chemical library, such as a polypeptide (e.g. t mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 

1 5 compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop et al. (1994) Med. Chem. 37(9): 1233-1251). 

Preparation and screening of combinatorial chemical libraries is well known to 
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Int. J. Pept. Prot. Res., 

20 37: 487-493, Houghton et al (1991) Nature, 354: 84-88), peptoids (PCT Publication No WO 
91/19735, 26 Dec. 1991), encoded peptides (PCT Publication WO 93/20242, 14 Oct. 1993), 
random bio-oligomers (PCT Publication WO 92/00091, 9 Jan. 1992), benzodiazepines (U.S. 
Pat. No. 5,288,514), diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs 
et al, (1993) Proa Nat Acad. Sci. USA 90: 6909-6913), vinylogous polypeptides (Hagihara 

25 et al. (1992) /. Amer. Chem. Soc. 1 14: 6568), nonpeptidal peptidomimetics with a Beta-D- 
Glucose scaffolding (Hirschmann et al., (1992) /. Amer. Chem. Soc. 1 14: 9217-9218), 
analogous organic syntheses of small compound libraries (Chen et al. (1994) J, Amer. Chem. 
Soc. 116: 2661), oligocarbamates (Cho, et al., (1993) Science 261:1303), and/or peptidyl 
phosphonates (Campbell et al., (1994) J. Org. Chem. 59: 658). See, generally, Gordon et al, 

30 (1994) J. Med. Chem. 37:1385, nucleic acid libraries (see, e.g., Strategene, Coip.), peptide 
nucleic acid libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn 
et al. (1996) Nature Biotechnology, 14(3): 309-314), and PCT/US96/10287), caibohydrate 
libraries (see, e.g., Liang et al, (1996) Science, 274: 1520-1522, and U.S. Patent No. 
5,593,853), and small organic molecule libraries (see, e.g., benzodiazepines, Baum (1993) 
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C&EN, Jan 18, page 33; isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and 
metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 
5,519,134; morpholino compounds, U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent 
No. 5,288,514; and the like). 
5 Devices for the preparation of combinatorial libraries are commercially 

available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for 
10 solution phase chemistries. These systems include automated workstations like the 

automated synthesis apparatus developed by Takeda Chemical Industries, LTD, (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual 
synthetic operations performed by a chemist. Any of the above devices are suitable for use 
1 5 with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 
MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
20 Columbia, MD,etc). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of angiogenesis gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

25 High throughput assays for the presence, absence, quantification, or other 

properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 
for example, U.S. Patent No. 5,559,410 discloses high throughput screening methods for 
proteins, U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic 

30 acid binding (Le., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for hgand/antibody binding. 

In addition, high throughput screening systems are commercially available 
(see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
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typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detectors) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
as a high degree of flexibility and customization. The manufacturers of such systems provide 
5 detailed protocols for various high throughput systems. Thus, for example, Zymark Corp. 
provides technical bulletins describing screening systems for detecting the modulation of 
gene transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
proteins or fragments of naturally occurring proteins. Thus, eg., cellular extracts containing 

1 0 proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 
this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
preferred. Paticularly useful test compound will be directed to the class of proteins to which 

15 the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or biased" random peptides. By 

20 "randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 
may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 

25 most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence 
preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 

30 limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, for example, of hydrophobic 
amino acids, hydrophilic residues, sterically biased (either small or large) residues, towards 
the creation of nucleic acid binding domains, the creation of cysteines, for cross-linking, 
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prolines for SH-3 domains, serines, threonines, tyrosines or histidines for phosphorylation 
sites, etc., or to purines, etc. 

Modulators of angiogenesis can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
5 be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 

1 0 After the candidate agent has been added and the cells allowed to incubate for 

some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
with purification and/or amplification such as PCR performed as appropriate. For example, 

15 an in vitro transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, for example, a 
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of 
detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 

20 such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 

appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the 

25 streptavidin is labeled as described above, thereby, providing a detectable signal for the 

bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 

30 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 
5,594,1 18, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 
conditions that allow the formation of a hybridization complex. 
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A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the label probe 
hybridization complex only in the presence of target Stringency can be controlled by 
5 altering a step parameter that is a thermodynamic variable, including, but not limited to, 
temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
10 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
15 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
20 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the angiogenesis phenotype. 
In one embodiment, screening is performed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, e.g., for diagnostic applications, having identified differentially 
25 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
30 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent After identifying a modulator based upon its ability to suppress an 
angiogenesis expression pattern leading to a normal expression pattern, or to modulate a 
single angiogenesis gene expression profile so as to mimic the expression of the gene from 
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normal tissue, a screen as described above can be performed to identity genes that are 
specifically modulated in response to the agent Comparing expression profiles between 
normal tissue and agent treated angiogenesis tissue reveals genes that are not expressed in 
normal tissue or angiogenesis tissue, but are expressed in agent treated tissue. These agent- 
5 specific sequences can be identified and used by methods described herein for angiogenesis 
genes or proteins. In particular these sequences and the proteins they encode find use in 
marking or identifying agent treated cells. In addition, antibodies can be raised against the 
agent induced proteins and used to target novel therapeutics to the treated angiogenesis tissue 
sample. 

10 Thus, in one embodiment, a test compound is administered to a population of 

angiogenic cells, that have an associated angiogenesis expression profile. By 
"administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 

15 encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 
the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems 
can also be used. 

Once the test compound has been administered to the cells, the cells can be 
20 washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, for example, angiogenesis tissue may be screened for agents that 
modulate, e.g., induce or suppress the angiogenesis phenotype. A change in at least one 
25 gene, preferably many, of the expression profile indicates that the agent has an effect on 
angiogenesis activity. By defining such a signature for the angiogenesis phenotype, screens 
for new drugs that alter the phenotype can be devised. With this approach, the drug target 
need not be known and need not be represented in the original expression screening platform, 
nor does the level of transcript for the target protein need to change. 
30 Measure of angiogenesis polypeptide activity, or of angiogenesis or the 

angiogenic phenotype can be performed using a variety of assays. For example, the effects of 
the test compounds upon the function of the anagiogenesis polypeptides can be measured by 
examining parameters described above. A suitable physiological change that affects activity 
can be used to assess the influence of a test compound on the polypeptides of this invention. 
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When the functional consequences are determined using intact cells or animals, one can also 
measure a variety of effects such as, in the case of angiogenesis associated with tumors, 
tumor growth, neovascularization, hormone release, transcriptional changes to both known 
and uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as 
5 cell growth or pH changes, and changes in intracellular second messengers such as cGMP. In 
the assays of the invention, mammalian angiogenesis polypeptide is typically used, e.g., 
mouse, preferably human. 

A variety of angiogenesis assays are known to those of skill in the art. Various 
models have been employed to evaluate angiogenesis (eg., Croix et ai, Science 289:1 197- 

10 1202,2000 wdKahri etal.,Amer. J.Pathol. 156:1887-1900). Assessement of angiogenesis 
in the presence of a potential modulator of angiogenesis can be performed using cell-cultre- 
based angiogenesis assays, e.g., endothelial cell tube formation assays, as well as other 
bioassays such as the chick CAM assay, the mouse corneal assay, and assays measuring the 
effect of administering potential modulators on implanted tumors. The chick CAM assay is 

15 described by O'Reilly, et al Cell 79: 315-328, 1994. Briefly, 3 day old chicken embryos with 
intact yolks are separated from the egg and placed in a petri dish. After 3 days of incubation, 
a methylcellulose disc containing the protein to be tested is applied to the CAM of individual 
embryos. After about 48 hours of incubation, the embryos and CAMs are observed to 
determine whether endothelial growth has been inhibited. The mouse corneal assay involves 

20 implanting a growth factor-containing pellet, along with another pellet containing the 

suspected endothelial growth inhibitor, in the cornea of a mouse and observing the pattern of 
capillaries that are elaborated in the cornea. Angiogenesis can also be measured by 
determining the extent of neovascularization of a tumor. For example, carcinoma cells can be 
subcutaneously inoculated into athymic nude mice and tumor growth then monitored. The 

25 cancer cells are treated with an angiogenesis inhibitor, such as an antibody, or other 

compound that is exogenbusly administered, or can be transfected prior to inoculation with a 
polynucleotide inhibitor of angiogenesis. Immunoassays using endothelial cell-specific 
antibodies are typically used to stain for vascularization of tumor and the number of vessels 
in the tumor. 

30 Assays to identify compounds with modulating activity can be performed in 

vitro. For example, an angiogenesis polypeptide is first contacted with a potential modulator 
and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the angiogenesis polypeptide levels are determined in vitro by measuring the level of protein 
or mRNA. The level of protein is measured using immunoassays such as western blotting, 
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ELIS A and the like with an antibody that selectively binds to the angiogenesis polypeptide or 
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 
hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
5 detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the angiogenesis 
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or p-gal. The reporter construct is typically transfected into a cell. After 

10 treatment with a potential modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
differentially expressed gene as important in a particular state, screening of modulators of the 

1 5 expression of the gene or the gene product itself can be done. The gene products of 

differentially expressed genes are sometimes referred to herein as "angiogenesis proteins". In 
preferred embodiments the angiogenesis protein comprises a sequence shown in Table 8. 
The angiogenesis protein may be a fragment, or alternatively, be the full length protein to a 
fragment shown herein. 

20 Preferably, the angiogenesis protein is a fragment of approximately 14 to 24 

amino acids long. More preferably the fragment is a soluble fragment. In one embodiment 
an angiogenesis protein is conjugated or fused to an immunogenic agent or BS A, 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 

25 embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate strucutre activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 

30 isolated gene product is used; that is, the gene products of one or more differentially 

expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the angiogenesis proteins can be used in the assays. 
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Thus, in a preferred embodiment, the methods comprise combining an 
angiogenesis protein and a candidate compound, and determining the binding of the 
compound to the angiogenesis protein. Preferred embodiments utilize the human 
angiogenesis protein, although other mammalian proteins may also be used, for example for 
5 the development of animal models of human disease. In some embodiments, as outlined 
herein, variant or derivative angiogenesis proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the angiogenesis 
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microliter plate, an array, etc.). The insoluble supports may be 

10 made of any composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 

15 teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 
with the reagents and overall methods of the invention, maintains the activity of the 
composition and is nondifiusable. Preferred methods of binding include the use of antibodies 

20 (which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky*' or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 
the protein or agent, excess unbound material is removed by washing. The sample receiving 
areas may then be blocked through incubation with bovine serum albumin (BS A), casein or 

25 other innocuous protein or other moiety. 

In a preferred embodiment, the angiogenesis protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 
support and the angiogenesis protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 

30 analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
protein binding, functional assays (phosphorylation assays, etc.) and the like. 
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The determination of the binding of the test modulating compound to the 
angiogenesis protein may be done in a number of ways. In a preferred embodiment, the 
compound is labelled, and binding determined directly, e.g> by attaching all or a portion of 
the angiogenesis protein to a solid support, adding a labelled candidate agent (e.g., a 
5 fluorescent label), washing off excess reagent, and deterniining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

By "labeled" herein is meant that the compound is either directly or indirectly 
labeled with a label which provides a detectable signal, e.g. radioisotope, fluoresces, 
enzyme, antibodies, particles such as magnetic particles, chemiluminescers, or specific 

10 binding molecules, etc. Specific binding molecules include pairs, such as biotin and 
streptavidin, digoxin and antidigoxin, etc. For the specific binding members, the 
complementary member would normally be labeled with a molecule which provides for 
detection, in accordance with known procedures, as outlined above. The label can directly or 
indirectly provide a detectable signal. 

15 In some embodiments, only one of the components is labeled, e.g., the 

proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., ns l for the proteinsand a fluorophor 
for the compound. Proximity reagents, e.g. , quenching or energy transfer reagents are also 
useful. 

20 In one embodiment, the binding of the test compound is determined by 

competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e. an angiogenesis protein), such as an antibody, peptide, binding partner, ligand, 
etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound. In one 

25 embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 
40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput 
screening. Typically between 0. 1 and 1 hour will be sufficient. Excess reagent is generally 

30 removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
compound. Displacement of the competitor is an indication that the test compound is bindinjg 
to the angiogenesis protein and thus is capable of binding to, and potentially modulating, the 



64 



WO 02/079492 PCT/US02/04915 

activity of the angiogenesis protein. In this embodiment, either component can be labeled. 
Thus, for example, if the competitor is labeled, the presence of label in the wash solution 
indicates displacement by the agent. Alternatively, if the test compound is labeled, the 
presence of the label on the support indicates displacement. 
5 In an alternative embodiment, the test compound is added first, with 

incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the angiogenesis protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 
support, coupled with a lack of competitor binding, may indicate that the test compound is 

1 0 capable of binding to the angiogenesis protein. 

In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating the activitity of the angiogenesis proteins. In 
this embodiment, the methods comprise combining an angiogenesis protein and a competitor 
in a first sample. A second sample comprises a test compound, an angiogenesis protein, and 

15 a competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the angiogenesis protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 
agent is capable of binding to the angiogenesis protein. 

20 Alternatively, differential screening is used to identify drug candidates that 

bind to the native angiogenesis protein, but cannot bind to modified angiogenesis proteins. 
The structure of the angiogenesis protein may be modeled, and used in rational drug design to 
synthesize agents that interact with that site. Drug candidates that affect the activity of an 
angiogenesis protein are also identified by screening drugs for the ability to either enhance or 

25 reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 
control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 

30 and the amount of bound, generally labeled agent determined. For example, where a 

radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of bound compound 

A variety of other reagents may be included in the screening assays. These 
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc, which may be used 
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to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order mat provides for the requisite binding. 
5 In a preferred embodiment, the invention provides methods for screening for a 

compound capable of modulating the activity of an angiogenesis protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising angiogenesis 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes an angiogenesis protein. In a preferred embodiment, a library of 

1 0 candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, for example hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 

1 5 example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate angiogenesis agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the angiogenesis protein. Once identified, similar structures are evaluated to identify critical 
structural feature of the compound. 

20 In one embodiment, a method of inhibiting angiogenic cell division is 

provided. The method comprises administration of an angiogenesis inhibitor. In another 
embodiment, a method of inhibiting angiogenesis is provided. The method comprises 
adrninistration of an angiogenesis inhibitor. In a further embodiment, methods of treating 
cells or individuals with angiogenesis are provided. The method comprises administration of 

25 an angiogenesis inhibitor. 

In one embodiment, an angiogenesis inhibitor is an antibody as discussed 
above. In another embodiment, the angiogenesis inhibitor is an antisense molecule. 

Polynucleotide modulators of angiogenesis 
30 Antisense Polynucleotides 

In certain embodiments, the activity of an angiogenesis-associated protein is 
downregulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding rnRNA 
nucleic acid sequence, e.g., an angiogenesis protein mRNA, or a subsequence thereof. 
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Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturaliy-occurring 
5 subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the angiogenesis protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

1 0 Such antisense polynucleotides can readily be synthesized using recombinant 

means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art 
Antisense molecules as used herein include antisense or sense 

i 5 oligonucleotides. Sense oligonucleotides can, e.g. , be employed to block trancription by 
binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for angiogenesis molecules. A preferred antisense 
molecule is for an angiogenesis sequences in Tables 1-8 , or for a ligand or activator thereof. 

20 Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 
generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, for example, Stein and Cohen (Cancer Res. 48:2659, 
1988) and van der Krol et al. (BioTechniques 6:958, 1988). 

25 

Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of angiogenesis-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 
30 been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 
RNase P, and axhead ribozymes (see, e.g., Castanotto et al (1994) Adv. in Pharmacology 25: 
289-3 1 7 for a general review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al. 
(1990) Nucl Acids Res. IS: 299-304; Hampel et al. (1990) European Patent Publication No. 0 
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360 257; U.S. Patent No. 5,254,678. Methods of preparing are well known to those of skill in 
the art (see, e.g., Wong-Staal et a/., WO 94/26877; Ojwang et al. (1993) Proc. Natl Acad. 
Sci. USA 90: 6340-6344; Yamada et al (1994) Human Gene Therapy 1: 39-45; Leavitt et al 
(1995) Proc. Natl Acad. ScL USA 92: 699-703; Leavitt et al (1994) Human Gene Therapy 5: 
5 1151-120; and Yamada et al (1994) Virology 205: 121-126). 

Polynucleotide modulators of angiogenesis may be introduced into a cell 
containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 

10 bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of 
angiogenesis may be introduced into a cell containing the target nucleic acid sequence, e.g. 9 

15 by formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 

understood that the use of antisense molecules or knock out and knock in models may also be 
used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating angiogenesis in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 

20 anti-angiogenesis antibody that reduces or eliminates the biological activity of an 

endogeneous angiogenesis protein. Alternatively, the methods comprise, administering to a 
cell or organism a recombinant nucleic acid encoding an angiogenesis protein. This may be 
accomplished in any number of ways. In a preferred embodiment, for example when the 
angiogenesis sequence is down-regulated in angiogenesis, such state may be reversed by 

25 increasing the amount of angiogenesis gene product in the cell. This can be accomplished, 
e.g., by overexpressing the endogeneous angiogenesis gene or administering a gene encoding 
the angiogenesis sequence, using known gene-therapy techniques, for example. In a 
preferred embodiment, the gene therapy techniques include the incorporation of the 
exogenous gene using enhanced homologous recombination (EHR), for example as described 

30 in PCT/US93/03868, hereby incorporated by reference in its entireity. Alternatively, for 
example when the angiogenesis sequence is up-regulated in angiogenesis, the activity of the 
endogeneous angiogenesis gene is decreased, for example by the administration of a 
angiogenesis antisense nucleic acid or other inhibitor, such as RNAi. 
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In one embodiment, the angiogenesis eproteins of the present invention may 
be used to generate polyclonal and monoclonal antibodies to angiogenesis proteins. 
Similarly, the angiogenesis proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify angiogenesis 
5 antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 

embodiment, the antibodies are generated to epitopes unique to a angiogenesis protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The angiogenesis 
antibodies may be coupled to standard affinity chromatography columns and used to purify 
angiogenesis proteins. The antibodies may also be used as blocking polypeptides, as outlined 
10 above, since they will specifically bind to the angiogenesis protein. 

Methods of identifying variant angiogenesis-associated sequences 

Without being bound by theory, expression of various angiogenesis sequences 
is correlated with angiogenesis. Accordingly, disorders based on mutant or variant 

15 angiogenesis genes may be determined. In one embodiment, the invention provides methods 
for identifying cells containing variant angiogenesis genes, e.g., determining all or part of the 
sequence of at least one endogeneous angiogenesis genes in a cell. This may be 
accomplished using any number of sequencing techniques. In a preferred embodiment, the 
invention provides methods of identifying the angiogenesis genotype of an individual, e.g, 

20 determining all or part of the sequence of at least one angiogenesis gene of the individual. 
This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced angiogenesis gene to a known angiogenesis gene, 
i.e., a wild-type gene. 

25 The sequence of all or part of the angiogenesis gene can then be compared to 

the sequence of a known angiogenesis gene to determine if any differences exist. This can be 
done using any number of known homology programs, such as Bestfit, etc. In a preferred 
embodiment, the presence of a a difference in the sequence between the angiogenesis gene of 
the patient and the known angiogenesis gene correlates with a disease state or a propensity 

30 for a disease state, as outlined herein. 

In a preferred embodiment, the angiogenesis genes are used as probes to 
determine the number of copies of the angiogenesis gene in the genome. 

In another preferred embodiment, the angiogenesis genes are used as probes to 
determine the chromosomal localization of the angiogenesis genes. Information such as 
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chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the 
angiogenesis gene locus. 

5 Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of an angiogenesis protein 
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 

1 0 known techniques (e.g. , Ansel et al. , Pharmaceuitcal Dosage Forms and Drug Delivery, 
Lippincott, Williams & Wilkins Publishers, ISBN:0683305727; Lieberman (1992) 
Pharmaceutical Dosage Forms (vols. 1-3), Dekker, ISBN 0824770846, 0824769 18X, 
0824712692, 0824716981; Lloyd (1999) The Art, Science and Technology of Pharmaceutical 
Compounding, Amer. Pharmacutical Assn, ISBN 0917330889; and Pickar (1999) Dosage 

15 Calculations, Delmar Pub, ISBN 0766805042). As is known in the art, adjustments for 
angiogenesis degradation, systemic versus localized delivery, and rate of new protease 
synthesis, as well as the age, body weight, general health, sex, diet, time of administration, 
drug interaction and the severity of the condition may be necessary, and will be ascertainable 
with routine experimentation by those skilled in the art. 

20 A "patient" for the purposes of the present invention includes both humans and 

other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
preferably a primate, and in the most preferred embodiment the patient is human. 

The administration of the angiogenesis proteins and modulators thereof of the 

25 present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
some instances, for example, in the treatment of wounds and inflammation, the angiogenesis 
proteins and modulators may be directly applied as a solution or spray. 

30 The pharmaceutical compositions of the present invention comprise an 

angiogenesis protein in a form suitable for adininistration to a patient. In the preferred 
embodiment, the pharmaceutical compositions are in a water soluble form, such as being 
present as pharmaceutically acceptable salts, which is meant to include both acid and base 
addition salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain 
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the biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
5 acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 

methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 

10 potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diemylarnine, triethylamine, 
tripropylamine, and ethanolamine. 

1 5 The pharmaceutical compositions may also include one or more of the 

following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol. 

The pharmaceutical compositions can be administered in a variety of unit 

20 dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that angiogenesis protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 

25 molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art 

The compositions for administration will commonly comprise an angiogenesis 
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous 

30 carrier, A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
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and the like, for example, sodium acetate, sodium chloride, potassium chloride, calcium 
chloride, sodium lactate and the like. The concentration of active agent in these formulations 
can vary widely, and will be selected primarily based on fluid volumes, viscosities, body 
weight and the like in accordance with the particular mode of administration selected and the 
5 patient's needs (e.g. , Remington s Pharmaceutical Science, 1 5th ed., Mack Publishing 
Company, Easton, Pennsylvania (1980) and Goodman and Gillman, The Pharmacologial 
Basis ofTherapeutics,(Hardman, J.G, Limbird, L.E, Molinoff; P.B., Ruddon, R.W, and 
Gilman, A.G.,eds) TheMcGraw-Hill Companies, Inc., 1996). 

Thus, a typical pharmaceutical composition for intravenous administration 

10 would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 
patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 

15 in the art, e.g., Remington 's Pharmaceutical Science and Goodman and Gillman, The 
Pharmacologial Basis of Therapeutics, supra. 

The compositions containing modulators of angiogenesis proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 

20 amount sufficient to cure or at least partially arrest the disease and its complications. An 

amount adequate to accomplish this is defined as a 'therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any event, 

25 the composition should provide a sufficient quantity of the agents of this invention to 

effectively treat the patient. An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a "prophylactically effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 

30 factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 
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It will be appreciated that the present angiogeriesis protein-modulating 
compounds can be administered alone or in combination with additional angiogenesis 
modulating compounds or with other therapeutic agent, eg., other anti-cancer agents or 
treatments. 

5 In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 

comprising nucleic acid sequences set forth in Tables 1-8 , such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
methods, reagents, vectors, and cells useful for expression of angiogenesis-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 

1 0 organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 

1 5 plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, 
e.g., Berger and BCimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
volume 152 Academic Press, Inc., San Diego, CA (Berger), F.M. Ausubel et al, eds., Current 
Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & 

20 Sons, Inc., (supplemented through 1999), and Sambrook et al, Molecular Cloning - A 
Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989. 

In a preferred embodiment, angiogenesis proteins and modulators are 
administered as therapeutic agents, and can be formulated as outlined above. Similarly, 
.25 angiogenesis genes (including both the full-length sequence, partial sequences, or regulatory 
sequences of the angiogenesis coding regions) can be administered in a gene therapy 
application. These angiogenesis genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 

30 Angiogenesis polypeptides and polynucleotides can also be administered as 

vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 
compositions can include, for example, lipidated peptides (e.g.,Vitiello, A. et al, J. Clin. 
Invest. 95:341, 1 995), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) 
("PLG") microspheres (see, e.g., Eldridge, etal,Molec. Immunol 28:287 '-294, 1991: Alonso 
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etal, Vaccine 12:299-306, 1994; Jones etal, Vaccine 13:675-681, 1995), peptide 
compositions contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et 
al, Nature 344:873-875, 1990; Hu et ah, Clin Exp Immunol 1 13:235-243, 1998), multiple 
antigen peptide systems (MAPs) (see e.g., Tarn, J. P., Proc. Natl. Acad. Scl U.S.A. 85:5409- 
5 5413, 1988; Tarn, J.P., J. Immunol. Methods 196: 17-32, 1996), peptides formulated as 
multivalent peptides; peptides for use in ballistic delivery systems, typically crystallized 
peptides, viral delivery vectors (Perkus, M. E. et al, In: Concepts in vaccine development, 
Kaufinann, S. H. E., ed, p. 379, 1996; Chakrabarti, S. et al, Nature 320:535, 1986; Hu, S. L. 
et al., Nature 320:537, 1986; Kieny, M.-P. et al., AIDS Bio/Technology 4:790, 1986; Top, F. 

10 H. et al. , J. Infect. Dis. 124:148, 1971; Chanda, P. K. et al., Virology 175:535, 1990), 
particles of viral or synthetic origin (e.g., Kofler, N. et al., J. Immunol. Methods. 192:25, 
1996; Eldridge, J. H. et al., Sem. Hematol 30:16, 1993; Falo, L. D., Jr. etal, Nature Med. 
7:649, 1995), adjuvants (Warren, H. S., Vogel, F. R., and Chedid, L. A. Annu. Rev. Immunol 
4:369, 1986; Gupta, R. K. etal, Vaccine 11:293, 1993), liposomes (Reddy, R. etal, J. 

15 Immunol 148:1585, 1992; Rock,K. L.Jmmunol Today 17:131, 1996), or, naked or particle 
absorbed cDNA (Ulmer, J. B. et al, Science 259: 1745, 1993; Robinson, H. L., Hunt, L. A., 
and Webster, R. G., Vaccine 1 1 :957, 1993; Shiver, J, W. et al, In: Concepts in vaccine 
development, Kaufinann, S. H. E., ed., p. 423, 1996; Cease, K. B„ and Berzofsky, J. A., 
Annu. Rev. Immunol. 12:923, 1994 and Eldridge, J. H. etal, Sem. Hematol. 30:16, 1993). 

20 Toxin-targeted delivery technologies, also known as receptor mediated targeting, such as 
those of Avant Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 
or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 

25 Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, 
Detroit, Ml); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 
(SmithKline Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel 
(alum) or aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of 

30 acylated tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 
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Vaccines can be administered as nucleic acid compositions wherein DNA or 
RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient This approach is described, for instance, in Wolff et al 9 Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
5 WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 
include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 
cationic lipid complexes, and particle-mediated ("gene gun'') or pressure-mediated delivery 
{see, e.g. f U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 

10 invention can be expressed by viral or bacterial vectors. Examples of expression vectors 

include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, for example, as a vector to express nucleotide sequences that encode 
angiogenic polypeptides or polypeptide fragments. Upon introduction into a host, the 
recombinant vaccinia virus expresses the immunogenic peptide, and thereby elicits an 

1 5 immune response. Vaccinia vectors and methods useful in immunization protocols are 
described in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Calmette 
Guerin). BCG vectors are described in Stover et a/., Nature 351:456-460 (1991). Awide 
variety of other vectors useful for therapeutic administration or immunization e.g. adeno and 
adeno-associated virus vectors, retroviral vectors, Salmonella typhi vectors, detoxified 

20 anthrax toxin vectors, and the like, will be apparent to those skilled in the art from the 

description herein (see, e.g., Shata et al (2000) Mol Med Today, 6: 66-71; Shedlock et al, J 
Leukoc Biol 68,:793-806, 2000; Hipp et al, In Vivo 14:571-85, 2000). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing an angiogenesis gene or portion of an angiogenesis gene under the control of a 

25 regulatable promoter or a tissue-specific promoter for expression in an angiogenesis patient. 
The angiogenesis gene used for DNA vaccines can encode full-length angiogenesis proteins, 
but more preferably encodes portions of the angiogenesis proteins including peptides derived 
from the angiogenesis protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences derived from an angiogenesis gene. 

30 For example, angiogenesis-associated genes or sequence encoding subfragments of an 
angiogenesis protein are introduced into expression vectors and tested for their 
immunogenicity in the context of Class I MHC and an ability to generate cytotoxic T cell 
responses. This procedure provides for production of cytotoxic T cell responses against cells 
which present antigen, including intracellular epitopes. 
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In a preferred embodiment, the DNA vaccines include a gene encoding an 
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the angiogenesis polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 
5 In another preferred embodiment angiogenesis genes find use in generating 

animal models of angiogenesis. When the angiogenesis gene identified is repressed or 
diminished in angiogenesic tissue, gene therapy technology, e.g., wherein antisense RNA 
directed to the angiogenesis gene will also diminish or repress expression of the gene. 
Animal models of angiogenesis find use in screening for modulators of an angiogenesis- 

10 associated sequence or modulators of angiogenesis. Similarly, transgenic animal technology 
including gene knockout technology, for example as a result of homologous recombination 
with an appropriate gene targeting vector, will result in the absence or increased expression 
of the angiogenesis protein. When desired, tissue-specific expression or knockout of the 
angiogenesis protein may be necessary, 

15 It is also possible that the angiogenesis protein is overexpressed in 

angiogenesis. As such, transgenic animals can be generated that overexpress the 
angiogenesis protein. Depending on the desired expression level, promoters of various 
strengths can be employed to express the transgene. Also, the number of copies of the 
integrated transgene can be determined and compared for a determination of the expression 

20 level of the transgene. Animals generated by such methods find use as animal models of 
angiogenesis and are additionally useful in screening for modulators to treat angiogenesis or 
to evaluate a therapeutic entity. 

Kits for Use in Diagnostic and/or Prognostic Applications 

25 For use in diagnostic, research, and therapeutic applications suggested above, 

kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, angiogenesis-specific nucleic 
acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 
ribozymes, dominant negative angiogenesis polypeptides or polynucleotides, small molecules 

30 inhibitors of angiogenesis-associated sequences etc. A therapeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 
(/.&, protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 
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medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (eg., CD ROM), and the 
like. Such media may include addresses to internet sites that provide such instructional 
5 materials. 

The present invention also provides for kits for screening for modulators of 
angiogenesis-associated sequences. Such kits can be prepared from readily available 
materials and reagents. For example, such kits can comprise one or more of the following 
materials: an angiogenesis-associated polypeptide or polynucleotide, reaction tubes, and 

10 instructions for testing angiogenic-associated activity. Optionally, the kit contains 

biologically active angiogenesis protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 

1 5 parameters in disease which may be identified in historical or outcome data. 

It is understood that the examples described above in no way serve to limit 
the true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
20 specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 

EXAMPLES 

25 Example 1: Tissue Preparation. Labeling Chips, and Fingerprints 
Purify total RNA from tissue using TRIzol Reagent 

Homogenize tissue samples in 1ml of TRIzol per 50mg of tissue using a 
Polytron 3 100 homogenizes The generator/probe used depends upon the tissue size. A 
generator that is too large for the amount of tissue to be homogenized will cause a loss of 

30 sample and lower RNA yield. TRIzol is added directly to frozen tissue, which is then 

homogenize. Following homogenization, insoluble material is removed by centrifugation at 
7500 x g for 15 min in a Sorvall superspeed or 12,000 x g for 10 rrrin. in an Eppendorf 
centrifuge at 4°C. The clear homogenate is transferred to a new tube for use. The samples 
may be frozen now at -60° to -70°C (and kept for at least one month). The homogenate is 
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mixed with 0.2ml of chloroform per 1ml of TRIzol reagent used in the original 
homogenization and incubated at room temp, for 2-3 minutes. The aqueous phase is then 
separated by centrifugation and transferred to a fresh tube and the RNA precipitated using 
isopropyl alcohol The pellet is isolated by centrifugation, washed, air-dried, resuspended in 
5 an appropriate volume of DEPC H 2 0, and the absorbance measured. 

Purification of poly A+ mRNA from total RNA is performed as follows. Heat 
an oligotex suspension to 37°C and mixing immediately before adding to RNA. The 
Elution Buffer is heated at 70°C. Warm up 2 x Binding Buffer at 65°C if there is precipitate 
in the buffer. Mix total RNA with DEPC-treated water, 2 x Binding Buffer, and Oligotex 

1 0 according to Table 2 on page 1 6 of the Oligotex Handbook. Incubate for 3 minutes at 65°C. 
Incubate for 10 minutes at room temperature. Centrifuge for 2 minutes at 14,000 to 18,000 g. 
Remove supernatant without disturbing Oligotex pellet. A little bit of solution can be left 
behind to reduce the loss of Oligotex. Gently resuspend in Wash Buffer OW2 and pipet onto 
spin column. Centrifuge the spin column at full speed for 1 minute. Transfer spin column to 

15 a new collection tube and gently resuspend in Wash Buffer OW2 and centrifuge as describe 
herein. Transfer spin column to a new tube and elute with 20 to 100 ul of preheated (70oC) 
Elution Buffer. Gently resuspend Oligotex resin by pipetting up and down. Centrifuge as 
above. Repeat elution with fresh elution buffer or use first eluate to keep the elution volume 
low. Read absorbance, using diluted Elution Buffer as the blank. Before proceeding with 

20 cDNA synthesis, precipitate the mRNA as follows: add 0.4 vol. of 7.5 M NH40Ac + 2.5 vol. 
of cold 100% ethanoL Precipitate at -20oC 1 hour to overnight (or 20-30 min. at -70oC). 
Centrifuge at 14,000-16,000 x g for 30 minutes at 4oC. Wash pellet with 0.5ml of 
80%ethanol (-20oC) then centrifuge at 14,000-16,000 x g for 5 minutes at room temperature- 
Repeat 80% ethanoi wash. Air dry the ethanol from the pellet in the hood.. Suspend pellet in 

25 DEPC H 2 0 at lug/ul concentration. 

To further Clean up total RNA using Qiagen's RNeasy kit, add no more than 
lOOug to an RNeasy column. Adjust sample to a volume of lOOul with RNase-free water. 
Add 350ul Buffer RLT then 250ul ethanol (100%) to the sample. Mix by pipetting (do not 
centrifuge) then apply sample to an RNeasy mini spin column. Centrifuge for 15 sec at 

30 >10,000rpm. Transfer column to a new 2-ml collection tube. Add 5 OOul Buffer RPE and 
centrifuge for 15 sec at >10,000rpm. Discard flowthrough. Add 500ul Buffer RPE and 
centrifuge for 15 sec at >10,000rpm. Discard flowthrough then centrifuge for 2 min at 
maximum speed to dry column membrane. Transfer column to a new 1.5 -ml collection tube 
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and apply 30-50ul of RNase-free water directly onto column membrane. Centrifuge 1 min at 
>10,000rpm. Repeat elution. and read absorbance. 

cDNA synthesis using Gibco's "Superscript Choice System for cDNA Synthesis" kit 
5 First Strand cDNA synthesis is performed as follows. Use 5ug of total RNA 

or lug of poly A+ mRNA as starting material. For total RNA, use 2ul of Superscript RT. For 
polyA+ mRNA, use lul of Superscript RT. Final volume of first strand synthesis mix is 
20uL RNA must be in a volume no greater than lOul. Incubate RNA with lul of lOOpmol 
T7-T24 oligo for 10minat70C. On ice, add 7 ul of: 4ul5Xlst Strand Buffer, 2ul of 0.1 M 

10 DTT, and 1 ul of lOmM dNTP mix. Incubate at 37C for 2 min then add Superscript RT. 
Incubate at 37C for 1 hour. 

For the second strand synthesis, place 1st strand reactions on ice and add: 9 lul 
DEPC H 2 0; 30ul 5X 2nd Strand Buffer, 3ul lOmM dNTP mix; lul lOU/ul E.coli DNA 
Ligase; 4ul lOU/ul E.coli DNA Polymerase; and lul 2U/ul RNase H. Mix and incubate 2 

15 hours at 16C. Add 2ul T4 DNA Polymerase. Incubate 5 min at 16C. Add lOul of 0.5M 
EDTA. A further clean-up of DNA is performed using phenol:chloroform:isoamyl Alcohol 
(25:24:1) purification. 

In vitro Transcription (TVT) and labeling with biotin is performed as follows: 
Pipet 1 .5ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2ul T7 

20 lOxATP (75mM) (Ambion); 2ul T7 lOxGTP (75mM) (Ambion); 1.5ul T7 lOxCTP (75mM) 
(Ambion); 1.5ul T7 lOxUTP (75mM) (Ambion); 3.75ul lOmM Bio-1 1-UTP (Boehringer- 
Mannheim/Roche or Enzo); 3.75ul lOmM Bio-16-CTP (Enzo); 2ul lOx T7 transcription 
buffer (Ambion); and 2ul lOx T7 enzyme mix (Ambion). The final volume is 20ul. Incubate 
6 hours at 37°C in a PCR machine. The RNA can be furthered cleaned. 

25 Fragmentation is performed as follows. 1 5 ug of labeled RNA is usually 

fragmented. Try to rninimize the fragmentation reaction volume; a 10 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 
the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 

30 buffer is 200 mM Tris-acetate, pH 8. 1 ; 500 mM KOAc; 150 mM MgOAc). The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
of the transcript size range 
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For hybridization, 200 ul (lOug cRNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled RNA (50ng/ul final cone); 50 pM 948-b control 
5 oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; O.lmg/ml herring sperm DNA; 
0.5mg/mi acetylated BSA; and 300 ul with lxMES hyb buffer. 

Labeling is performed as follows: The hybridization reaction includes non- 
biotinylated IVT (purified by RNeasy columns); IVT antisense RNA 4 ng:ul; random 
Hexamers (1 ug/ul) 4 fal and water to 14 ul. The reaciton is incubated at 70°C, 10 min. 

10 Reverse transcriptionis performed in the following reaction: 5X First Strand (BRL) buffer, 6 
ul; 0. 1 M DTT, 3 ul; 50X dNTP mix, 0.6 ul; H 2 0, 2.4 ul; Cy3 or Cy5 dUTP (lmM), 3 ul; SS 
RT EE (BRL), 1 ul in a final volume of 16 ul. Add to hybridization reaction. Incubate 30 
min., 42°C. Add 1 ul SSII and incubate another hour. Put on ice. 50X dNTP mix (25mM of 
cold dATP, dCTP, and dGTP, lOmM of dTTP: 25 ul each of lOOmM dATP, dCTP, and 

15 dGTP; 10 ul of lOOmM dTTP to 15 ul H20. dNTPs fiom Pharmacia) 

RNA degradation is performed as follows. Add 86 ul H20, 1.5 ul 1M NaOH/ 
2mM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 ul TE/sample spin at 7000g 
for 1 0 min, save flow through for purificatioa For Qiagen purification, suspend u-con 
recovered material in 500ul buffer PB and proceed using Qiagen protocol. For DNAse 

20 digestion, add 1 ul of 1/100 dil of DNAse/30ul Rx and incubate at 37°C for 15 min. Incubate 
at 5 min 95°C to denature the DNAse/ 

For sample preparation, add Cot-1 DNA, 10 ul; 50X dNTPs, 1 ul; 20X SSC, 
2.3 ul; Na pyro phosphate, 7.5 ul; lOmg/ml Herring sperm DNA; lul of 1/10 dilution to 21.8 
final vol. Dry in speed vac. Resuspendin 15 ulH20. Add 0.38 ul 10% SDS. Heat95°C,2 

25 min and slow cool at room temp, for 20 min. Put on slide and hybridize overnight at 64°C. 
Washing after the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 mis 20X SSC-K).75mls 
10% SDS in 250mls H20; IX SSC: 5 min., 12.5 mis 20X SSC in 250mls H20; 0.2X SSC: 5 
min., 2.5 mis 20X SSC in 250mls H20. Dry slides and scan at appropiate PMT's and 
channels. 

30 

Example 2. A model of angiogenesis is used to determine expression in angiogenesis 

In the model of angiogenesis used to determine expression of angiogenesis- 
associated sequences, human umbilical vein endothelial cells (HUVEC) were obtained, e.g.> 
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as passage 1 (pi) frozen cells from Cascade Biologies (Oregon) and grown in maintenance 
medium: Medium 199 (Life Technologies) supplemented with 20% pooled human serum, 
100 mg/ml heparin and 75 mg/ml endothelial cell growth supplements (Sigma) and 
gentamicin (Life Technologies). An in vitro cell system model was used in which 2x10 s 
5 HUVECs were cultured in 0.5 ml 3 mgs/ml plasminogen-depleted fibrinogen (Calbiochem, 
San Diego, CA) that was polymerized by the addition of 1 unit of maintenance medium 
supplemented with 100 ng/ml VEGF and HGF and 10 ng/ml TGF-a (R&D Systems, 
Minneapolis,MN) added (growth medium). The growth medium was replaced every 2 days. 
Samples for RNA were collected, e.g., at 0, 2, 6, 15, 24, 48, and 96 hours of culture. The 

10 fibrin clots were placed in Trizol (Life Technologies) and disrupted using a Tissuemizer. 
Thereafter standard procedures were used for extracting the RNA (e.g., Example 1). 

Angiogenesis associated sequences thus identified are shown in Tables 1-8 . 
As indicated, some of the Accession numbers include expression sequence tags (ESTs). 
Thus, in one embodiment herein, genes within an expression profile, also termed expression 

1 5 profile genes, include ESTs and are not necessarily full length. 
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TABLE 1: 



10 



15 



20 



25 



30 



35 



40 



45 



50 



Pkey: Unique Eos piobeset Identifier number 

Accession: Accession number used for previous patent Kings 

ExAccn: Exemplar Accession number, Genbank accession number 

UnlgenelD: Unlgene number 

Unlgene Title: Unlgene gene title 



Ptey Accession ExAccn UnlgenelD UnlgeneTWe 



134404 
121443 
100082 
132817 
130150 
100104 
130839 
4Z7064 
100113 
133980 
100129 
100154 
100169 
101956 
100190 
134742 
100211 
100238 
130283 
134237 
100248 
100255 
100262 
134329 
100281 
100294 
100327 
100335 
134495 
100338 
135152 
134269 
100372 
134304 
100394 
100405 
100418 
133154 
134347 
128653 
100438 
134593 
55 100481 
100552 
100591 
ods 

100652 

60 100662 
100899 
100905 
100945 
100950 

65 100964 
135407 
130149 
131877 
101016 

70 134786 
134100 
134078 
101132 
134849 

75 106432 



AB000450 

AB002380 

AB003103 

AB004884 

AF000573_ma1 

AF008937 

AF009301 

ARJ09368 

D00591 

D00760 

D11139 

D14657 

014878 

D17716 

D21090 

026135 

D26528 

D30742 

031762 

D31765 

D31888 

D36128 

D38500 

D38551 

D42087 

049396 

D55640 

063391 

D63477 

D63483 

D64015 



AB000450 

AF180681 

AA130080 

N27852 

BE094848 

AF008937 

AB011169 

AF029674 

NMJM1269 

AA294921 



H6O720 

AL037228 

NMJ30241Q 

M91401 

NM_001346 

026528 

L24959 

NM.012288 

031765 

NM.015156 

D25418 

D38500 

N 92036 

AF091035 

AA331881 

DS5640 

AW247529 

D63477 

086664 



D79997 
D80010 
D84276 
D86425 
086978 
D87012 
D87075 
087432 
087448 
D87845 

HG1Q98*T1098 
HG2167-HT2237 
HG241&KT2511 



NM.014737 

NM.014791 

BE613486 

D84284 

AW291587 

D86978 

087012 

AF164142 

D87432 

AA013051 

NM.000437 

X70377 

AA019521 

NM.004091 



HG2825^T2949 BE613608 
HG2887-HT3031_f 
HG4660W5O73 AL039123 
HG47044fT5146 L12260 
HG884-HT884 AF002225 
HG919-HT919 AF128542 
J00212J J00212 



J04029 
J04031 
J04088 
J04543 
L06139 
L07540 



J04029 
AW067805 



L11239 
L11353 
L13773 



J04543 

T29618 

AA460085 

L08895 

L11239 

BE409525 

AK000310 



Hs,82771 

Hs.6562 

Hs.4295 

Hs.57553 

Hs.15113 

Hs.102178 

Hs.20141 

Hs.173422 

H3.84746 

Hs.250811 

Hs.5831 

Hs.81892 

Hs.82043 

Hs.121502 

Hs.178658 

Hs.89462 

Hs.123058 

Hs.348 

Hs. 153954 

Hs.170114 

Hs.78398 

Hs^93 

Hs.278468 

Hs.81848 

Hs.184627 

Hs.75454 

Hs.6793 

Hs.84087 

Hs.57735 

Hs,182741 

Hs.80905 

H9.184339 

Hs.81412 

Hs.66052 

Hs.82733 

Hs.84790 

Hs.194665 

Hs.82042 

Hs.10315 

Hs.91417 

Hs.234392 

Hs.121489 

Hs.301946 

Hs.231444 

Hs.142653 
AI368680 
Hs.103042 
Hs.172816 
Hs.180686 
Hs.166846 

Hs.99936 

Hs.172665 

Hs.156346 

Hs.78637 

Ms.89640 

Hs.171075 

Hs.78995 

Hs.36993 

Hs.902 

Hs.17138 



vaccinia related kinase 2 

Rho guanine exchange factor (GEF) 12 

proteasome (prosome, macropain) 26S subunit, non-ATPaso, 12 

tousJed-Bke kinase 2 

hornogenfJsate 1,2-dicxygenase (homogentisata oxidase) 
syntaxin 16 

similar to S. cerevislae SSM4 
KIAA1605 protein 
chromosome condensation 1 

v-rai simian leukemia viral oncogene homotog B (ras related; GTP binding protein) 
tissue inhibitor of metaltoprotelnase 1 (erythroid potentjaflng activity, ccDagenase inhibitor) 
WAA01 01 gene product 
0123 gene product 

mannosyl (afpha-1,6-)-giycoprotein beta-1 t 6^c»ty^fucosaminyttraf^erase 
RAD23 (S. cerevislae) homotog B 
diacylgr/ceroi kinase, gamma (90kO) 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 7 (RNA heUcase, 52kD) 
caldum/caJrnodutMependent protein kinase IV 
TRAM-Ske protein 
K1AA0061 protein 
KIAA0071 protein 

prostaglandin 12 (prostacyclin) receptor OP) 
postmeiotic segregation Increased 2-Gke 4 
RAD21 (S.pombe) homotog 
KIAA01 18 protein 
peroxiredoxin 3 

gb:Human monocyte PABL (pseudoautosomal boundary-Ike sequence) mRNA, clone Mo2. 
plateiektdivating factor acetyihydrolase, isoform lb. gamma subunit (29k0) 
KIAA0143 protein 
acetyl LDL receptor; SREC 

T1A1 cytotoxic granule-associated RNA-binding protein-Eke 1 
Ras association (RaIGDS/AF-6) domain family 2 
KIAA0175 gene product 
iipinl 

C038 antigen (p45) 
nidogen2 
WAA0225 protein 
topoisomerase(DNA) III beta 

solute carrier family 23 (nudeobase transporters), member 1 

solute carrier family 7 {caflonfc amino add transporter, y+ system), member 6 

topoisomerase (DMA) II binding protein 

platetet-acfivafing factor acetyl hydrolase 2 (40kO) 

cystafJnD 



Homo sapiens, Similar to hypothetical protein PR01722, done MGC:15692, mRNA, complete 
ret finger protein 

Hs.816 SRY (sex determining region Y)-box 2 
i1B 



1 

ubiquin protein Bgase E3A (human papilloma vims E6-associaied protein, Angelman syndrome) 

polymerase (DMA directed), epsflon 

Empirically selected from AFFX single probeset 

keratin 10 (epidermorytic hyperkeratosis; keratosis palmarts et piantaris) 

methylenetetrahydrofolate dehydrogenase (MAD P+ dependent), methenyltetrahydrofblate 

topoisomerase (DMA) II alpha (170kD) 

annexlnA7 

TEK tyrosine kinase, endothelial (venous malformations, multiple cutaneous and mucosal) 
replication factor C (activator 1) 5 (36J>kD) 

MADS box transcription enhancer factor 2, polypeptide C (myocyte enhancer factor 2C) 
gastrulation brain homeo box 1 
neurofibroma 2 (bilateral acoustic neuroma) 
hypothetical protein RJ20303 
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101152 L13600 


A1984625 


Hs.9884 


spindle pole body protefo 


135397 114922 


1.14922 


Hs. 166563 


replication (actor C (activator 1) 1 (145kD) 


131687 L15189 


BE297635 


HS.3C69 


heat shock 7QVD protein 9B (mortaEn-2) 


101168 L15388 


NMJB5308 


Hsu211569 


G protein-coupled receptor kinase 5 


421155 L16895 


H87879 


Hs.102267 


rysyicoGuase 


101226 L27476 


AF083892 


Hs.75608 


fight junction protein 2 (zona ocdudens 2) 


133975 127624 


C18356 


Hs.235944 


tissue factor pathway Inhibitor 2 


134739 L32976 


NM.002419 


Hs.89449 


mitogen-actjvated protein kinase kinase kinase 11 


130155 L33404 


AA101043 


Hs.151254 


kaffikrebi 7 (chymotryptic, stratum comeum) 


440538 135263 


W76332 


Hs.79107 


mitogennactivated protein kinase 14 


132813 137347 


BE313625 


Hi57435 


solute canter tamOy 1 1 (proton-coupled divalent metal Ion transporters), member 2 


101294 L40371 


AF168418 


Hs,1 16784 


thyroid hormone receptor interacted 


101300 L40391 


BE535511 


Hs.74137 


transmembrane trafficking protein 


101310 L41607 


L41607 


Hs£34 


glucosamine (N-acetyl) transferase 2, l-branching enzyme 


130344 L77566 


AW250122 


Hs. 154879 


OiGeorge syndrome critical region gene DGSt; likely orthotog of mouse expressed sequence 2 


embryonic lethal 








101381 M13928 


AW675039 


Hs.1227 


airdnolevutlnate, delta-, dehydratase 


101668 M14016 


AW005903 


Hs.78601 


uroporphyrinogen decarboxylase 


133780 M14219 


AA557660 


Hs.76152 


decorin 


101396 M15796 


BE267931 


Hs.78996 


proliferating cell nuclear antigen 


101447 M21305 


M21305 




gbfluman alpha satellite and satellite 3 junction DNA sequence. 


101458 M22092 


M22092 




gb^uman neural ceD adhesion molecule (N-CAM) gene, exon SEC and partel cds. 


101470 M22898 


NNL0Q0546 


Hs,1846 


tumor protein p53 (U-Fraumeni syndrome) 


134604 M22995 


NM_002884 


Hs.865 


fWI A, member of RAS oncogene family 


101478 M23379 


NMJKI2890 


Hs.758 


RAS p21 protein activator (GTPase activating protein) 1 


406698 M24364 


X03068 


Hs.73931 


major histocompatibility complex, class II, DQ beta 1 


133519 M24400 


AW583062 


Hs.74502 


chymotrypsinogen B1 


131165 M25753 


BE28O074 


Hs.23960 


cydkiBI 


134116 M27691 


R84694 


Hs.79194 


cAMP responsive element binding protein 1 


133999 M28213 


AA535244 


Ks.78305 


RAB2, member RAS oncogene femCy 


130174 M29550 


M29551 


Hs.151531 


protein phosphatase 3 (formerly 2B), catalytic subunit, beta isoform (calclneurin A beta) 


129963 M29971 


M29971 


Hs.1384 


0^4nethylguanlne-DNA mefttf transferase 


132983 M30269 


M30269 


Hs.62041 


niobgen (enactln) 


133900 M31158 


M31158 


Hs.77439. 


protein kinase, cAMP-dependent, regulatory, type n, beta 


101543 M31166 


M31166 


Hs.2050 


pentaxtn-related gene, rapidly induced by IL-1 beta 


101545 M31210 


BE246154 


. Hs.154210 


endothelial differentiation, sphingollpld G-protesn-ooupled receptor, 1 


101620 M55420 


S55271 


Hs.247930 


EpsSon , IgE 


134691 M59979 


AW382987 


Hs.88474 


prostaglandins aoperoxide synthase 1 (prostaglandin G/H synthase and cyclooxygenase) 


133595 M62810 


AA393273 


Hs.75133 


transcription factor 6-Gke 1 (mitochondrial transcription factor 1-iike) 


130425 M63838 


AA243383 


Hs.155530 


Interferon, gamma -inducible protein 16 


101700 M64710 


D90337 


Hs.247916 


natriuretic peptide precursor C 


101714 M68874 


M68874 


Hs.211587 


ptosrjhoJipase A2, group IVA (cytosoDc, calcium-dependent) 


134246 M74524 


D28459 


Hs.80612 


ublquiiin-conjugating enzyme E2A (RAD6 homolog) 


101760 M80254 


M80254 


Hs.173125 


peptldytprotyl tsomerase F (cydophirm F) 


133948 M31780_cds3 


X59960 


Hs.77813 


sphingomyelin phosphodiesterase 1 , acid lysosomal (acid sphingomyelinase) 


101791 M83822 


M83822 


Hs.62354 


ceD division cycle 4-like 


101812 M86934 


BE439894 


Hs.78991 


DNA segment numerous copies, expressed probes (GS1 gene) 


101813 M87338 


NM__0G2914 


Hs.1 39226 


replication factor C (activator 1) 2 (40 kD) 


133396 M96326_ma1 


M96326 


Hs.72885 


azurocidin 1 (cationlc antimicrobial protein 37) 


135152 M96954 


M96954 


Hs.182741 


TIA1 cytotoxic granule-associated RNA-bfnding protein-like 1 


129026 M98833 


AL120297 * 


Hs.108043 


Friend leukemia virus integration 1 


101901 S66793 


H38026 


Hs.308 


anestin 3, retinal (X-arresfJn) 


134831 S72370 


AA853479 


Hs.89890 


pyruvate carboxylase 


134039 S78569 


NM-002290 


Hs.78672 


laminin, alpha 4 


134395 S79873 


AA456539 


Hs.8262 


lysosomal 


101975 S83325 


AA079717 


Hs.283664 


aspartate beta-hydraxylase 


101977 S83364 


AF112213 


Hs. 184062 


putative Rab5-interacfing protein 


101978 S83365 


BE561610 


H&5609 


putative transmembrane protein; homolog of yeast Golgi membrane protein Vrflp (Ylplp- 


Interacting factor/ 








101998 U01212 


U01212 


H3243153 


oifactory marker protein 


102003 U01922 


U01922 


Hs.125565 


transkxase of Inner mitochondrial membrane 8 (yeast) homolog A 


102007 U02556 


U02556 


HS75307 


t-comptex-assocfated-testls-expressed 1-Bke 


102009 U02680 


8E245149 


Hs.82643 


protein tyrosine kinase 9 


416658 U03272 


U03272 


Hs.79432 


Ebii 2 (congenita) contractural arachnodactyly) 


132951 U04209 


AW821182 


Hs.61418 


mlcrofibnTlar-assoclated protein 1 


135389 U05237 


U05237 


Hs.99872 


fetal Alzheimer anggen 


102048 U07225 


U07225 


Hs.339 


purinetglc receptor P2Y, G-protem coupled, 2 


130145 U07620 


U34820 


Hs.151051 


mitogen-acfivated protein kinase 10 


303153 U09759 


U09759 


Hs.246857 


mitogon-activated protein kinase 9 


420269 U09820 


U72937 


Hs.96264 


alpha thalassemla/mental retardation syndrome X-tinked (RAD 54 (S. cerevisiae) homolog)- 


102095 U11313 


U11313 


Hs.75760 


sterol carrier protein 2 


102123 U14518 


NM_001809 


Hs.1594 


centromere protein A (17kD) 


102126 U14575 


AW950870 


Hs.78961 


protein phosphatase 1, regulatory (inhibitor) subunit 8 


102133 U15173 


AU076845 


Hs.155596 


BCL2radenoviniS E1B 19kO-*nteracting protein 2 


102139 U15932 


NWL004419 


Hs.2128 


dual specificity phosphatase 5 


102162 U18291 


AA45Q274 


Hs.1592 


CDC 16 (cell division cycle 16. S. cerevisiae. homolog) 
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102164 


U 18300 


NMJXW107 


Hs. 77602 


damage-specific DNA binding protein 2 (48kD) 


427653 


U 18383 


M159001 


Hs.180069 


nuclear respiratory factor 1 


131817 


U 20536 


U 20536 


Hs32B0 


caspase 6, apoptoste-related cysteine protease 


102200 


U21551 


AA232362 


Hs. 1572)5 


branched chain aminotransferase 1, cytosoDc 


102210 


(J23028 


BE619413 


Hs.2437 


eukaryotte translafion Initiation factor 2S, sitbimlt 5 (epsflon, B2kD) 


102214 


U23752 


U23752 


HsJ32964 


SRY (sex determining region YJ-dox 11 


132811 


U25435 


U25435 


Hs£7419 


CCCTC-binding (actor (zinc finger protein) 


131319 


U25JB7 


N1VL003155 


H&25590 


stanntocaldn 1 


102258 


U2fl251„ods2 


U28251 


Hs^3237 


ESTs, Highly similar to Z169.J1UMAN ZINC FINGER PROTEIN 169 [M^apians] 


132316 


U28831 


U28831 


Hs.44566 


K1AA1641 protein 


102269 


U30245 


U30245 




gbHuman rrryebmonocyfec specffic protein (MNDAJ gene, 5 Hanking sequence ano compjera 


exon 1. 
134365 


U32315 


AA568906 


Hs.82240 


syntaxfn 3A 


102293 


U32439 


AF090116 


Hs.79348 


regulator of G-proteln signaling 7 


102298 


U32849 


AA382169 


Hs.54483 


N-myc (and STAT) hteractor 


102325 


U35139 


AI815867 


Hs.50130 


necdtn (mouse) homofog 


302344 


U36764 


6E303044 


Ha. 192023 


eukaiyctic translation Initiation factor 3. subunit 2 {beta, 36kD) 


102361 


U39400 


AA223616 


Hs.75859 


chromosome 1 1 open reading frame 4 


102367 


U39657 


U39656 


Hs. 118825 


mltogen-acrJvated protein kinase kinase 6 


102388 


U41344 


AA362907 


Hs76494 


proline arglnhe-neh end letidne-rfch repeat protein 


102394 


U41766 


NMJW3816 


Hs.2442 


a dlsintegrto and metalloprotainase domain 9 (meltrtn gamma) 


12S829 


U41813 


AF010258 


Hs.127428 


homeoboxA9 


102251 


U41815 


NIA.004398 


Hs.41706 


DEAD/H (Asp-Glu-Aia-Asp/Hs) box polypeptide 10 (RNA heficase) 


102409 


U43286 


BE300330 


Hs.118725 


selenophosphate synthetase 2 


133746 


U44378 


AW410035 


Hs.75862 


MAO (mothers against decapenteplegic, DrosophQa) homotog 4 


102423 


U44754 


Z47542 


Hs.179312 


smaD nudear RNA activating complex, polypeptide 1 , 43kD 


132828 


U47011_cds1 


AB014615 


Hs.57710 


fibroblast growth factor 8 {androgemnduced) 


130441 


U47077 


U63630 


Hs.155637 


protein kinase, DNA-activated, catalytic polypeptide 


102450 


U48251 


U48251 


Hs.75871 


protein kinase o DiiQing protein i 


129350 


U50535 


U50535 


Hs.110630 


Human BRCA2 region, mRNA sequence CG006 


102534 


U56833 


U96759 


Hs.198307 


von MppeRlndau binding protein 1 


130457 


U58091 


AB014595 


Hs.155976 


cuDin4B 


135055 


U58837 


AA019401 


Hs.93909 


cyclic nucleotide gated channel beta 1 


102560 


U59289 


R97457 


Hs.63984 


cadhertn 13, H-cadherin (heart) 


102567 


U59863 


U63830 


Hs.146847 


TRAF family member-associated NFKB activator 


134305 


U67122 


U61397 


Hs.81424 


ubkruiMce 1 (sentrin) 


102638 


U67319 


U67319 


Hs.9216 


caspase 7, apoptoslswetated cysteine protease 


132736 


U66019 


AW081883 


Hs.288261 


Homo sapiens cDNA: FU23037 fls, clone LNG02036, highly similar to HSU68019 Homo 


sapiens mad protein horn 


dog (hMAW) mRNA 




133070 


U69611 


U92649 


Hs.64311 


a dlsintegrin and metaitoprotelnase domain 17 (tumor necrosis factor, alpha, converting enzyme) 


102663 


LT70322 


NMJN2270 


Hs.168075 


karyopherln (importin) beta 2 


134660 


U73524 


U73524 


Hs.87465 


ATP/GTP-bindlng protein 


102735 


U79267 


AF111106 


Hs.3382 


protein phosphatase 4, regulatory subunit 1 


102741 


U79291 


AW959829 


Hs.83572 


hypothetical protein MGC14433 


101175 


U82671 cds2 


U82671 


Hs.36980 


melanoma antigen, family A, 2 


132164 


U84573 


AI752235 


Hs.41270 


procollagen-lysine, 2-oxoglutarate 5-dtoxygenase (lysine hydroxyiase) 2 


102823 


U90914 


D85390 


Hs.5057 


carboxypepfldaseD 


102826 


U91316 


NM_007274 


Hs.8679 


cytosolic acyl coenzyme A fhioester hydrolase 


102831 


U91932 


AA262170 


Hs.80917 


adaptor-related protein complex 3, sigma 1 subunit 


102846 


U96131 


BE264974 


Hs.6566 


thyroid hormone receptor interactor 13 


129777 


U97018 


U97018 


Hs.12451 


echrhoderm mlcrotubule-assodated protern-Bke 


134161 


U97188 


AA634543 


Hs.79440 


1GF-II mRNA-binding protein 3 


134854 


V00503 


J03464 


Hs.179573 


collagen, type 1, alpha 2 


302363 


X04327 


AW163799 


Hs.198365 


2,3-bispnosphoglycerate mutase 


133708 


X06389 


AI018666 


Hs.75667 


synaptophysin 


125701 


X07496 


T72104 


Hs.93194 


apo lipoprotein A-l 


102915 


X07B20 


X07820 


Hs.2258 


matrix metaitoprotelnase 10 (stromelysin 2) 


134656 


X14787 


AI750878 


Hs.87409 


thrombospondin 1 


413858 


X15525_ma1 


NM.001610 


HSJ5589 


sen piw^JiiaiaJw c, lyawUHiai 


102968 


X16396 


AU076611 


Hs.154672 


methylene tetrahydrofolate dehydrogenase (NAD+ dependent), melhenyttetrahydrofolale 


cyclohydrotase 








102971 


X16609 


X16609 


Hs.183805 


ankyrtn 1,eiythrocytic 


134037 


X53586_rna1 


AI808780 


Hs.2Z7730 


tntegrin, alpha 6 


103023 


X53793 


AW500470 


Hs.1 17950 


multifunctional polypeptide similar to SAICAR synthetase and AIR carboxylase 


103037 


X54936 


BE018302 


Hs.2894 


placental growth factor, vascular endothelial growth factor-related protein 


130282 


X55740 


BE245380 


Hs.153952 


5' nucleotidase (C073) 


134542 


X57025 


M14156 


Hs.85112 


tnsutbviOte growth factor 1 (somatomedin C) 


128568 


X60673_ma1 


H12912 


Hs.274691 


adenylate kinase 3 


103093 


X60708 


S79876 


Hs.44926 


dipeptidytpeptldase IV (C026, adenosine deaminase comptexing protein 2) 


133606 


X62048 


U10564 


Hs75188 


wee1+ (S. pombe) homolog 


129063 


X63097 


X63094 


Hs.283822 


Rhesus blood group, D antigen 


424460 


X63563 


BE275979 


Hs. 296014 


polymerase (RNA) 11 (DNA directed) polypeptide B (140kO) 


133227 


X64037 


AW977263 


Hs.68257 


general transcription factor IIF, polypeptide 1 (74kO subunit) 


103181 


X69636 


X69636 


Hs.334731 


Homo sapiens, clone IMAGE:3448306, mRNA, partial cos 


103184 


X69878 


043143 


Hs.74049 


fms-related tyrosine kinase 4 


103194 


X70649 


NMJJQ4939 


Hs.78580 


DEAD/H (Asp-Gfu-Ata-Asp/His) box polypeptide 1 
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103208 


X72841 


AW411340 


H&31314 


refinoblastoma-bindlng protein 7 


1296SS 


X74987 


BE242144 


Hs. 12013 


ATP-bindbg cassette, sub-family E (OABPX member 1 


131466 


X83107 


F06972 


Hs.27372 


BMX norweceptor tyrosine kinase 


130729 


X84194 


AI983747 


Hs.18573 


acylphosphatase 1, erythrocyte (common) type 


103334 


X85753 


NM_001260 


Hs.25283 


cycHn-dependent kinase 8 


132645 


X87870 


AJ654712 


Hs.54424 


hepatocyte nuclear factor 4, alpha 


135094 


X89066 


NM.003304 


Hs.250687 


transient receptor potential channel 1 


103352 


X893S3_cds2 


H09366 


Hs.78853 


ura<&DNA gfyccsytase 


103353 


X89399 


X69399 


Hs.119274 


RAS p21 protein scfivator (GTPase activating protein) 3 (lns(1,3,4,5)P4-blndhg protein) 


132173 


X89426 


X89428 


Hs.41716 


endothelial ceD-spedflc molecule 1 


103371 


X91247 


X91247 


Hs.13046 


thbredaxin reductase 1 


131584 


X91648 


AA598509 


Hs.29117 


purine-rich element bhtfhg protein A 


103376 


X92098 


AL036166 


Hs.323378 


coated vesicle membrane protein 


103378 


X92110 


AL119590 


Hs.153618 


HCGVIIM protBin 


128510 


X94703 


X94703 


Hs.296371 


RAB28, member RAS oncogene family 


103410 


XS6506 


AA158294 


Hs.334879 


DR1 -associated protein 1 (negative cofactor 2 alpha) 


133490 


X97230J 


AF022044 ■ 


Hs.274601 


killer cell Immunoglobufln-IIke receptor, three domains, long cytoplasmic tall, 1 


103438 


X98263 


AW175781 


Hs.152720 


M-phase prcsphoprotein 6 


103440 


X98296 


X98296 


Hs.77578 


ubiqulttn specific protease 9, X chromosome (Drosophfla fat facets related) 


103452 


X99584 


NM_006936 


Hsi5119 


SMT3 {suppressor of mil two 3, yeast) homotog 1 


133536 


Y00264 


W25797.comp Hs, 177486 . 


amyloid beta (A4) precursor protein (protease nexin-ll, Alzheimer disease) 


135185 


Y07566 


AW404908 


Hs.96038 


Ric (Drosophfla}-iike, expressed In many tissues 


118523 


Y07759 


Y07759 


Hs.170157 


myosin VA (heavy polypeptide 12, rnyoxln) 


134662 


Y07827 


NM_007048 


Hs.284283 


butyrophiBn, subfamily 3, member A1 . 


132083 


Y07867 


BE386490 


Hs.279663 


PMn 


103500 


Y09443 


AW4O8O09 


Hs^2580 


alkylglycerone phosphate synthase 


134389 


Y09858 


Y09858 


Hs.82577 


spindlin-Cke 


132084 


Y12394 


NM.002267 


Hs.3886 


karyopharin alpha 3 (bnportln afcha 4) 


103540 


211559 


NKL0Q2197 


Hs.154721 


aconltase 1, soluble 


133152 


Z11695 


Z11695 


Hs.324473 


mitogen-activated protein kinase 1 


103548 


Z15005 


215005 


Hs.75573 


centromere protein E (312kD) 


103612 


246261 


BE336654 


Hs.70937 


H3 histone family, member A 


129092 


AA011243_s 


D56365 


Hs.63525 


poty{rC)-blnding protein 2 


103692 


AA018418 


AW137912 


HsJ227583 


Homo sapiens chromosome X map Xp1 1 .23 L-type calcium channel aipha*1 subunlt 


(CACNA1F) gene, complete cds; HSP27 pseudogene, complete sequence; and JM1 protein, JM2 protein, and Hb2£ genes, complete cds 


103695 AA018758 


AW207152 


Hs.186600 


ESTs 



129796 AA018804 
132258 AA031993 
132683 AA044217 
131887 AA046548 
member 1 

103723 AA057447^s 
453368 AA058376 
133260 AA083572 

103765 AA085696 

103766 AA088744 

103767 AA089688 
132051 AA091284 
103773 AA092700 
[C.elegans] 
135289 AA092968 
132729 AA094800 
103794 AA100219 
131471 AA1 14885 
134319 AA129547 
103807 AA133016 
119159 AA149507 
129863 AA151005 
103850 AA187101 
103855 AA195179_3 
322026 AA203138 
135300 AA203645 
103861 AA206236 
130634 AA227621 



65 {Oelegans] 



447735 AA248283 
103909 AA249611 
131236 AA282640 
134060 AA287199 
129013 AA313990 
129435 AA314256 
103988 AA314389 
104000 AA324364 
425284 AA329211_s 
128629 AA399187 
133281 AA421079 



BE218319 Hs.5807 GTPase Rab14 

AA306325 Hs.431 1 SUMO-1 activating enzyme subunlt 2 

BE264633 Hs. 143638 WD repeat domain 4 

W17064 Hs.332848 SW1/SNF related, matrix associated, actin dependent regulator of chromatin, subfam9y e, 

BE274312 H&214783 Homo sapiens cONA FU14041 fis, done HEMBA1005780 

W20296 Hs.288178 Homo sapiens cONAFU 11 968 fis, clone HEMBB1 001 133 

AA403045 Hs.6906 Homo sapiens cDNA: FU23197 fis, clone REC00917 

AA085696 Hs. 169600 KIAA0826 protein 

AI920783 Hs.191435 ESTs 

BE244667 Hs.296155 CGM00 protein 

Hs.180145 HSPC030 protein 

Hs.101077 ESTs, Weakly similar to T22363 hypothetical protein F47G9.4 - CaernrhabdHls etegans 

Hs.9788 hypothetical protein MGC1 0924 similar to Nedd4 WW-blndlng protein 5 

Hs. 55682 eukaryotic translation initiation factor 3, subunlt 7 (zeta, 66/67kD) 

Hs.30670 hepatocellular cardnorrcnissocfated antigen 66 

Hs.192619 WAA1600 protein 

Hs.75653 (urrarate hydratase 

Hs.103832 similar to yeast UpO, variant B 

Hs. 15020 homotog of mouse quaking OKI (KH domain RNA binding protein) 

Hs. 129872 sperm associated antigen 9 

Hs.213194 hypoftetJcal protein MGC10895 

Hs.302267 hypothetical protein FU10330 

Hs.283675 NPD009 protein 

Hs.278626 Arg/AbHnteracfJng protein ArgBP2 

Hs.4944 hypothetical protein RJ12783 

Hs.127824 ESTs, Weakly similar to T28770 hypothetical protein W03O2.1 

Hs.6127 Homo sapiens cONA: RJ23020 fis, clone LNG00943 

Hs.47438 SH3 domain binding glutamic acid-rich protein 

Hs.24594 ubiquitinatbn factor E4B (homologous to yeast UFD2) 

Hs.78671 mesoderm development candidate 2 

AA371156 Hs.107942 DKFZP564M1 12 protein 

AF151852 Hs.111449 CGt-94 protein 

AA314389 Hs.42500 ADP-ribosylation factor-like 5 

A1146527 Hs.80475 polymerase (RNA) U (DNA directed) polypeptide J (13.3kD) 

AF155568 Hs, 155489 NS1 -associated protein 1 

AL096748 Hs. 102708 DKFZP434A043 proteh 

AK001601 Hs.69594 Wgh-mobOity group 20A 



A1219323 

AW372569 

AW970843 

AF244135 

AA164842 

BE304999 

AW958264 

AF142419 

BE379765 

AA167101 

W02363 

AW024973 

AA142922 

AA206236 

AI769067 

AA775268 
AA249611 
AF043117 
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104104 AA422029 
[H .sapiens} 
108154 AA425230 
132091 AA447052 
135073 AA452000 
131367 AA456687 
129593 AA487015_3 
135266 AB002326 
133505 C01527 
132064 C01714 
134393 C01811J 
131427 C0235^s 
133435 C02375 
104262 C14448 
134827 D16610 
130443 025216 
131742 031352 
132837 D58024_s 
130377 080897 
104334 082614 
134593 D87845 
134731 089377J 
129913 H06583 
131670 H40732 
104394 H46617 
104402 H56731 
129781 H75570 
129077 H78886 
104417 H81241 
134927 L36531 
129280 M63154 
134498 M63180 
104460 M91504 
104486 N56191 
131248 M78483 
129214 N79266 
130017 R14652 
104530 R20459 
104534 R22303 



AA422029 Hs.143640 ESTs, Weakly slmSar to hyperpoIartzatoiwcOvaled cyclic nudeoflde-gated channel hHCN2 

NMJ305754 Hs^20689 Ras<3TPas9^d)vatlng protein SH3^kOTain-btaaTng protein 

AW954243 Ha.170218 KIAA0251 protein 

W55956 Hs.94030 Homo sapiens mRNA; cONA 0KF2p586E1624 (from done DKFZp586E1624) 

AI750575 Hs.173933 nuctear factor l/A 

AI338247 Hs.98314 Homo sapiens mRNA; cOMA DKFZp586L0120 (from done DKFZp586U)120) 

R41179 Hs.97393 WAA0328 protein 

AI630124 H&324504 Homo sapiens mRNA; cONA DKFZp586J0720 (from done DKFZp586JO720) 

AA121098 Hs3838 serorrOnducible kinase 

W52642 Hs.8261 hypothetical protein FU22393 

AF151879 H*l26706 CGM21 protein 

A 1929357 Hs.323966 Homo sapiens done H63 unknown mRNA 

C14448 Ha332338 EST 

BE314037 Hs.89866 ooprocoiphyrtnogen oxidase (coproporphyria, haro"erapcrphyrta) 

025216 Hs.155650 K1AAO0 14 gene product 

AA981420 H&31433 ESTs 

AA370362 Hs.57958 EGF-W^trophlDn-relatBd protein 

NM.014909 Hs.155182 WAA1036 protein 

DS2614 Hs.78771 phosphoglycerate kinase 1 

NMJH0437 Hs.234392 pfateietscfivatina factor acetylhydrofese 2 (40kD) 

089377 Hs.89404 msh (DrosophEa) homeo box homotog 2 

NM_001310 Hs.13313 cAMP responsive element binding protein-tike 2 

H03514 Hs.10130 ESTs 

AA129551 Hs.172129 Homo sapiens cONA: FU21409 Gs, done COL03924 

H56731 Hs.132956 ESTs 

AA306090 Hs.124707 ESTs 

N74724 Hs.108479 ESTs 

AI819448 Hs.320861 KruppeWike factor 8 

L36531 Hs.91296 Integrin, alpha 8 

M63154 Hs.1 10014 gastric intrinsic factor (vitamin B synthesis) 

AW246273 Hs.84131 thieonyHRNA synthetase 

AW955705 Hs.62604 Homo sapiens, dons IMAGE:4299322, mRNA, partial cds 

N56191 Hs.106511 protocadhertn 17 

AI038989 Hs.332633 Barde^Bfedl syndrome 2 

AL044335 Hs.1 09526 zinc finger protein 198 

AKD00096 Hs.143198 inhibitor of growth family, member 3 

AK001676 Hs.12457 hypothetical protein FU10814 

R22303 gb.Yh26b09Jl Soares placenta Nb2HP Homo sapiens cONA done IMAGE: 130841 5, mRNA 

H&222362 ESTs,WeaMysImilartop40iH5aplen3] 

Hs.265327 hypothetical protein OKFZp761l141 

Hs.5672 hypoOieUcal protein AF140225 

Hs.101490 ESTs 

Hs.278428 progestin induced protein 

Hs.19280 cysteine-nch motor neuron 1 

Hs.151251 ESTs 

Hs.14846 Homo sapiens mRNA; cONA DKFZp564O016 (from clone OKFZp564O016) 

Hs 30098 ESTs 

Hs36250 ESTs, Weakly similar to 138022 hypothetical protein [H .sapiens! 

Hs.278585 ESTs 

Hs.10031 KIAA0955 protein 

gb2e97d1U1 Soares JetaLhearLNbHHl9W Homo sapiens cONA done tMAGE:366933 J 



104544 R33779 
133328 R36553 
104567 R64534 
128562 R66475 
129575 R70621 
130776 R79356 
104599 R84933 
104660 RCJW007160 
104667 RC_AA007234_s AI239923 
104718 RCJ\A018409 AI143020 
104764 RC_AA025351 AI039243 

104786 RCJWJ27168 AA027167 

104787 RCJWQ27317 AA027317 
similar to contains Afu repetitive element mRNA sequence. 



AI091173 

AW452738 

AA040620 

AA923382 

F08282 

AF167706 

AW815036 

BE298665 



134079 RCJW29423 AK001751 

104804 RC.jAA031357 AI858702 

104865 RCLAA045136 T79340 

130828 RC_MQ53400 AW631469 

104907 RC_AA055829 AA055829 
WARNING ENTRY ^.sapiens] 

104943 RCJWJ65217 ATO72873 

105013 RCJVA1 16054 H63789 

105024 RC.AA126311 AA126311 

132592 RC.AA129390 AW803564 

105038 RCJW130273 AW503733 

105077 RCJVA142919 W55946 

105096 RC_AA1 50205 AU042506 

129215 RCJVA176867 AB040930 

105169 RCJW80321 BE245294 

132796 RCJW80467 NMJJQ6283 

130401 RCJW187634 BE396283 

105200 RCJVA195399 AA328102 

130114 RCJ\A234717 AA233393 

105330 RC_AA234743 AW338625 

105337 RCJ\A234957 AJ468789 

129385 RCJ\A235604 AA172106 



Hs.171835 hypothetical protein FU10889 

H&31803 ESTs, Weakly similar to N-WASP [Haptens] 

Hs.22575 B-cefl CUJrymphoma 6, member B (zinc finger protein) 

Hs.203213 ESTs 

Hs.196701 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Hs.1 14218 frizzled {DrosopMa) homotog 6 

Hs.296288 ESTs, Weakly similar to K1AA0838 protein [Hxapians] 

Hs.9879 ESTs 

Hs.288850 Homo sapiens cONA: FU22528 fls, done HRC12825 

Hs.9414 KIAA1488 protein 

Hs.234863 Homo sapiens cONA FU12Q82 Rs, done HEMB81002492 

Hs^1599 Kruppei-to factor 7 (ubiquanus) 

Hs.126085 KIAA1497 protein 

Hs.180789 S164 protein 

Hs.173159 transforming, acidic colled-coH containing protein 1 

Hs.173987 eukaryoflc translation initiation factor 3, subunit 1 (alpha, 35kO) 

Hs.24641 cytoskeleton associated protein 2 

Hs.1 4992 hypothetical protein FU11 151 

Hs.22120 ESTs 

Hs.23200 myotubuiarin related protein 1 

Hs.1 10950 Rag C protein 



86 



WO 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



105376 RC.AA236559 
105397 RC_AA242868 
131962 RCLAA251776 
131991 RCJW251909 
128658 RC_AA252872_s 
105489 RCJW256157 
105508 RCJVA25668Q 
105539 RC_AA258873 
135172 RCJW262727 
131569 RCJW281451 
132542 RCJU281545 
105643 RCJ\A282069 
105659 RC_AA283044 
105666 RCJ\A283930 
105674 RCJVA284755 
105709 RCJ\A291268 
105722 RCJ\A291927 
105765 RCJW343514 
115951 RCJU398109 
105962 RC_AA405737 
105985 RCJM406610 
gteXQ2067 

106008 RCJW411465 
131216 RCJW416886 
134222 RCJ\A424013 
113689 RC.AA424148 
106141 RCJVA424558 
130839 RCJM2496U 
106157 RCJW425367 
130777 RCJVA425921 
receptor 

130561 RCJ*A42622Q 
106196 RC_AA427735 
WARNING 

131878 RCJW430673 
133200 RCJVA432248 
106302 RC_AA435896 
106328 RCJ\A436705 
450534 RCJVA446561 
106423 RC_AA448238 
133442 RC _AA44868fi 
439608 RC.AA449756 
106477 RCJ\A45Q303 
106503 RCJVA452411 
446999 RCJW454566 
106543 RCJW454667 
130010 RCJW456437 
106589 RCJW56646 
106593 RCJW456826 
106596 RCJW456981 
CONTAMINATION 
134655 RCJ\A458959 
member 1 

106636 RCJW459950 
106654 RCJW460449 
131353 RCJ\A463910 
106707 RCLAA464603 
131710 RCJW464606 
106717 RCJW465093 
131775 RCJ\A465592 
106747 RCJVA476473 
106773 RC_AA478109 
106781 RCJW478474 
106817 RC_AA480889 
106846 RCJVU85223 
106848 RCJW485254 
106856 RC_AA486183 
418699 RCJW495936 
WARNING 

107001 RC.AA598589 
130638 RC_AA59B831_f 
107054 RCJVA600150 
107059 RCJ\A608545 
107080 RC_AA609210 
107115 RCJ\A610108 
107130 RCJW620582 



AW994032 Hs*768 hypolhefical|XDtehaJ10849 

AA814807 Ha.7395 hypotiefcai protein FU23182 

AK000046 Hs^67448 hypometlcal protein FU20039 

AF053306 Hs.36708 buiwhg uninhibited by berdmtdazoles 1 (yeast homolog), beta 

BE397354 Hs.324830 dlptheria toxin resistance protein required for diphthamide biosynthesis (SaccharomycesHike 2 

AA256157 Hs.24115 Homo sapiens cONA FU14178 fls, clone NT2RP2003339 

AA173942 HsJ326416 Homo sapiens mRNA; cDNA DKFZp564H1916 (from clone DKFZp564H1916) 

AB040884 Hs, 109694 WAA1451 protein 

AB028956 Hs.12144 K1AA1033 protein 

AL389951 Hs.271623 nudeoportnSflkD 

AL137751 Hs.263671 Homo sapiens mRNA; cONA DKFZp434J0812 (trom done DKFZp434IQ812)i partial cds 

BE621719 Hs.173802 K1AA0603 gene product 

AA283044 Hs.25625 hypotheSca) protein FU11323 

AA428234 Hs .34906 ESTs, WeaWy similar to T17210 hypothetical protein DKFZp434N041.1 [H.saplens] 

AI609530 Ks.279789 hlstone deaoetytase 3 

A1928962 Hs^6761 DKFZP586L0724 protein 

A1922821 Hs^2433 ESTs 

AA299688 Ks.24183 ESTs 

BE546245 Hs.301048 sed3-fixe protein 

AW8&0358 Hs.339808 hypothetical protein aj 10120 

AA406610 gb:zv15b10.s1 Soarcs_NhHMPu_S1 Homo sapiens cONA done IMAGE753691 J similar to 

AB033888 Hs.8619 SRY (sex determining region Y)-box 18 

AI815486 Hs^43901 Homo sapiens cONA FU20738 fis, clone HEP08257 

AW855861 Hs.8025 Homo sapiens done 23767 and 23782 mRNA sequences 

AB037850 Hs, 16621 DKFZP434I116 protein 

AF031463 Hs.9302 phosducWDce 

AB01 1 169 Hs.20141 similar to S. cerevlslae SSM4 

W37943 Hs.34892 K1AA1323 protein 

AW135049 Hs.285418 Hornosapiers^FU1C643fis.ctoneNT2RP200^ 

AB0 11095 Hs.16032 KIAA0523 protein 

AA525993 Hs. 173699 ESTs, WeaWy similar to ALU1 JUJMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

AA083764 Hs.6101 hypothefical protein MGC3178 

AB037715 Hs.183639 hypothetical protein FU10210 

AA396859 Hs. 18397 hypothetical protein FU23221 

AL079559 Hs.28020 K1AA0766 gene product 

AI570189 Hs.25132 KIAA0470 gene product 

AB020722 Hs.16714 Rho guanine exchange factor (GEF) 15 

AL137663 Hs.7378 Homo sapiens mRNA; cONA DKFZp434G227 (from done DKFZp434G227) 

AW864696 Hs.301732 hypothetical protein MGC5306 

R23324 Hs.41693 DnaJ (Hsp40) homolog, subfamily B, member 4 

AB033042 Hs.29679 oofactor required for Spl transcriptional activation, subunit 3 (130kD) 

AA151520 Hs.334822 hypothetical protein MGC4485 

AA676939 Hs.69285 neuropOin 1 

AA301116 Hs. 142838 nucleolar phosohoprateln Nopp34 

AK000933 Hs.28661 HornosapienscDNAFU10(l71 fe,cb^ 

AW296451 Hs.24605 ESTs 

AA452379 Hs.293552 ESTs, Moderately similar to ALU7JWMAN ALU SUBFAMILY SQ SEQUENCE 

AF265208 Hs.123090 SWVSNF related, matrix associated, acfln dependent regulator of chromatin, subfamily f, 

AW956037 Hs2B6 ■ r&osomal protein L4 

AW075485 Hs.286049 phosphosertne aminotransferase 

AW754182 gfcR<2-CT0321-131199-011-c01 CT0321 Homo sapiens cONA, mRNA sequenoe 

AK000566 Hs.98135 hypothetical protein FU20559 

NM.015368 Hs.30985 pannexinl 

AA600357 Hs.239489 TJA1 cytotoxic granule-associated RNA-binding protein 

AB014548 Hs.31921 WAA0648 protein 

NM.007118 Hs.171957 triple functional domah (PTPRF Interacting) 

AA478109 Hs.188833 ESTs 

AA330310 Hs.24181 ESTs 

D61216 Hs.18672 ESTs 

AB037744 Hs.34892 WAA1323 protein 

AA449014 Hs. 121 025 chromosome 1 1 open reading frame 5 

W56353 Hs.285123 Homo sapiens mRNA full length insert cONA done EUROIMAGE 2005779 

BE539639 Hs.173030 ESTs, Weakly similar to ALU8.HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 

At926520 H&31016 putative DNA binding protein 

AW021276 Hs.17121 ESTs 

AHJ76459 Hs.15978 WAA1272 protein 

BE614410 Hs.23044 RAD51 (S. cerevisbe) homotog (E coll RecA homolog) 

AL122043 Hs.19221 hypothetica) protein DKFZp566G1424 

BE379623 Hs.27693 peptidytprolyi isomerase (cydophiHnHike 1 

AB033106 Hs.12913 KIAA1280 protein 
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107156 RCJ\A621239 
107174 RCJW621714 
130621 RC_M621718 
107190 RCJ)19673 
132626 RC_D25755_s 
107217 RCJ)51095 
131610 RCJD60272J 
129604 T08879 
107295 T34527 
(GalNAc-T1) 
107299 T40327.3 

107315 T62771.8 

107316 T63174_3 * 
107328 T63444 
107334 T93641 
134715 U48263 
128636 U490S5 
129938 079300 
107375 U88573 
130074 U93867 
107387 W01094 
132036 W01566 
107426 W26853 
113857 W27179 
135388 W27965 
130419 W36280_8 
107469 W47063 
132616 W79060 
107506 W88550 
132358 X60486 
107522 X7893U 
125827 Z14077_s 
107582 RC_M002147 
107609 RCJU004711 
107661 RCJVA01Q383 
107714 RCJWHS761 
107775 RCJVA018772 
107832 RCJ\A021473_/ 



107859 RCJW024835 
124337 RCJ\A025858 
107914 RCjyA027229 
[C.elegans] 

107935 RCJW029428 
116262 RCJ\A035143 
131461 RC_AA035237 
108007 RC.M039347 
108029 RCJVA040740 
108040 RC_AA041551 
member 1 

108084 RC_AA045513 
108088 RCJW045745 
108168 RCJ\A055348 
130719 RC_AA056582_s 

108189 RCJWJ56697 

108190 RCJVA056746 
108203 RCJ\A057678 

108216 RCJ\A058681 

108217 RCJW058686 
108245 RCJWO 62840 
108277 RC_AAQ64859 
mRNA 

108280 RCJWJ65069 
108309 RCJVA069923 
133739 RCJW0799_s 
108340 RCJW070815 
108403 RCJ\A075374 
3", mRNA sequence. 
108427 R<LAA076382 
3', mRNA sequence. 
108435 RCLAA078787 
108439 RCJ\A076986 
3', mRNA sequence. 
108465 RCJW9393 
108469 RCJ\A079487 
sequence 



AA137043 Hs.9663 programmed cell deaft 6-mteractlng protein 

BE 122762 Hs.25338 ESTs 

AW513087 Hs. 16803 LUC7 (S. cerevte!ae)-Cke 

AA836401 Hs3103 ESTs 

AW504732 Hs.21275 hypothetical protein FU11011 

AL080235 Hs.35861 OKFZPS86E1621 protein 

AA357879 Hs^9423 scavenger receptor with Wype lectin 

AF088886 Hs.11590 cathepslnF 

AA186629 Hs.80120 UDP-f4-acetyt^ha-D$aJ^^ 

BE277457 Hs.30661 hypoffieflca) protein MGC4606 

AA316241 Hs.90691 nudeophosroirimudeoplasmln 3 

T63174 his. 193700 Homo sapiens mRNA; cDNA DKFZp586l0324 (from clone DKFZp586l0324) 

AW959891 Hs.76591 KIAA0887 protein 

T93597 Hs.187429 ESTs 

U48263 Hs.89040 prepronodceptin 

U49065 Hs.102865 interteukin 1 receptor-like 2 

AW003668 Hs.135587 Human done 23629 mRNA sequence 

BE011645 Hs.251064 rOgh-mobHity group {nonhistone chromosomal) protein 14 

ALQ38596 Hs.250745 polymerase (RNA) 111 (DNA directed) (62kD) 

D86983 Hs. 118893 Melanoma associated gene 

AL157433 Hs.37708 hypothetical protein DKFZp434£2220 

W26853 Hs.291003 hypothetical protein MGC4707 

AW243158 Hs.5297 DKFZP564A2416 protein 

W27965 Hs.99865 eptmorphin 

AF037448 Hs.155489 NS1-assodated protein 1 

W47063 Hs.94668 ESTs 

BE262677 Hs.283558 hypothetical protein PR01855 

AB028981 Hs.8021 KIAA1058 protein 

NM_003542 Hs.46423 H4 histone family, member G 

X78931 Hs.99971 zinc finger protein 272 

NM_003403 Hs.97496 YY1 transcription factor 

AA002147 Hs.59952 EST 

R75654 Hs.164797 hypothetical protein FU13693 

AA010383 Hs.60389 ESTs 

AA015761 Hs.60642 ESTs 

AW008846 Hs.60857 ESTs 

AA021473 gbae66c11.8l Scares refina N2b4HR Homo sapiens cDNA done IMAGE363956 3\ mRNA 

AW732573 Hs.47584 potassium voltage-gated channel delayed-rectifler, subfamily S, member 3 

N23541 Hs.281561 Homo sapiens cDNA: FU23582 fc» clone LNG13759 

AA027229 Hs.61329 ESTs, WeaWy similar to T1 6370 hypothetical protein F45E12.5 - Caenorhabditis eiegans 

AA029428 Hs.61555 ESTs 

A1936442 Hs.59838 hypothetical protein FL) 10808 

AA992841 Hs.27263 K1AA1453 protein 

AA039347 Hs.61916 EST 

AA040740 Hs.62007 ESTs. 

AL121031 Hs.159971 SWl/SNF related, matrix associated, actn dependent regulator of chromatin, subfamily b, 

AA058944 Hs.1 16602 Homo sapiens, done IMAGE:41540Q8, mRNA, partial cds 

AA045745 Hs.62886 ESTs 

A1453137 Hs.63176 ESTs 

AA679262 Hs. 14235 hypoffieflca/ protein RJ20008; K1AA1839 protein 

AW376061 Hs.63335 ESTs, Moderately sbnOar to A46010 X-finked retinopaSiy protein (Rsapiens] 

AA056746 Hs.63338 EST 

AW847814 HS.289QQ5 Homo sapiens cDNA: FU21532 Ss, done COL06049 

AA524743 Hs.44883 ESTs 

AA058686 Hs.62588 ESTs 

6E410285 H&89545 proteasome (prosome, macropaln) subunlt, beta type, 4 

AA064859 gb:zm50f03.s1 Stratagene foroblast (937212) Homo sapiens cDNA done IMAGE:529085 3*. 

AA065069 gb:zm12e1 1 jl Stratagene pancreas (937208) Homo sapiens cDNA clone 3'. mRNA sequence 

AA06981 8 gbzm67e03 1\ Stratagene neuroepHheOum (937231) Homo sapiens cDNA clone 9 similar to 

6E536554 Hs .278270 unactive progesterone receptor, 23 kD 

AA069820 Hs.1 80909 peraxfredoxb 1 

AA075374 gb:zm87a01.s1 Stratagene ovarten cancer (937219) Honw sapiens cDNA done IMAGE^44872 

AA076382 gb:zm91g08.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done IMAGE:545342 

T82427 Hs.194101 Homo sapiens cDNA:FU20869fe l done ADKA02377 

AA078986 gb:zm92h01 .si Stratagene ovarian cancer (937219) Homo sapiens cDNA clone IMAGE:545425 

AA079393 Hs.3462 cytochrome c oxidase subuntt Vile 

AA079487 gfczm97f08,s1 Stratagene colon HT29 (937221) Homo sapiens cDNA done 3', mRNA 
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108500 RCJW083207 

108501 RC.M083256 
gb:M333Q8 

108533 RCJVA084415 
mRNA 

1085S2 RCM085274 
gb:X15341 

108589 RC_MQ88678 

130890 RCJW 00925 

134585 RC.AA1Q1255 

130385 RCJ\A126474 

108749 RCJW127017 

108807 RCJ\A129968 

108808 RCJW30240 
108833 RCJW131866 
107290 RC.AA132039 
108846 RCJW132983 
108857 RCJW133250 
131474 RCJW33563_s 
108894 RCJ\A135941 
108941 RCJW148650 
IMAGE:5672023'. 
108968 RCJ\A151110 
108996 RCJ\A155754 
109001 RCLAA156125 
131183 RCJW56289 
109019 RCJW156997 

109022 RCJW57291 

109023 RCJW57293 
109068 RCJW164293J 
109072 RCJW164676 
129021 RCJW67375 
130346 RCJW67550 
109146 RC.AA176589 
109172 RCJW180448 
131080 RCJW187144_s 
129208 RCJW89170J 
109222 RCJVA192757 
109300 RCJ\A205650 
109481 RCJW233342 
109485 RCJVA233472 
109516 RC_AA234110 
109537 RC.D80981 
109556 RC_F01660 

109577 RC_F02206 

109578 RC_F02208 
109595 RCF02544 
109625 RC.F03918 
131983 RC_F04258jb 
109648 RC.F04600 
109671 RC_F08998 
109699 RC./09605 
109820 RCJF11115 
109933 RC_H06371 
110014 RCJU0995 
110039 RQ_H11938 
110099 RCJH6568 
110107 RCJi16772 
110155 RC_H18951 
110197 RC_H20859 
110223 RC_H23747 
110306 RCJH38Q87 
110335 RC_H40331 
110342 RCLH40567 
110395 RC_H46956 
110511 RC_H56640J 
110523 RCLH57154 
110715 RC.H96712 
110754 RC_N20814 
130132 RCJI25249 
131135 RCJB7100 
134263 RCJ439616 
110938 RCJM69B2 
110983 RC.N51957 
115062 RCJ452271 
111081 RCJ459435 



AA083207 
AA083256 

AA084415 

AA100796 

AI732404 

AI907537 

D14041 

AW067800 

AA127017 

A1652236 

AA045088 

AF188527 

W27740 

AL1 17452 

AKD01468 

L46353 

AK001431 

AA148650 

A1304870 

AW995610 

AI056548 

AI611807 

AA156755 

AA157291 

AA157293 

AA164293 

AJ732585 

AL044675 

H05769 

AA176589 

AA180448 

NM.001955 

AI587376 

AA192833 

AA418276 

AA878923 

BE619092 

AW71639 



H&68270 EST 



AI925294 

FO2206 

F02208 

AA078629 

H29490 

AF1 19565 

H17800 

R59210 

H18013 

AW016809 

R52417 

AL109666 

H11938 

R44557 

AW151660 

A1559628 

AW090386 

H19836 

H38087 

H65490 

H40961 

AA025116 

H56640 

AI040384 

H96712 

AW302200 

U55936 

NR.016569 

AW973443 

N48982 

NM.015367 

AA253314 

AJ146349 



Hs.68846 

Hs.76698 

Hs.278573 

Hs.155223 

Ha.71052 

Hs.49376 

Hs.62738 

Hs.61661 

Hs.323780 

Hs.44155 

Hs.62180 

Hs.2726 

Hs£105 



Hs.188680 

Hs.332436 

Hs.72116 

H5285107 

Hs.72150 

Hs.21479 

Hs.72168 

Hs.72545 

Hs.22394 

Hs.173081 

Hs.188757 

Hs.142078 

Hs.144300 

Hs.2271 

Hs.109441 

Hs^33512 

Hs.170142 



gb2n08g1Zs1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone 3' similar to 
gbzn08g09.s1 Stratagene HNT neuron (937233) Homo sapiens cDNA done IMAGE546668 3", 
gb:zm26c06.s1 Stratagene pancreas (937208) Homo sapiens cOMA done 3* similar to 
ESTs 

stress-associated endoplasmic reticulum protein 1; rtbosome associated membrane protein 4 
H-2K binding factor-2 



Hs28465 

Hs.71913 

Hs.34898 

Hs.87385 

Hs.296639 

Hsi72l4 

Hs.27301 

Hs.22697 

Hs.184011 

Hs.7154 

Hs.26634 

Hs.167483 

Hs.323795 

Hs.20945 

Hs.7242 

Hs-21907 

Hs.23748 

Hs31444 

Hs.93522 

Hs.1 12278 

Hs31697 

Hs.105509 

Hs.18845 

Hs.33008 

Hs^3333 

Hs221460 

Hs.19102 

Hs.269029 

Hs.6336 

Hs.184376 

Hs.267182 

Hs.8086 

Hs.38034 

Hs.10267 

Hs.154103 

Hs.271614 



hypoflietical protein FU20644 
ESTs 

ESTs, Weakly similar to AF 174605 1 F-box protein Fbx25 [H.saplens] 
ESTs 

DKFZP586G1517 protein 

anlfiai (DrosophSa Scraps homobg), adin binding protein 
high-mobility group (nonhistone chromosomal) protein Isoform l-C 
hypothetical protein FU10569 

gbzo09e06.s1 Stratagene neuroepitheSum NT2RAM1 937234 Homo sapiens cONA done 

ESTs 
EST 

hypothetical protein FU20992 similar to hedgehog-interacting protein 

hypothetical protein RJ13397 

ESTs 

ubinuctein 1 

ESTs 

ESTs 

hypothetical protein FU10893 
KIAA0530 protein 

Homo sapiens, done MGC'5564, mRNA complete cds 

EST 

EST 

endoihetlnl 
MSTP033 protein 
similar to rat rnyomegaBn 
ESTs 

hypothetical protein FLJ21 01 6 

Homo sapiens cONA: FU21869 (Is, clone HEP02442 

ESTs 

ESTs 

ESTs 

Homo sapiens potassium channel subunit (HERG-3) mRNA, complete cds 

ESTs 

ESTs 

ESTs 

pyrophosphatase (Inorganic) 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens done 24993 mRNA sequence 

Homo sapiens mRNA full length Insert cONA done EUROIMAGE 35907 

hlstone acetyftransferase 

ESTs 

ESTs 

Homo sapiens mRNA for KIAA1647 protein, partial cds 

arresfin, beta 1 

ESTs 

CTL2gene 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to organic anion transporter 1 [H.saplens] 
ESTs 

WAA0672 gene product 
synaptosomai-assodated protein, 231<D 
TBX3-iso proteh 

RNA (guanme-7-) rnethyttransferase 

Homo sapiens cONA FU12924 (is. done NT2RP2004709 

MH1 protein 

UM protein (similar to rat protein kinase C-binding enigma) 
CGM12 protein 
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111128 RCJW4139 

135244 RCLN66981 

111216 RCJI68840 

437562 RCJI69352 

131002 RC.N95226 

111399 RCJIQ0138 

111514 RCjR07998 
similar to 

130182 RC.R08929 

111574 RCLR10307 

111804 RC.R33354 

111831 RC_R36083 

129675 RC_R37938_f 

111904 RCJU9330 
sequence 

133868 RC_R40816_s 

112033 RCJM316^s 

130987 RCJM5698 

112300 RCJ*54554 

112513 RCJ*68425 

112514 RC.R68568 
112522 RCJR68763 
112540 RC_R70467 
130346 RC_R73565 
129534" RCJU2640 
112597 RCJ*78376 
112732 RC.R92453 
131458 RCJ03865 
112888 RC_T03872 
131863 RCJ10072 
112911 RCJT10080 
132215 RC.T10132 
112931 RC.T15343 
112984 RC.T23457 
112998 RCJ23555 
133376 RCJ23870 
113026 RC_T23948 
113070 RCJT33464 
128970 RCJT34413 
113074 RC^T34611 
113095 RC.T40920 
113179 RCJ55182 
113337 RC_T77453 
113421 RCJT84039 
113454 RCJT86458 
113481 RCJB7693 
131441 RQJ89350_8 
113557 RCJ90945 
113559 RC„T90387 
113589 RC_T91863 
113591 RC_T91881 
113619 RC_T93783.s 
113683 RCT96687 
113692 RCJ96944 
113702 RCJT97307 
mRNA 

113717 RC_T97764 

113824 RCJW48817 

113840 RCJV58343 

113844 RC.W59949 
PROTEIN TC10 

113902 RC.W74644 

113904 RC.W74761 

113905 RCJV74802 

113931 RC_W812Q5 

113932 RCJV81237 
131965 RCJV90146J 
114035 RC_W92798 
114106 RC_Z38412 
133593 RC_Z38709 
114161 RC^Z38904 
424949 RC_Z39103 
129059 RCJ39930J 
128937 RCLZ39939 
WARNING 

130983 RC.Z40012J 



AW505364 Hs,19074 LATS {large tumor suppressor, Drosophila} homolog 2 

A1834273 Hs.9711 novel protein 

AW139408 Ha 152940 ESTs 

AB001636 Hs.5683 DEADJH (Asp-GJu-Ala-Asp/HIs) box polypeptide 15 

AL050295 Hs.22039 WAA0758 protein 

AW270776 Hs.18857 ESTs 

R07998 gb:yf16g11.s1 Soares fey Over spleen 1NRS Homo sapiens cDNA clone 1MAGE:127076 3 

BE267033 Hs.192853 ubiquldrwonjuga^g enzyme E2G 2 (homologous to yeast UBC7) 

A1024145 Hs.188526 ESTs 

AA482478 Hs.181785 ESTs 

R36095 Hs.268695 ESTs 

NMJJ15556 Hs.172180 WAA0440 protein 

Z41572 gb:HSCZYB122 normalized Infant brain cDNA Homo sapiens cONA clone c-zyb12, mRNA 

AB012193 Hs.183874 cuffin4A 

R49031 Hs.22627 ESTs 

BE613269 Hs.21893 hypomefical protein DKFZp761N0624 

H24334 Hs.26125 ESTs 

R66425 Hs.13809 hypothelica) protein FU10648 

R68568 Hs.183373 src homology 3 domain-containing protein HIP-55 

R68857 Hs.265499 ESTs 

R69751 gtoyMOalOsI Soares placenta Nb2HP Homo sapiens cDNA clone 3', mRNA sequence 

H05769 Hs.188757 Homo sapiens, done MGC:5564, mRNA, complete cds 

AK002126 Hs.11260 rr/potheffcaJ protein FU 11 264 

R78376 Hs.29733 EST 

R92453 Hs.34590 ESTs 

BE297567 Hs.27047 hypoftefcal proteh RJ20392 

AW195317 Hs.107716 hypofteOcal protein FU22344 

AI656378 Hs.33461 ESTs 

AW732747 Hs.13493 like mouse brain protein E46 

AL035703 Hs.4236 WAA0478 gene product 

T02966 Hs.167428 ESTs 

T16971 Hs.289014 ESTs, Weakly slmflar to A43932 mucin 2 precursor, intestinal [H^apiens] 

H11257 Hs.22968 Homo sapiens done IMAGE:451939, mRNA sequence 

BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha 

AA376654 Hs.183684 eukaryotic (ranslation Inflation factor 4 gamma, 2 

AB032977 Hs.6298 K1AA1151 protein 

A1375672 Hs.165028 ESTs 

AK001335 Hs.31137 protein tyrosine phosphatase, receptor type, E 

AA828380 Hs.126733 ESTs 

BE622021 Hs.1 52571 ESTs, Highly similar to IGF-il mRNA-bindlng protein 2 [Ksapiens] 

T77453 Hs.302234 ESTs 

AJ769400 Hs.189729 ESTs 

AI022166 Hs.16168 ESTs 

T87693 Hs.204327 EST 

AA302862 Hs.90063 neurocaldn delta 

H66470 Hs.16004 ESTs 

T79763 Hs.14514 ESTs 

AI078554 Hs.15682 ESTs 

T91881 Hs.200597 KIAA0563 gene product 

R08665 Hs.17244 hypothetical protein FU 13605 

AB035335 Hs. 144519 T-cellleukemia/lymphoma6 

AL360143 Hs,17936 DKFZP434H132 protein 

T97307 gb:ye53h05.s1 Soares fetal fiver spleen 1NFLS Homo sapiens cDNA done IMAGE:121497 3', 

T99513 Hs.187447 ESTs 

A1631964 Hs.34447 ESTs 

R72137 Hs.7949 DKFZP58682420 protein 

AI369275 Hs.243010 Homo sapiens cDNA FU14445 fts, done HEMBB1001294, highly similar to GTP-BINDING 

AA340111 Hs.100009 acyPCoenzyme A oxidase 1 , paimltoyl 

AF125044 Hs.19196 uWquitin-con]ugatlng enzyme HBUCE1 

R81733 H&33106 ESTs 

BE255499 Hs.3496 hypothetical protein MGC15749 

AA256444 Hs.126485 hypothefical protein FU 12604; KIAA1692 protein 

W79283 Hs.35962 ESTs 

W92798 Hs.269181 ESTs 

AW602528 gb:RC5-8T0562-2601 00-01 1 -A02 BT0562 Homo sapiens cONA, mRNA sequence 

A1416988 Ks.238272 Inositol 1 ,4,54riphosphate receptor, type 2 

BE548222 Hs.299883 hypothetical protein FU23399 

AF052212 Hs.153934 core-binding factor, runt domain, alpha subunlt 2; translocated to. 2 

AW069534 Hs.279583 CGW1 protein 

AA251380 Hs.10726 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

AM79813 Hs.278411 NCK-assodated protein 1 
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114277 RCJ40377_s A1052229 Ks.25373 
[Celegans] 

114304 RC_Z40820 AI934204 Hs.1 6129 

114364 RW416B0 AL117427 Hs.172778 

5 132900 RCJW005112 AA777749 Hi597B 

129034 RCJVA005432 AA481157 Hs.108110 

131881 RCJW0163 AW361018 Hs.3383 

452461 RCJU026356 N78223 Hs.108106 

114465 RCJ\A026901 BE621056 Hs.131731 

10 131376 RCJW036867 AKD01644 Hs^6156 

101567 RCJ\AQ44644 M33552 Hs.56729 

431555 RC.AAQ46426 AI815470 Hs.260024 

132944 RCJW054515 T98641 Hs.6127 

114618 RCJ\A084162 AW979261 Hs.291993 

15 130274 RC.AA085749 AA128376 Hs.153884 

110330 RCJW098874 A1286666 Hs.16621 

114648 RC.M101056 AA101056 
IMAG&548429 3' 

114658 RCJ\A102746 AA102383 Hs.249190 

20 132456 RC_AA114250_s AB011084 Hs.48924 

131319 RCJ\A126561_s NMJJ03155 Hs.25590 

132225 RCJW128980J AA128980 
!MAGE:5671643' 

132669 RCJ\A129757 W38586 Hs.293981 

25 114709 RCJW129921 AA397651 Hs.301959 

131973 RCJVA1 33331 AB018284 Hs.1 58688 

114750 RCJNA135958 AA887211 Hs.129467 

115714 RCJ\A136524_s T19228 Hs,172572 

114763 RCJVA147044 AA810755 Hs.88977 

30 114767 RC.AA148885 A1859865 Hs.154443 

114774 RCJ\A15Q043 AV656017 Hs.184325 

129388 RCJW151621 AA662477 Hs.1 10964 

129183 RC_AA155743 BE561624 Hs.273369 

128869 RC _AA1 56335 AA768242 Hs.80618 

35 130207 RCJ\A1 56336 AF0442O9 Hs.144904 

114798 RCJVA1S9181 AA159181 Hs.54900 

114800 RC_AA159825 219448 Hs. 131887 
[C.elegans] 

114828 RCJ\A234185 AA252937 Hs.283522 

40 114846 RC.AA234929 BE018682 * Hs. 166196 

114848 RCJ\A234935 BE614347 Hs.169615 

114902 RCJU236359 AW275480 Hs.39504 

132271 RCJ\A236466 AB030034 Hs.1 15175 

114907 RC^AA236535 N29390 Hs.13804 

45 135159 RCJ\A236935_s U43374 Hs.95631 

132204 RCJ\A236942 AA235827 Hs.42265 

114928 RCJ\A237018 AA237018 Hs.94869 

132481 RC.AA237025 W93378 Hs.49614 

114932 RCJU242751 AA971436 Hs.16218 

50 314162 RC.M242760 BE041820 Hs.38516 

131006 RCJU242763 AF064104 Hs.22116 

114935 RCJU242809 H23329 Hs.290880 
WARNING 

132454 RCJU243133 BE296227 Hs.250822 

55 437754 RCJU243495 R60366 Hs.5822 

114957 RCJ\A243706 AW170425 H3.87680 

114974 RCJ\A250848 AW966931 Hs.179662 

114977 RCJW250868 AW296978 Hs.87787 

114995 RG.AA251152 AA769266 Hs.193657 

60 115005 RC_AA251544_s AI760825 Hs.111339 

417177 RCJ\A251792 NM.004458 Hs.81452 

131889 RCJ\A252063 NMJJ02589 Hs.34073 

115026 RCJ\A252144 AA251972 Hs.188718 

115045 RCLM252524 AW014549 Hs.58373 

65 115068 RCJW253461 AW512260 Hs.87767 

133138 RCJW255522 AV657594 Hs.181161 
RECEPTOR, 

115114 RC.AA256468 AA527546 Hs.7527 

129584 RCJW256528 AV656017 Hs.184325 

70 115137 RCJ\A257976 AW968304 Hs.56156 

134312 RCJ\A258296 AB011151 Hs.334659 

115166 RO J AA258409 AF095727 Hs.287832 

115167 RCJ\A258421 AA749209 Hs.43728 
129807 RCJW262077 Y11192 Hs.5299 

75 115239 RCJ\AZ78650 BE251328 Hs.73291 

115243 RCJVA278766 AA806600 Hs.1 16665 



ESTs, Weakly similar to 720410 hypoffieflcal protein E02A10.2 - Caenorhabdifis elegans 
ESTs 

Homo sapiens mRNA; cDNA OKFZp566P013 (from clone DKFZp566P013) 

UM domain only 7 

OKFZP547E2110 protein 

upstream regulatory element binding protein 1 

transcription factor 

hypothetical protein FU 11099 

rrypofteficaJ protein FU10782 

lysosomal 

Cdc42 effector protein 3 

Homo sapiens cONA: FU23020 fis, done LNG00943 
ESTs 

ATP binding protein associated with cell differentiation 
DKFZP434H16 protein 

gb:zn25b03.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens cONA clone 

tumor necrosis factor receptor superfamlry, member 10a 
KIAA0512 gene product ALEX2 
stannjocaiom i 

gfczo09a1 U1 Stratagene neuroeplihellum NT2RAMI 937234 Homo sapiens cDNA done 

guanine nucleotide binding protein (G protein), gamma 3, linked 
proline synthetase co-transcribed (bacteria) homotog) 
WAA0741 gene product 
ESTs 

hypothetical protein FU20093 

hypothetical protein CU511E16.2 

mlnlchromosome maintenance deficient (S. cerevlsiae) 4 

CGl-76 protein 

hypothetical prater FU23471 

uncharacterized hematopoietic stem/progenitor ceils protein MDS027 

hypothetical protein 

nuclear receptor co-represser 1 

serologically defined colon cancer antigen 1 

ESTs, WeaMy similar to T24396 hypothetical protein T03F6.2 - Caenorhabdifis elegans 

Homo sapiens mRNA; cDNA DKFZp434J1912 (from done DKFZp434J1912) 

ATPase, Class I, type 8B, member 1 

hypothetical protein FU 20989 

hypothetical protein MGC4308 

sterile-alpha motif and leucine zipper contaWng kinase AZK 

hypothetical protein (1J462023.2 

Human normal keratinocyte mRNA 

ESTs 

ESTs 

ESTs 

KIAA0903 protein 

Homo sapiens, clone MGC:15887, mRNA, complete cos 
CDC14 (cell division cyde 14, S. cerevisiae) homotog B 

ESTs, Wealdy similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

serine/threonine kinase 15 

Homo sapiens cDNA: FU22120 fis, clone HEP18874 

ESTs 

nudeosome assembly protein 1-iike 1 

ESTs 

ESTs 

ESTs 

fatfy-ada\Coenzyme A Pgase, long-chain 4 

BH-protocadherln (brain-heart) 

ESTs 

ESTs 

ESTs 

Homo sapiens cONAfU 14643 fis, done NT2RP2001597, weakly similar to RYANODINE 

small fragment nuclease 
CGI-76 protein 
ESTs 

hypothetical proteki MGC14139 
myelin protein zero-like 1 
hypothetical protein . 

aldehyde dehydrogenase 5 family, member A1 (succinate-semiaJdenyde dehydrogenase) 
hypothetical protein FU10881 
KIAA1842 protein 
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100850 RCJ^A279667_s 

126884 RCJU280791 

115322 RCJVA28Q819 

133626 RCJVA280828 

115372 RCJ\A282195 

132825 RCJ\A2B3127ji 

130269 RCJW284694 

129192 RCJ\A291137 

452598 RCJU291708 
WARNING 

132131 RCJ\A293495 

115536 RC.AA347193 

132411 RCJVA398474J 

115575 RCJ\A398512 

115601 RC_AA400277 
WARNING 

103928 RCJW4O0896 

125819 RCJW404494 

115683 RCJW410345 

115715 RCJU416733 

132952 RCJVW25154 

115819 RC_AA426573 

132525 RCJ\A431418 

115895 RCJ\A436182 

132333 RCJW437099 

115962 RCJW46585 

115967 RCJ\A446887 

115974 RCJW47224 

115985 RCJK447709 

129254 RCJ\A453624 

133071 RCJ\A455044 

116095 RC_AA456045 

122691 RC_ J AA460454_8 

116210 RCJK476494 

116213 RCJVA476738 

134585 RCJVA481422 

134790 RCJW482269 

116265 RCJW482595 

129334 RCJ\A485084j 

116274 RCJW485431.8 

303150 RCJVA489057 

129945 RCJ\A489638 

116331 RCJM91000 

116333 RCJW491250 

132994 RCJW505133 

134577 RCJVA598447 

116391 RC_AA599243 

116394 RC.AA599574J 

134531 RCJ\A600153 

116417 RCJ\A609309 

116429 RC.M609710 

116439 RCJW610068 

116459 RC.AA621399 

427505 RCJ\A621752 

132699 RC.C21523 

116541 RC.D12160 

132557 RCJ)19708 

112259 RCJ)25801 

116571 RCJM5652 
sequence. 

129815 RCJD60208J 

421919 RCJ»05O4_s 

116643 RC_F03010 

116661 RCJ=04247 
repetitive 

116715 rc_fio886 

116729 RC_/137U0 

318709 RC.H05063 

134760 RC_H16758 

116773 RC_H17315_s 

106425 RC_H22556 

116780 RC_H22566 

131978 RC_H48459_s 

116819 RC_H53073 

111428 RCJi55559_s 

133175 RCJti7957_s 



AA836472 
U49436 



AW836130 

AW014385 

U82671 

F05422 

AA286914 

AI831594 



AKD01468 
AA059412 
AA393254 
AA148984 

D14540 

AA044840 

AF255910 

BE395161 

A1658580 

AA486620 

AW292809 

AB033035 

AA 192669 

AI636361 

AI745379 

BE513442 

AA447709 

AA252468 

BE384932 

AA043429 

R19768 



AA292105 

D14041 

BE002798 

BE297412 

AW157022 

AJ129767 

AA887146 

BE514376 

N41300 

AF155827 

AA1 12748 

BE244323 

T86558 

NM_006033 

AI742845 

AW499664 

AF191018 

AA251594 

R80137 

AA361562 

AW449822 

D12160 

AA114926 

AA337548 

D45652 

BE565817 
AJ224901 
AI367044 
R61504 

AL1 17440 

BE549407 

R52576 

NMJJ0Q121 

AI823410 

H24201 

H22566 

AA355925 

H53073 

AL031428 

AW955632 



Hs£97939 

Hs386236 

Hs.78995 

Hs.75277 

Hs.88678 

Hs.57698 

Ha.168352 

Hs. 183299 

Hs.68647 

Hs.40539 
Hs.62180 
Hs.47986 
Hs.43619 
Hs.48849 

Hs.199160 
Hs^51871 
Hs.54650 
,Hs.1390 
Hs.61426 
Hs.41135 
Hs.50727 
Hs.51965 
Hs.45032 
Hs.179520 
Hs.42911 
Hs.238944 
Ks.268115 
Hs.1098 
Hs.64313 
Ks.62618 
Hs.172788 
Hs.172788 
Hs.326740 
Hs.278573 
Hs.287850 
Hs.55189 
Hs.4947 
Hs.162874 
Hs.8217 
Hs.165998 
Hs71616 
Hs.203963 
Hs.279905 
Hs.85951 
Hs.75113 
Hs.65370 
Hs.110713 
Hs.12484 
Hs.279923 
H&43913 
Hs.302738 
Hs.178761 
Hs.55200 
Hs.249212 
Hs.5122 
Hs-333402 



Hs26493 
Hs.109526 
Hs.153638 



Hs.170263 
Hs.115823 
Hs.285280 
Hs.89548 
Hs.169149 
Hs.247423 
Hs.30098 
Hs.36232 
Hs.93698 
Hs.174174 



B 

KIAA1856 protein 

MADS box transcnpfion enhancer factor 2, polypeptide C (myocyte enhancer factor 2C) 

hypothetical protein FU13910 

ESTs, Weakly similar to Unknown [Usaplens] 

Empirically selected from AFFX single probeset 

nucteopon*n-49ce ptotein 1 

ESTs 

ESTs. Weakly similar to ALU7 JWMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 

chromosome 8 open reading frame 1 

anCHn (Drosophfia Scraps homolog), actin binding protein 

hypothetical protein MGC10940 

ESTs 

ESTs, Weakly similar to ALU4 .HUMAN ALU SUBFAMILY S82 SEQUENCE CONTAMINATION 

myeloid/lymphoid or mixed-lineage leukemia (trithorax (Drosophfia) homolog) 
CTP synthase 

Junctional adhesion molecule 2 

proteasome (prosome, macropaln) subuntt beta type, 2 

Homo sapiens mesenchymal stem cell protein DSC96 mRNA, partial cds 

endomudn-2 

N^tytgiucosairfoldase, alpha- (Sanfliippo disease I1IB) 

K1AA1209 protein 

ESTs 

hypothetical protein MGC10702 
ESTs 

hypothetical protein FU 10631 

ESTs, Weakly slmfer to T08599 probable transcription factor CA150 [H.sapiens] 
DKFZp434J1813 protein 

ESTs, Weakly similar to AF2571 82 1 G-protein-coupled receptor 48 [H.sapiens] 
ESTs 

ALEX3 protein 
ALEX3 protein 

hypothetical protein MGC10947 

H-2K binding factor-2 

integral membrane protein 1 

hypothetical protein 

hypothetical protein FU22584 

guanine nucleotide binding protein (G protein) alpha 12 

stromal antigen 2 

PAM mRNA-blnding protein 

Homo sapiens mRNA; cDNA DKFZp586N1720 (from clone DKFZp586N1720) 

hypothetical protein FU10339 

clone HQ0310PRO0310p1 

exportin, fRNA (nuclear export receptor for tRNAs) 

general transcription factor I1IA 

lipase, endothelial 

DEK oncogene (DMA binding) 

Human done 23826 mRNA sequence 

putative nucleotide binding protein, estradtoWnduced 

PIBF1 gene product 

Homo sapiens cOMA: FU21425 (Is, done COL04162 
26S proteasome-assodated pad1 homolog 
ESTs 

polymerase (RNA) 111 (DNA directed) (155kD) 
ESTs 

hypothetical protein MGC12760 

gb;HUMGS02848 Human adult lung 3* directed Mbol cDNA Homo sapiens cDNA J, mRNA 

hypothetical protein FU21657 
zinc finger protein 198 

myetokWymphold or mixed-lineage leukemia 2 

gbyh16a03.s1 Soaras infant brain 1NIB Homo sapiens cDNA done 3' simflar to contains Alu 

tumor protein p53-binding protein, 1 

ribonuctease P, 4QkO subunlt 

Homo sapiens cDNA: FU22096 lis, clone HEP16953 

erythropoietin receptor 

karyopherin alpha 1 (importin alpha 5) 

adducin2(beta) 

ESTs 

KEAA0186 gene product 
EST 

KIAA0601 protein 

ESTs, Weakly similar to S19560 proline-nch protein MP4 - mouse [M.musculus] 
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116844 RC_H64938_s H64938 Hs.337434 

116845 RC_H64973 AA649530 

116892 RC_H69535 A1573283 Hs.38458 

116925 RC_H73110 H73110 Hi260603 

5 116981 RCJW1783 N29218 Hs.40290 

131768 RC_H66259 AC005757 Hs.31B09 

117031 RC.HB8353 H88353 
contains L1 

117034 RC_HS8639 U72209 Hs.180324 

10 132542 RCJ188675 AL137751 Hs^63671 

134403 RC.H93708_s AA334551 Hs.82767 

117280 RCJJ22107 M18217 Hs.172129 

117344 RCJJ24046 R19085 Hs.210706 

117422 RCJJ27Q28 A1355562 Hs.43880 

15 117475 RC N3Q2Q5 N30205 Hs.93740 

117487 RCJJ30621 N30621 Hs.44203 

130207 RCJB3258 AF0442O9 Hs.1 44904 

117549 RCJJ33390 N33390 Hs.44483 

117683 RCJJ40180 N40180 

20 1MAG£:276387 7 similar to 

117710 RC.N45198 N45198 Hs.47248 

104514 RCJM5979jj AF164622 Hs.182982 

117791 RCJM8325 N48325 Hs.93956 

117822 RCJ448913 AA706282 Hs.93963 

25 129647 RCJM9394 AB016259 Hs.1 18140 

117895 RC_N50656 AW450348 Hs.93996 
[risapfens] 

131557 RC.N50721 AA317439 Hs.28707 

133057 RC.N53143 AA465131 Hs.64001 

30 118103 RCJI55326 AA401733 Hs.184134 

118111 RCJJ55493 N55493 
mRNA 

118129 RC N57493 N57493 
IMAGE:2773583 , ,mRNA 

35 118278 RCJ462955 N62955 Hs.316433 

118329 RC.N63520 N63520 
3', mRNA 

118336 RC_N63604 BE327311 Hs.47166 

132457 RCJI64166 AB017365 Hs.173659 

40 118363 RC.N64168 A1183838 Hs.48938 

118364 RCJJ64191 N46114 Hs.29169 

118475 RC.N66845 N66845 
similar to 

118491 RCJJS7135 AV647908 Hs.90424 

45 118500 RCJJ67295 W32889 Hs.154329 

101663 RCJJ68399 NMJM3528 Hs.2178 

118584 RCJJ68963 AW136928 
sequence 

421983 RC.N69331 AI252640 Hs.1 10364 

50 118661 RCJJ70777 AL137554 Hs.49927 

118684 RC_N71364_S N71313 Hs.163986 

118689 RC_N71545_s AW390601 Hs.184544 

118690 RC.N71571 N71571 Hs.269142 
118766 RC_N74455 N74456 Hs.50499 

55 118793 RCJJ75594 M75594 Hs.285921 

118817 RCJ479035 A1668658 Hs.50797 

118844 RCJW0279 AL035364 Hs.50891 

118919 RCJW1797 AW452696 Hs.130760 

129558 RCJ492454 AW580922 Hs.180446 

60 132692 RCJJ94581 AW191962 Hs.249239 

118996 RCJ494746 N94746 Hs.274248 

119021 RC.N98238 N98238 Hs.55185 

119039 RC.R02384 AJ160570 Hs.252097 

119063 RC.R16833 R16833 Hs.53106 

65 WARNING 

118523 RC_R41828_S Y07759 Hs.170157 

119111 RCJW32Q3 T02865 Hs.328321 

133970 RC.R46395 AA214228 Hs.127751 

119146 RCJR58863 R58863 Hs.91815 

70 120296 RCJR78248 AW995911 Hs.299883 

119239 RCJ11483 T11483 
sequence. 

119281 RCJ16896 AI692322 Hs.65373 

119298 RC.T23820 NMJJ01241 Hs.155478 

75 126502 RCJ30222 T10077 Hs.13453 

135073 RCJV15275_s W55956 Hs.94030 



ESTs, Weakly sinter to A46010 X-flnked retinopathy protein [Haptens] 
gbms44flJ5^ 1 NCLCGAPJUvl Homo sapiens cDNA done, mRNA sequence 
ESTs 

ESTs, Moderately similar to A47582 8-ceH grow&i factor precursor [Haptens] 
ESTs 

nypotneocai pnoisfi 

gb:yw21a0Zs1 Morton Fetal Cochfea Homo sapiens cONA done (MAGE252842 3* simSar to 
YY1 -associated factor 2 

Homo sapiens mRNA; cDNA OKFZp434l0812 (from done DKFZp434i0812); partial cds 
sperm specific arrf^en 2 

Homo sapiens cDttk FU21409 Us, clone COL03924 

Homo sapiens cDNA HJ13182 fis, done NT2RP30O4070 

ESTs, Weakty similar to A46010 X-Hnked retinopathy protein Hsapiens] 

ESTs. Weakly similar to 138022 hypotheticaJ protein [Hsapiens] 

ESTs 

nuclear receptor co-repressor 1 
EST 

gb:yy44d02.s1 Soaresjnulflple.scterosJs JNbHMSP Homo sapiens cDNA done 

ESTs. HSghiy similar to similar to Cdc14B1 phosphatase [H.saplens] 

gokjin-67 

EST 

ESTs 

KIAA0716 gene product 

ESTs, Highly similar to SORLHUMAN SORTIUN-RELATED RECEPTOR PRECURSOR 

signal sequence receptor, gamma (translocon-assodated protein gamma) 

Homo sapiens clone 25218 mRNA sequence 

ESTs 

gb:yv50cOZs1 Scares fetal Over spleen 1NFLS Homo sapiens cDNA done IMAGE246146 3'. 

gb:yy54c08.s1 Scares_mu!tiple_sderc«isJ2NoHMSP Homo sapiens cDNA clone 

Homosaplens CONAFU11375 fis. done HEMBA1 00041 1 , weaJdy similar to ANKYRIN 
gb7y62»U1 Soaresjrwltiple.sderosis JNbHMSP Homo sapiens cDNA done IMAGE:278137 

HT021 

frizzled (Drosophfia) homo log 7 
hypothetical protein FU21 802 
hypothetical protein FU22623 

gb:za46d 1 &\ Scares fetal Over spleen 1NFLS Homo sapiens cDNA done 1MAGE295604 3* 

Homo sapiens cDNA FU23285 fis. clone HEP09071 
ESTs 

H2B hlstone family, member Q 

gb:lll-H-BJ1-adp^08^-UU1 NCI_.CGAP_.Sub3 Homosaplens cDNA done 3', mRNA 

peprjdylprotyl feomerase C (cydophilln C) 
protein kinase NYD-SP15 

Homo sapiens cDNA: FU22765 fis, done KAIA1 1 80 
Homo sapiens, clone IMAGE:3355383, mRNA. partial cds 
ESTs 
EST 

ESTs, Moderately similar to T47135 hypothetical protein DKFZp761L0812.1 JH.sapiens] 
ESTs 

hypotheticaJ protein 

myosin phosphatase, target subuntt 2 

karyopherin (Importm} beta 1 

cotegen, type VIII, alpha 2 

hypothetical protein FU2Q758 

ESTs 

pregnancy specific beta-1 ^rycoproiein 6 

ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

myosin VA (heavy polypeptide 12, myoxin) 
EST 

hypothetical protein 
ESTs 

hypothetical protein FU23399 

gb.-CHR90049 Chromosome 9 exon Homo sapiens cDNA clone 1 1 1-1 5 1 and 3\ mRNA 

ESTs, Weakty similar to T02345 hypothetical protein K1AA0324 [H .sapiens] 
cycCnT2 

hypotheticaJ proteh FU14753 

Homo sapiens mRNA; cONA DKFZp586E1624 (from clone DKFZp586E1624) 
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119558 RCJY38194 W38194 

132736 RC.W42410 AW081883 
sapiens mad protein 

132173 RC_W46577_s X89426 

5 134873 RCJV49S32J* AA884471 

119650 RCJV57613 R82342 

119654 RC_W57759 W57759 
similar to 

119683 RC W61118 W65379 

10 119694 RC W65344 AA041350 

119718 RCJV69216 W69216 

133010 RC W69379 A1287518 

119938 RCJV86728 AW0 14862 

120128 RCJ38499 BE37932Q 

15 120130 RCJO8630 AA045767 

120148 RC_Z39494 F02808 

120155 RCJ39623 Z39623 

131486 RC_Z40071_3 F06972 

120183 RC.Z40174 AW082866 

20 120184 RC_Z40182 Z40182 

120211 RC^40904 240904 

120245 RCJVA166965 AW959615 

120247 RC_AA167500 AA167500 
13)254 RCJ\A169599_s W90403 

25 120259 RCJW171724 AW014786 

120260 RCJW171739 AK000061 

120275 RCJW177105 AA177105 

120284 RCJW182626 AA179656 
to contains 

30 114056 RCJW86324 AA188175 

129507 RCJW192099 AJ236885 

120302 RC.AA192173 AA837098 

120303 RC_AA192415 AI216292 
120305 RC _M192553 AW295096 

35 120319 RC_AA194851 T57776 
133389 RCJW195520.S AA195764 

120326 RCJW196300 AA196300 

134272 RCJW196517 X76040 

133145 RCJW96549 H94227 

40 120327 RCJW196721 AK000292 

106686 RCJW96729J N66397 

120328 RCJW196979 AA923278 

120340 RC.AA206828 AA206828 
slmflar to 

45 134292 RCJW207123 A1906291 

131522 RCJ^214539J AI380040 
129051 RCJ\A226914_s AA227068 

120375 RCJW227260 AF028706 

120376 RCJVA227469 AA227469 
5 0 IMAGE:663732 3', mRNA sequence. 

120390 RCJW233122 AA837093 
303876 RCJ\A233334_s U64820 
dominant, ataxln 3) 

132038 RC.M233347 AI825842 

55 104463 RCJW233519 T85825 

125750 RCJ\A233714 AA018515 

120396 RC.AA233796 AA134006 

120409 RC.AA235050J AA235050 
gb!07077 

60 120414 RC.AA235704 AW137156 

120420 RCJW236031 AI128114 

120422 RC_AA236352 AL133097 
132221 RC_M236390_s W94915 

120423 RCJK236453 AA236453 
65 120435 RCJW243370 AA243370 

120453 RCJW250947 AA250947 

120455 RCJW251083 AA251720 

120456 RCJ^A251113 AA488750 
120473 RCjyA251973 AA251973 

70 128922 RCJW252023 AI244901 

120477 RCJVA252414 AA252414 

120479 RCJW252650 AF006689 

120488 RCJW255523 AW952916 

120510 RCJVA258128 AI796395 

75 120527 RCJ\A262105 AA262105 

120528 RCJVA262107 A1923511 



Empirically selected from AFFX single probeset 

H&288261 Homo sapiens cONA: RJ23037 fis, done LNG02036, highly similar to HSU68019 Homo 

Hs.41716 endothelial cefl-specifc molecule 1 

Hs.90449 Human done 23908 mRNA sequence 

Hs.79856 ESTs, Weakly similar to S85657 alpra-IC^adrenergtereceptorspiice form 2 (H.sapiens] 

gb:zrf20g11.s1 Scaie^fetaUiearLNbHH19W Homo sapiens cDNA done IMAGED 1252 3* 

HS57835 ESTs 

Hs.57847 ESTs, Moderately similar to 1CE4 JiUMM CASPASE-4 PRECURSOR (Haptens] 

Hs.92848 ESTs 

Hs.62669 Homo sapiens mRNA; cDNA DKFZp586D0923 (from clone DKFZp586D0923) 

Hs.58885 ESTs 

Hs.91448 MKP-1 like protaln tyrosine phosphatase 

Hs.5300 bladder cancer associated protein 

Hs.65765 ESTs 

Hs.65783 ESTs 

H&27372 BMX non-receptor tyrosine kinase 

Hs.65882 ESTs 

Hs.65885 EST 

Hs.66012 EST 

Hs.111045 ESTs 

Hs.103939 EST 

Hs.111054 ESTs 

Hs.192742 hypoflwflcaJ protein RJ12785 

Hs.101590 hypo&ieficaJ protein 

Hs,78457 solute carrier family 25 {mitochondrial earner; ornithine transporter) member 15 

gbap54e1 U1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA clone 3* slmflar 

Hs.82506 K1AA1254 protein 

Hs.112180 zinc finger protein 148(pHZ^2) 

H:l269933 ESTs 

Hs.96184 ESTs 

Hs.101337 uncoupling protein 3 (mitochondrial, proton carrier) 

Hs.191094 ESTs 

Hs.72639 ESTs 

Hs.21145 hypothetical protein RG033M05.2 

Hs.278614 protease, serine, 15 

Hs.6592 Homo sapiens, done IMAGE2961368, mRNA, partial cds 

Hs.278732 hypometical protein FU20285 

Hs.334825 Homo sapiens cDNA FU14752 fis, clone NT2RP3003071 

Hs.290905 ESTs. Weakly similar to protease [H-sapiensl 

gb:zq80bO8.s1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone IMAGE:647895 7 

Hs.81234 tmmunog tobuCn superfamily, member 3 

Hs.239489 T1A1 cytotoxic granule-associatBd RNA-Wnding protein 

Hs.108301 nudear receptor subfamily 2, group C. member 1 

Hs.1 11227 Zic family member 3 (odd-paired Orosophila homolog, heterotaxy 1) 

gb:zr18a07.s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA done 

Hs.111460 caldum/cairnodutirHlependent protein kinase (CaM kinase) II delta 

Hs.66521 Macradc-Joseph disease (spinocerebellar ataxia 3, olivopontocerebellar ataxia 3, autosomal 

Hs.3776 zinc finger protein 216 

Hs.246885 hypothetical protein FU20783 

Hs.264482 Homo sapiens mRNA; cDNA OKFZp761 A041 1 (from done DKFZp761A0411) 

Hs 79306 eukaryotic translation fnffiation factor 4E 

gb:za38e04.s 1 Scares JJhHMPu_S1 Homo sapiens cDNA done IMAGES87486 3* similar to 

Hs.181202 hypothetical protein FU1G038 

Hs.1 12885 spinal cord-derived growth factor-B 

Hs.301717 hypothetical protein DKFZp434N1928 

Hs.42419 ESTs 

Hs.18978 Homo sapiens cDNA: FU22822 fis, done KAIA3968 

Hs.96450 EST 

Hs.170263 tumor protein p53-bindlng protein, 1 

Hs.104347 ESTs, Weakty similar to ALUC JiUMAN till ALU CLASS C WARNING ENTRY 111 (H^aplensl 

Hs.88414 BTB and CNC homology 1, basic leucine zipper transcription factor 2 

Hs.269988 ESTs 

Hs.9589 ublquiflnl 

Hs.43141 DKFZP727C091 protein 

Hs.1 10299 mltog en-activated protein kinase kinase 7 

Hs.63510 KIAA0141 gene product 

HS.1 11377 ESTs 

Hs.4094 Homo sapiens cDNA FU 14203 6s, clone NT2RP3003264 

Hs.104413 ESTs 
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120529 RCJU262235 
120541 RCJ\A278298 
131445 RCJ\A278529J 
120544 RC.AA278721 
120562 RCJVA280036 
120569 RCJW280648 

120571 R<LAA280738 

120572 RC_AA280794 
129434 RCJW280837 
130529 RC_AA28Q886 



120575 RC_AA280934 
132635 RC _AA281535 
120591 RC_AA281797_s 
120593 RCJU282047 
430275 RCJ\A283002 
117729 RCJW283709 
120609 RCJW283902 
132754 RCJtt284108 
130315 RCJ\A284109 
132614 RCJW284371 
447503 RCJ\A284744J 
cds 

135376 RCJVA284784 
120621 RC_AA2S4840 
107868 RCJW286844 
129868 RCJ^A287032 
120644 RCJW287038 

120660 RCJW287546 
135370 RCJW287553_s 

120661 RC AA287556 
129116 RCJW287564 
131567 RCJW291015.S 
120699 RCJVA291716 
100690 RCJ\A291749_s 
120726 RC_AA293656 
120737 RCJ\A302430 
120745 RCJW302809 
135192 RC_AA302820_s 
120750 RCJW310499 
120761 RC.AA321890 

120768 RC_AA340589 

120769 RCJW340622 
135232 RCJ\A342457J 
CONTAMINATION 
133439 RC_AA342828_s 
120793 RCJ\A342864 
120796 RCJ\A342973 
120809 RC_AA346495 
repeat mRNA sequence. 
132459 RCJW347573 
120825 RCJW347614 
120827 RCJW347717 
120839 RCJW348913 
repeat mRNA sequence. 
120850 RC.AA349647 
120852 RC_AA349773 
128852 RCJW350541_s 
135240 RCJW357159J 
120870 RCJ\A357172J 
WARNING 

134637 RCJ\A369856_s 
120894. RCJW370132 
131854 RCJ\A370472_s 
120897 RCJW370867 
120915 RC_AA377296 

120935 RC_AA383902 
WARNING 

120936 RCJW385934 

120937 RCJW386255 

120938 RCJ\A386260 
129722 RC_ J AA386266 
120960 RCJW398014 
120985 RCJ\A398222 
120988 RC.M398235 



A1434823 Hs.104415 ESTs 

W07318 Hs.240 Mohase phcsphoprotein 1 

NM.014264 Hs.172052 serirafflueonine kinase 18 

BE548277 Hs.103104 ESTs 

BE244580 Hs ,302287 hypofteflcal protein FU10330 

AA807544 Hs£4970 ESTs, WeaMy sJmDarto B34323 GTP-binding protein Rab2 (H^apJens] 

AB037744 Hs.34892 WAA1 323 protein 

H39599 Hs.294008 ESTs 

AW967495 Hs, 166644 ESTs 

AA178953 gbzp39e03.s1 Stratagene muscle 937209 Homo sapiens cDNA done 3* similar to contains Alu 

AW978022 Hs73891 1 hypothetical proteki DKFZp762E151 1; WAA1816 protein 

AB020686 Hs.54037 ectonudeotide pyrophosphatase/phosphodiesterase 4 (putative function) 

AF078847 Hs. 191 356 general transcription factor HH, polypeptide 2 (44kD subunif) 

AA748355 Hs.193522 ESTs 

Z11773 Hs.237786 zinc finger protein 187 

AA306166 Hs.7145 calpah7 

AW978721 Hs.266076 ESTs, WeaWy stmflar to A46010 X-Hnked refinopathy protein [Haptens] 

AI752244 Hs.75309 eukaryoOc translation etongafion factor 2 

AI241084 Hs.154353 nonselective sodium potassium/proton exchanger 

AA284371 Hs. 118064 similar to rat nuclear ubiquitous casein kinase 2 

AA1 15496 Hs.336898 Homo sapiens, Similar to RIKEN cDNA 1810038N03 gene, done MGC:9890 f mRNA. complete 

8E617856 Hs.99756 rrutodwndnal r&osome recyc&ng factor 

AW961294 Hs. 14381 8 hypothetical protein FU23459 

AA286844 Hs.61260 hypothetical protein FU1 31 64 

AW172431 Hs.13012 ESTs 

AI869129 Hs.96616 ESTs 

AA286785 Hs.99677 ESTs 

BE622187 Hs.99670 ESTs, Weakly similar to 138022 hypothetical protein (H^aptens] 

AA287556 Hs.263412 ESTs, WeaWy similar to ALUELHUMAN Ml ALU CLASS B WARNING ENTRY Hi [Helens] 

AB019494 Hs.225767 (DN3 protein 

AF015592 Hs.28853 COC7 (cell division cycle 7, S. cerevfciae, homologHite 1 

AI683243 Hs.97258 ESTs, Moderately similar to S29539 rfoosomal protein 113a. cytosolic [Haptens] 

AA383256 Ha. 1657 estrogen receptor 1 

AA293655 Hs.97293 ESTs 

AL049176 Hs.82223 chordWike 

AA302809 gb£ST 10428 Adipose tissue, white I Homo sapiens cDNA 3* end, mRNA sequence. 

U63993 Hs.321709 purinerglc receptor P2X. Cgand-gated ion channel 4 

AI191410 Hs.96693 ESTs, Moderately similar to 210926QA B ceD growth factor (H^apiens) 

AA321690 Hs.1265 branched chain kato acid dehydrogenase E1, beta polypeptide (maple syrup urine disease) 

AA340589 Hs.104560 EST 

AI769467 Hs.96769 ESTs 

AL036812 Hs.96800 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE 

Z23091 Hs.73734 glycoprotein V (platelet} 

AA342864 Hs.96812 ESTs 

AI247356 Hs.96820 ESTs 

AA346495 gb:EST52657 Fetal heart II Homo sapiens cDNA 3" end similar to EST containing O family 

AL120071 Hs.48998 fibronectin leucine rich transmembrane protein 2 

AI280215 Hs.96885 ESTs 

AA382525 Hs. 132967 Human EST clone 122887 manner transposon Ksmart sequence 

AA348913 . gbf ST55442 Infant adrenal gland II Homo sapiens cDNA J end similar to EST containing Alu 

AA349647 Hs.96927 Homo sapiens cDNA FU12573 fts. clone NT2RM4000979 

AA349773 Hs.191564 ESTs 

R40622 Hs.106601 ESTs 

AA357159 Hs.96986 EST 

AA357172 Hs.292581 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

U87309 Hs.180941 vacuolar protein sorting 41 (yeast homotog) 

AA370132 Hs.97063 ESTs 

AF229839 Hs. 173202 Wcappa^interacCng Ras-Gke protein 1 

AA370867 Hs.97079 ESTs, Moderately similar to AF174605 1 F-box protein Fbx25 [H^aptensJ 

AL135556 Hs.97104 ESTs 

ALO48409 Hs.97177 ESTs, Weakly similar to ALU1 JHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

AA385934 Hs.97184 EST, Highly similar to (deffine not available 7499603) [C.eJegans] 

AA386255 Hs.971B6 EST 

AA386260 Hs. 104632 EST 

R20855 Hs.5422 gtycoprotein M6B 

AA398014 Hs.104684 EST 

AI219896 Hs.97592 ESTs 

AA398235 Hs.97631 ESTs 
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121008 RCJW398348 
GSSsandaCpG 
121029 ROM398482 

121032 RCJVA3S8504 

121033 RC.AA3985Q5 

121034 RCJ\A398507 

121035 RC.AA398523 
121058 RC.AA398625 

121060 RC.AA398632 

121061 RC.AA398633 

121091 RC.M398894 
CONTAMINATION 

121092 RCJW398895 
121094 RCJVA398900 
121096 RCJ\A398904 
121115 RCJ\A399122 

121121 RCJW399371 

121122 RCJW399373 
121125 RCJW399441 
121151 RCJW399636 
121153 RCJVA399640 
121163 RCJW399680 
121176 RC_AA400080 
121192 RCJW400262 
121223 RCJW400725 
121227 RCJ\A400748 
121231 RCJW4Q0780 

121278 RC.M401631 

121279 RC.M401686 
121282 RCJW401695 
121299 RCJW402227 

121301 RCJW402329 

121302 RCJ\A402398 

121304 RCj\A402449 

121305 RCJVA402468 
134721 RCJW403268_s 

121323 ROM403314 

121324 RCJW404229 
129047 RCJW404260 
131074 RCJ\A404271 
121344 RCJW405026 
121348 RCJW405182 
121350 RC_AA4Q5237 
contains Alu 

121400 RCJW06061 

121402 RCJ\A406063 

121403 RCJW06Q70 
121408 RCJ\A406137 
121431 RCJW406335 
132936 RC_AA411801 
121471 RCJVU11804 
121474 RCJVA411833 
121526 RCJW412219 

.121530 RCJW412259 

121558 RCJUW12497 
contains 11.0 L1 

121559 RCJUV412498 
121584 RC_AA4165B6 
121609 RCJ\A416867 
121612 RCJ\A416874 
121737 RC.AA421133 
121740 RCJW421138 
129194 RCJW422Q79 . 
121784 RC.AA423837 

121802 RC_M424328 

121803 RCJVA424339 
135286 RCJ\A424469_s 
121806 RCJVA424502 
129517 RCJW425004 
121845 RCJ\A425734 
CONTAMINATION 
121853 RCJ\A425887 
121891 RC_AA426456 
121895 RCJ\A427396 
similar to contains 
121699 RCJIA427555 



AA398348 Hs^01720 Human ONA sequence (rom done RPf 1-251 J8 on chromosome 13 Contains ESTs, STSs, 



AA398482 Hs.97641 EST 

AA393037 Hs.161798 ESTs 

AA398505 Hs.97360 ESTs 

AL389951 Hs.271623 nucteoporfn 50kD 

AA398523 Hs.210579 ESTs 

AA398625 Hs.97391 ESTs 

AA398632 Hs.97395 ESTs 

AA393288 Hs.97396 ESTs 

AA398894 Hs.97657 ESTs, Moderately similar to ALU8JWMAN ALU SUBFAMILY SX SEQUENCE 

AA398895 Hs.97658 EST 

AA402505 gbzt62M0.r1 SoaresJestbJWT Homo sapiens cDNA clone 5, mRNA sequence 

AA398904 Hs.332690 ESTs 

AA398187 Hs.104682 ESTs, Weakly similar to mitochondrial crtrata transport protein [Usaptens] 

AA399371 Hs.189095 similar to SALL1 (sal (DrosophilaHike 

AI126713 Hs.192233 ESTs, Highly similar to T00337 hypothetical protein WAA0568 (H haptens] 

AL042981 Hs.251278 K1AA1201 p 

AA399636 Ha. 143629 ESTs 

AA399640 Hs.97694 ESTs 

AIS76062 Hs.1 11902 ESTs 

AL121523 Hs.97774 ESTs 

AA400262 Hs. 190093 ESTs 

A1002110 Hs.97169 ESTs, WeaMy similar to dJ667H1 2.2.1 IRsaplensl 

AA400748 Hs.97823 Homo sapiens mRNA; cDNA OKFZp434D024 (from done DKFZp434D024) 

AA814948 Hs.95343 ESTs, Weakly similar to ALUCJHUMAN Hi! ALU CLASS C WARNING ENTRY 111 [H.sapiens] 

AA037121 Hs.98518 Homo sapiens cDNA FU1 1490 fis, done HEMBA1001918 

AA292873 Hs.177996 ESTs 

AA401695 Hs.97334 ESTs 

AA402227 Hs.22826 tropomoduCn 3 (ubiquitous) 

NM006202 Hs.89901 phosphodiestemse 4A, cAMP-spedfic (dunce (DrosophllaHwmolog phosphodiesterase E2) 

AA402567 Hs.325520 IAT1-3TM protein 

AA293863 Hs.97316 EST 

AA402468 Hs.291557 ESTs 

AK0O0112 Hs.89306 hypothetical protein RJ20105 

AA291411 Hs.97247 ESTs 

AA404229 Hs.97842 EST 

A1768623 Hs.108264 ESTs 

U16125 Hs.181581 glutamate receptor, lonotroplc, kainate 1 

AA4O5026 Hs.1 93754 ESTs 

AA4051B2 Hs.97973 ESTs 

AA405237 gb:zK)6e10.s1 NCLCGAPJ3C81 Homo sapiens cDNA clone IMAGE:712362 9 stater to 

AA406061 Hs.98001 EST 

AA408063 Hs.98003 ESTs 

AA406070 Hs.98004 EST 

AA408137 Hs.98019 EST 

AA035279 Hs.176731 ESTs 

AL120659 Hs,611 1 aryi-hydrocarbon receptor nuclear translocate r 2 

AA411804 Hs.261575 ESTs 

AA402335 Hs.188760 ESTs, Highly similar to Trad Sapiens] 

AW665325 Hs.98120 ESTs 

AA778658 Hs.98122 ESTs 

AA412497 gb:zt95gmi Soares^testls.NHT Homo sapiens cDNA clone IMAGB730150 3* similar to 

AI192044 Hs.104778 ESTs 

AI024471 Hs.98232 ESTs 

AA416867 Hs.98185 EST 

AA416874 Hs.98168 ESTs 

AA421133 Hs.104871 erythrocyte transmembrane protein 

AA421138 Hs.98334 EST 

AA150797 Hs.109276 latexln protein 

T90789 Hs.94308 RAB35, member RAS oncogene family 

AT251870 Hs.188898 ESTs 

AI338371 Hs.157173 ESTs 

AW023482 Hs.97849 ESTs 

AA424313 Hs.9B402 ESTs 

AW972853 Hs.112237 ESTs 

AI732692 Hs.165066 ESTs, Moderately slmBar to ALU2_HUMAN ALU SUBFAMILY SB SEQUENCE 

AA425887 Hs.98502 hypothetical protein FU 14303 

AA426456 Hs.98469 ESTs 

AA427396 gbzw33a0ls1 Scares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:771050 3* 

R55341 HsiM21 KIAA0203 gene product 
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121917 RC_AA428218 

121918 RCJVA428242 

121919 RCJVA428281 

121941 RCJW428865 

121942 RCJVA428994 
121970 RC JVA429666 
121993 RCJW430181 
134660 RCJ\A430184_s 
128753 RCJ\A431288_s 
122022 RCJW431293 

122050 RCJVA431478 

122051 RCJW31492 
122055 RCJVA431732 
122105 RC.AA432278 
122125 RCJVU34411 
135235 RCJVA435512J 
122162 RCJVA435698 
129406 RCJVA435711 
318801 RCJ\A435815_s 
122186 RCJW435842 
122235 RC_AA435475 
129131 RC_AA436489 
134664 RC_AA442060 
122310 RCJ\A442079 
122334 RC_AA443151 
122382 RC_AA446133 
122425 RCJ\A447145 
122431 RCJW47398 
122450 RCJW47643 
302653 RCJW447742_8 
122477 RCLAA448226 
122500 RCJW448825 
122522 RCJ\A449444 
122536 RCJW50087 
122538 RCJW450211 
122540 RCJVW50244 
122560 RCJW452123 
421919 RCJ\A452155 
122562 RCJW452156 
mRKA 

122585 RCJVA453036 
122608 RC.AA453526 

122635 RC_AA454085 
similar to 

122636 RCJW454103 
122653 RCJM54642 
122660 RCJ\A454935 
122703 RCJW456323 
122724 RCJ\A457395 
122749 RCJW458850 
122772 RCJVA459662 
131098 RCJ\A459668 
129045 RCJ\A459679_s 
122777 RC_AA459702 
135382 RCJW460017J 
122798 RCJ\A460324 
122837 RCJW461509 

122860 RCJW464414J 
rnRNA sequence. • 

122861 RCJW464428 
122910 RCJ\A47Q084 
132899 RCJM76606_3 
122967 RCJVA478521 
129560 RCJUV478523 
123009 RCJVA479949 
128917 RCJW481252 
123081 RCJ\A485351 
123133 RCJW487264 
123184 RCJW469072 
129671 RCJ\A489630 

123233 RCJ\A490225 
(H^apiensl 

123234 RC.AA490227 
123236 RCJW490255 
123255 RCJW490890 
129503 RCJ\A490916jj 



AA406397 Hs£8038 ESTs 

BE274689 Hs. 184 175 chmmosome 2 open reading frame 3 

M4282S1 Hs.98560 EST 

AA428865 Hs.98563 ESTs 

AW452701 Hs.293237 ESTs 

AA429666 Hs.98617 EST 

AW297880 Hs.98661 ESTs 

U73524 Hs.87465 ATP/GTP-ttnding protein 

AA306478 Hs.95327 CD3D antigen, delta polypeptide (TTT3 complex) 

AA431293 H&98716 ESTs, Moderatety slmflar to T42650 hypothetical protein DKFZp434D0215.1 pisapiens] 

AW53076 Hs.166109 EUV (embryonic tethat, abnormal vision, DrosophilaHike 2 

AA431492 HsS8742 EST 

AA431732 Hs.98747 EST 

AW2416S5 Hs.98699 ESTs 

AK000492 Hs.98806 hypothetical protein 

AW298244 H&293507 ESTs 

AA628233 Hs.79946 cytochrome P450, subfamily XIX (aramatlzation of androgens) 

AB018255 Hs.111138 K1AA0712 gene product 

U40763 Hs.77965 peptkjyHxotyl feornerase G {cydophiBn G) 

AA398811 Hs.104673 ESTs 

AA436475 Hs.112227 membran^assodated nucleic acid binding protein 

AS026438 Hs. 177534 dual specffidty phosphatase 10 

AA256106 Hs.87507 ESTs 

AW192803 Hs.98974 ESTs, Weakly similar to S65824 reverse transcriptase homotog [H-sapiens] 

BE465894 Hs.98365 ESTs, Wealdy similar to LB4D_HUMAN NADP-DEPENDENT LEUKOTR1ENE B4 12- 

AA446440 Hs.98643 ESTs 

AB007859 Hs. 100955 K1AA0399 protein 

AA447398 Hs.99104 ESTs 

AA447643 Hs.112095 hypothetical protein DKFZp434F1819 

AJ404468 Hs.284259 dynein, axonemal, heavy polypeptide 9 

AA448226 H&324123 ESTs 

AA448825 Hs.99190 ESTs 

AA299607 Hs.98969 ESTs 

AF060877 Hs.99236 regulator of G-protein signalling 20 

AA450211 Hs.99239 ESTs 

AA476741 Hs.98279 ESTs, WeaWy similar to A43932 mucin 2 precursor. Intestinal [Ksapiens] 

AW392342 Hs.283077 centrosomal P4.t associated protein; uncharactertzed bone marrow protein BM032 

AJ224901 Hs.1 09526 zinc finger protein 198 

AA452156 gbzx29c03.s1 Soares_totaLfetusJfl)2HF8_9w Homo sapiens cDNA clone IMAGE787876 3\ 

AIS81654 Hs.1 70737 riypcrthetlcalDiotehFU23251 

AA453525 Hs.143077 ESTs 

AA454085 gbax33a08 M Soares_totatfetus_Nb2HF8_9w Homo sapiens cDNA clone IMAGE788246 3* 

AW651706 Hs.99519 hypotheticsJ protein FLM4007 

AW009166 Hs.99376 ESTs 

AI816827 Hs.180069 nuclear respiratory factor 1 

AA456323 Hs.269369 ESTs 

AA457395 Hs.99457 ESTs 

AA458850 Hsi293372 ESTs, Weakly similar to B34087 hypothetical protefo [H .sapiens] 

AW1 17452 Hs.99489 ESTs 

U66669 Hs.236642 34iydraxyisobutyrykCoenzyrne A hydrolase 

A1082883 Hs.30732 hypothetical protein FU 13409; WAA1711 protein 

AK001022 Hs.214397 hypothetical protein RJ10160 similar to insulin related protein 2 

AA978128 Hs.99513 ESTs, WeaWy similar to T 17454 (fiapharous-related forrnin - mouse [M.musculus] 

AW366288 Hs.145696 splicing factor (CC1 .3) 

AA461509 Hs.293565 ESTs, WeaWy simDar to putative p150 [Rsaplensl 

AA464414 gfczx78gOU1 Scares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:809904 7, 

AA335721 Hs.119394 ESTs 

AA470034 Hs.98358 ESTs 

AA476606 Hs.59666 SMAD In the antisense orientation 

AA806187 Hs289101 glucose regulated protein, 58kD 

AA317841 Hs.7845 hypothetical protein MGC2752 

AA535244 Hs.78305 RAB2, member RAS oncogene family 

A1365215 Hs206097 oncogene TC21 

AI815486 Hs^43901 Homo sapiens cDNA RJ20738 fc, done HHP08257 

AA487264 Hs.1 54974 Homo sapiens mRNA; cDNA DKFZp667N064 (from clone DKFZp667N064) 

BE247767 Hs.18166 K1AA087O protein 

NWL014700 Hs.1 19004 K1AA0665 gene product 

AW974175 Hs. 188751 ESTs, WeaWy similar to MAPB_HUMAN MlCROTUBULE-ASSOCiATED PROTEIN 1B 

NM_001938 Hs.16697 acvm^ulatortf transcripts 1, TBP-bindlng (negative oofactor2) 

AW968504 Hs. 123073 CDC2-reIated protein Wnase 7 

AAB30335 Hs.105273 ESTs 

AW768399 Hs.112157 ESTs 



97 



WO 02/079492 



PCT/US02/04915 



131043 RC.AA490925 
23259 RCJU490955 
[Haptens] 

23284 RCJ\A495812 
23286 RC_AA495824 
23315 RCJVA496369 
to contains 

29179 RCJ*A504125_s 
31612 RCJW521473 
23421 RC_AA598440 
23449 RCJM598899J 
129021 RC^AA599244 
I32830 RC_AA599694_s 
'23497 RCJW600037 
•23604 RC_AA6Q9135 
29539 RCJVA609582 
123712 RCJVA609684 
23731 RCJVA609839 
shiBarto 

30725 RC_AA609862 
23800 RCJU620423 
23841 RCJVA620747 
23929 RCJVA621364 
23978 RC.C20653 
133184 RC_D20085 
32835 RCJOT49 
32406 RC_D51285.s 
28695 RC.D59972J 
24028 RC_F04112J 



24057 RC_f 13604 
34899 RC.H01662 
30973 RC_H05135J 
24106 RC_H12245 
24136 RC Jt22842 
14165 RC_H30894 
1229 RCJW3442_s 
(24178 RCJ445996 
129948 RC_H69281J 
34374 RC_tf69485J 
I24254 RCJt69899 
si mil ar to 

29056 RC_H70627_s 
100919 RC_H73050_s 
30724 RC_H73260 
100716 RC_H77531_s 
I24274 RC.H80552 
129078 RC_H80737_s 
,24828 RCJ193412 
124315 RC_H94892_s 
00747 RC_H95643_s 
I24324 RC_H98552 
452933 RC_H97146 
32231 RCJ19913U 
29170 RC_H99462_s 
33143 RC_H99837_s 
32963 RCJI2214G 
35297 RCJJ22197 
34347 RC_N23756_3 
30365 RC.N24134 
21642 RCJ424195 
439311 RCJJ26739 
I24383 RCJ127098 
124387 RC.N27637 
129341 RCJU3090 
129081 RC.N35967 
I02827 RC_N38959_f 
,24433 RCJI39069 
124441 RCJM6441 
I32338 RC_N48270J 
i31403 RC_N48365js 
,24466 RCJ451316 
32210 RCJJ51499J} 
;24463 RCJI53976 

24484 RCLN54157 

24485 RCJ454300 



AF084535 Hs.22464 epilepsy, progressive myoclonus type 2, Lalora disease {laforin) 

AI744152 Hs.283374 ESTs, WeaWy similar to CA15 JiUMAN COUAGEN ALPHA 1(V) CHAIN PRECURSOR 

AA488988 Hs.293796 ESTs 

AA495824 Hs.188822 ESTs, Weakly similar to A46010 X-foiked retinopathy protein [H^apiens] 

AA496369 gbzv37d10.s1 Scares ovary tumor NbHOT Homo sapiens cDNA dona IMAG&755827 3 1 similar 

AW969Q25 Hs.109154 ESTs 

AU076668 Hs.334884 SEC10 (S. cerevistee)-flke 1 

AA598440 Hs 291 154 EST, Weakly similar to 138022 hypothetical protein (H .sapiens] 

AL049325 Hs. 112493 Homo sapiens mRNA; COMA DKFZp564D036 (from done DKFZp564D036) 

AL044675 Hs.173081 WAA0530 protein 

ML014777 Hs.57730 WAA0133 gene product 

AA765256 Hs. 135 191 ESTs, Weakly similar to unnamed protein product [H-saptens] 

AA609135 HS293076 ESTs 

T47614 Hs.323022 ESTs, Highly similar to p60 katanin [Rsapiens] 

AA609684 Hs. 11 2748 Homo sapiens cDNA: FU21543 fts, done COL06171 

AA609839 gbae62(01 .s1 Stratagene tung carcinoma 937218 Homo sapiens cONA done IMAGE351481 3* 

T98807 Hs.80248 RNA-blndlng protein gene with multiple splicing 

AA620423 Hs.112862 EST 

AA620747 Hs. 112896 ESTs 

AA621364 Hs.112981 ESTs 

789632 Hs. 170278 ESTs 

AA001021 Hs.6685 thyroid hormone receptor interactor 8 

Z83844 Hs.5790 hypofliefical protein dJ37E16.5 

AL133731 ' Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (from done DKFZp761C17l2) 

NM.003478 Hs.101299 cufflnS 

F04112 gfcHSC2JH062 normalized Infant brain cDNA Homo sapiens cONA done c-2jh06 3', mRNA 

AA902364 Hs.73853 bone morphogenetic protein 2 

AJ609045 Hs.321775 hypothetical protein DKFZp434D1428 

AI638418 Hs.78580 OEAD/H (Asp-GlU-Ate-Asp/Hls) box polypeptide 1 

H12245 gteym17a12/1 Scares infant brain 1NIB Homo sapiens cDNA done 3*. mRNA sequence 

H22842 Hs.101770 EST 

H30039 Hs.107674 ESTs 

NMJD15340 Hs.2450 leucyURNA synthetase, mitochondrial 

B5463721 Hs.971 01 putative G protein-coupled receptor 

AI537162 Hs.263988 ESTs 

N22687 Hs.8236 ESTs 

gb:yu70c1ls1 Weizmann Olfactory Epithelium Homo sapiens cDNA done IMAGE239158 3* 

Hs.108336 ESTs, WeaWy similar to ALUE_HUMAN 1111 ALU CLASS E WARNING ENTRY III [H .sapiens] 

X54534 Hs.278994 Rhesus blood group, CcEe antigens 

AK001507 Hs.306084 Homo sapiens done FLB6914 PR01821 mRNA, complete ods 

X89687 Hs.172350 HIR (histone ceil cycle regulation defective, S, cerevisiae) homolog A 

H80552 Hs.102249 EST 

AI351010 Hs.102267 lysosomal 

AW952124 Hs. 13094 presenlllns associated rrrarnboid-tike protein 

NM.005402 Hs.288757 v-ral simian leukemia viral oncogene homobg A (ras related) 

X04588 Hs.85844 neurotrophic tyrosine kinase, receptor, type 1 

H96552 Hs.159472 Homo sapiens cDNA: FU22224 fls, done HRC01703 

AW391423 Hs.288555 Homo saptens cONA.- RJ22425 fts, clone HRCC8686 

AA662910 Hs.42635 hypothetical protein DKFZp434K2435 

AW2S0380 Hs. 109059 rnltochcmdrial ribosoma! protein L12 

AA094538 Hs.272808 putative transcription regulation nuclear protein; K1AA1689 protein 

AA099693 Hs.34851 epsikm-tubufln 

All 18762 Hs.300208 Sec23-interacting protein p 125 

AF164142 Hs.82042 solute carrier family 23 (nudeobase transporters), member 1 

W56119 Hs.155103 eukaryotic translation initiation factor 1A, Y chromosome 

AF172066 Hs. 106346 retinoic add repressfote protein 

BE270668 Hs.151945 rrtftochondrtal ribosomal protein L43 

N27098 Hs.102463 EST 

N27637 Hs,109019 ESTs 

AI193519 Hs.226396 hypothetical protein FU1 11 26 

A1364933 Hs. 16891 3 serine/threonine kinase 24 (Ste20, yeast homoJog) 

BE244588 Hs.6456 chaperonln containing TCP1,subuntt 2 (beta) 

AA280319 Hs.288840 PR01575 proteh 

AW450481 Hs.161333 ESTs 

AA353868 Hs. 182982 gofeln-67 

AI473114 Hs.26455 ESTs 

R10084 Hs.113319 kinesin heavy chain member 2 

NM.007203 Hs.42322 A kinase (PRKA) anchor protein 2 

AI821780 Hs.179864 ESTs 

H66118 Hs^85520 ESTs, WeaWy similar to 210926OA B ceD growth factor [^sapiens] 

AB040933 Hs.15420 K1AA1 500 protein 
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124494 RCJK4831 
129200 RCJI59849 
124S27 RCJJ62132 
124532 RCJ462375 
133213 RC.N63138 
124539 RCJI63172 
133651 RC_N63772 
129196 RC_K63787 

124575 RCJJ68168 

124576 RCJ168201 

124577 RCJJ683Q0 
mRNA 

124578 RCJ468321 
124593 RCJ*69575 
128501 RC _N75007 
105691 RC_N75542 
128473 RC_N90066 
128639 RCJJ91246 
124652 RC.N92751 
133137 RCJI93214.3 
124671 RCJI99148 
PROTEIN 

133054 RCJTO7876 
[C.eiegans) 

130410 RC_R10865J 
124720 RC_R11Q56 
similar to 

124722 RCJN1488 
129961 RCJE2947 



N54831 Hs.271381 ESTs, Weakly similar to 138022 hypothetical protein [H sapiens] 

NS9849 Hs.13565 Sam68-i3<a prwsphotyrosine protein, T-STAR 

N79264 Hs.269104 ESTs 

N62375 Hs.102731 EST 

AA903424 Ha.6786 ESTs 

054120 Hs.146409 cell division cycle 42 (GTP-binding protein, 25kD) 

A13O1740 Hs.173381 dOrydropyrljr^lnase^ 

BE296313 Hs.265592 ESTs, WeaWy simSar to 138022 hypothetical protein [H.saplens] 

N68168 gb:za1tc01.s1 Scares fetal Over spleen 1NFIS Homo sapiens cONA dona 3". mRNA sequence 

N 68201 Hs£69124 ESTs, Weakly slmflar to 138022 hypothetical protein [risaplens] 

N68300 gb2a12g073l Scares fetal liver spleen 1NFLS Homo sapiens cONA done IMAGE592380 3\ 

N68321 Hs.231500 EST 

N69575 Hs.102788 ESTs 

AL133572 Hs.199009 protein containing CXXC domain 2 

A1680737 Hs.289088 Homo sapiens cOrWFU11918 Cs, clone HEMBB1000272 

T78277 Hs.100293 ODnked N^tytgtucosamlne (GteNAc) transferase {VJDP^^tylglucosamlna:p<)rypep6oe-N- 

AW582962 Hs.102897 CGM7 protein 

W19407 Hs.3862 regulator of nonsense transcripts 2; DKFZP434D222 protein 

AB002316 Hs.65746 WAA0318 protein 

AKD01357 Hs. 102951 Homo sapiens cONA RJ 10495 6s, clone NT2RP200Q297, moderately similar to ZINC FINGER 

AA464836 Hs.291079 ESTs, Weakly similar to T27173 hypothetical protein Y54G1 1A.9 - CaenorhabdTtis elegans 



J00077 
R05283 

T97733 
R23053 



Hs.155421 



aipha~fotoprotein 

gb:ye91c08.s1 Scares fetal Over spleen 1 KFLS Homo sapiens cDNA done IMAGE: 1 25102 3' 



Hs.185685 ESTs 



repetitive element 128944 RC_R23930_s AL137586 
132965 RCLR26589J A1248173 Hs.191460 



133740 RC_R37588j5 
133074 RCJB7613 
124757 RC_R38398 
124762 RCJB9179J 
124773 RCJW0923 
135266 RCJM1179 
131375 RC_R41294_s 
133753 RCJM2307J 
128540 RCJW3189J 
124785 RC.R43306 

124792 RCJW4357 

124793 RC_R44519 
sequence, 

124799 RCJM5088 

124812 5 RC - R47948J 

124821 RC_RS1524 

127274 RCLR54950 

124835 RC.R55241 

124845 RC.R59585 

124847 RC_R60044 

440830 RC.R60872 



AW162919 

AL134275 

H11368 

AA553722 

R45154 

R41179 

AW293165 

NM.004427 

AW297929 

W38537 

R44357 

R44519 

R450SS 

R47948 
H87832 
AW966158 
R55241 



W07701 
8E561430 



Hs.170160 

Hs.6434 

Hs.141055 

Hs.92096 

Hs.106604 

Hs.97393 

HS.143134 

Hs.165263 

Hs.328317 

Hs.280740 

Hs.48712 



Hs.188732 

Ks.7386 

Hs.58582 

Hs.101214 

Hs.101255 

Hs.304177 

Hs.239388 



gb.7h31a05./1 Scares placenta Nb2HP Homo sapiens cDNA done 5 1 similar to contains L1 

Hs.52763 anaphase-promoting complex subunlt 7 

hypothetical protein MGC12936 

RAB2, member RAS oncogene My-ifte 

hypothetica! protein DKFZp761F2014 

Homo sapiens done 23758 mRNA sequence 

ESTs. Moderately similar to A46010 X-Onked retinopathy protein {H^apiens] 

ESTs 

WAA0328 protein 
ESTs 

early development regulator 2 (homotog of poiyhomeofic 2} 
EST 

hypothetical protein MGC3040 
hypothetica) protein FU20736 

gb:yg24hQ4 j1 Scares infant brain 1NIB Homo sapiens cDNA done IMAGED 3350 3', mRNA 
gb.-yg38g04.s1 Scares infant brain 1 NIB Homo sapiens cDNA done IMAGE:34896 3\ mRNA 
ESTs 

ketch (OrosophlbHKe 3 

Homo sapiens cONA FU 12789 fis, done NT2RP2001947 

EST 

ESTs 

Homo sapiens done FLB8503 PR02286 mRNA, complete cds 
Human DNA sequence from done RP1-304B14 on chromosome 6. Contains a gene for a novel 



protein and a part of a gene for a novel protein with two isoforms. Contains ESTs, STSs, GSSs and a CpG island 

124861 RC_R66690 R67567 Hs.107110 ESTs 

130141 RC_R67266_s NNL0G4455 Hs. 150956 exostoses (muitipleHike 1 

124879 RCJ*73588 R73588 Hs.101533 ESTs 

124892 RC.R79403 AJ 970003 Hs. 23756 hypofteticaJ protein similar to swine acylneuraminate lyase 

124906 RCJ187647 H75964 Hs.107815 ESTs 

124922 RCJTO22 R93622 Hs.12163 eukaryotic translafon inflation factor 2, subunlt 2 {beta, 38kD ) 

124940 RC_R99599_s AF068846 Hs.103804 heterogeneous nuclear rtocnudeoproteln U (scaffoU attachment factor A) 

124941 RCJB9612 AI766661 Hs.27774 ESTs, Highly similar to AF161349 1 HSPC086 (H^aptens] 

124943 RC_T02888 AW963279 Hs.123373 ESTs. Weakly similar to AIUIJWMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
WARNING ENTRY [H.sapien$] 

124947 RCJTO3170 T03170 Hs.100165 

124954 RCJT10465 AW964237 Hs.6728 

132924 RC.J15418 f U55184 Hs.154145 

133113 RC_T15597J BE383768 Hs,65238 

132975 ROJ15653J R43504 Hs.6181 

133235 RC_T16898.s AW960782 Hs.6856 

131082 RCJ26644J AI091121 Hs.246218 

124980 RCJT40841 T40841 Hs.98681 

124984 RCJT47586J BE313210 Hs^23241 

124991 RC.T50116 T50116 

to slmDar to SP:VE22_LAMBD P03756 EA22 GENE , mRNA sequence. 

129475 RC_T50145_s NMJXM477 Hs.203772 FSHD region gene 1 



ESTs 

WAA1548 protein 
hypothetical protein FU1 1585 

95 kDa retinoblastoma protein binding protein; K1AA0861 gene product 
ESTs 

ash2 (absent small, or homeotfc, Drosophila, homotogHite 
Homo sapiens cDNA: FU21781 fis, clone HEP00223 
ESTs 

eukaryofic translation elongation factor 1 delta (guanine nucleotide exchange protein} 
gbyb77c10.s1 Stratagene ovary (937217) Homo sapiens cONA done IMAGE77202 7 similar 
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125000 RCJ58615 

[32932 RC.T59940J 

129534 RCJ63595 

125008 RCJT64891 

125009 RCJT64924 
I32940 RC.TB4933,r 

125017 RC.T68875 
sequence. 

125018 RC_T69027 
25020 RC_T69924 
129891 RCJ70353 
I34204 RCJ79780_s 
I25050 RQ.T79951 
I25052 RC_TB0174_s 
1 25054 RCTB0622 
'25053 RCTB5352 



AIQ84613 

AI873257 

AW970209 

T85104 

T80622 

T85352 



sex comb on mitfleg homolog 1 

gbyc19d03/1 Stratagene tung (937210) Homo sapiens cDNA done 5", mRNA sequence 
Hs.13197 ESTs 

Hs.7994 hypothetical protein FU20551 
Hs.111805 ESTs 

Hs.222779 ESTs, Moderately similar to similar to NEDD4 (H sapiens] 
Hs.268601 ESTs, Weakly similar to envelope {H.saplensl 
. „ gb:yd82d0U1Soare9 fetal Bver spleen 1NFLS Homo sapiens cDNA done IMAGEi 14721 y 

staffer to contains Ah) repetitive etementcontakis L1 repetitive element mRNA sequence. 

125084 RCJ85373 T85373 gb.-yd32f07.s1 Soares fetal Bver spleen 1NFLS Homo sapiens cONA done 1MAGE:1 14757 3* 

sbnBar to contains Alu repeflflve element; contacts MER3 repeSGve element ; mRMA sequence. 
125066 RCJT86284 T86284 
Ahi repetitive element;, mRNA sequence 
112264 RCJ89579_s AL045364 
125080 RCJ90360 T90360 
WARNING ENTRY ^sapiens] 
125097 RC_T94328J AW576389 
125104 RCJ95590 T95590 



T58815 

AW118826 

AKD02126 

T91251 

T64924 

T79138 

T68875 



Hs.110640 

KsX093 

Hs.11260 

Hs.303046 
Hs.127243 



ESTs 

Homo sapiens cDNA: FU22783 fs, done KAIA1993 
hypothetical proton FU1 1264 

gb:ydB0a10 M Soares fetal liver spleen 1NFLS Homo sapiens cONA done J, mRNA sequence 
ESTs 

Homo sapiens mRNA for K1M1724 protein, partial cds 

gb.yc30fQ5^1 Stratagene Over (937224) Homo sapiens cDNA clone 1MAGE82209 3\ mRNA 



T69027 Hs.57475 



Hs.79353 
Hs.268620 

Hs335774 



gb:yc77b07.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done 3' similar to contains 
transcription factor Dp-1 

ESTs, Highly similar to AUU6JWMAN ALU SUBFAMILY SP SEQUENCE CONTAMINATION 

EST, Moderately similar to S65657 alpha<10adrenergic receptor splice form 2 fH^apiens] 
gb:ye40a03^1 Soares fetal Over spleen 1NFLS Homo sapiens cDNA clone 3* similar to 
gb^10817|IGURRAA Iguana Iguana 5S (rRNA Y„ mRNA sequence 

135107 RC.T97257J T97257 Hs.337531 ESTs, Moderately similar to 138022 hypothetical protein [H^apiens] 
129550 RC.T97599J AA845462 Hs.124024 deltex(Drosophlla) homolog 1 

125118 RCJ97620 R10606 gb:yf35f1 1 s1 Soares fetal Over spleen 1 NFLS Homo sapiens cONA clone [MAGE:1 28877 3* 

similar to contains Alu repetitive element:. mRNA sequence. 

EST 

fibrillin 2 (congenital contractural aradinodactyly) , 
ESTs 

K1AA1321 protein 

Empirically selected from AFFX single probeset 
Homo sapiens mRNA M length Insert cONA done EURO IMAGE 614975 
activity-dependent neuroprotective protein 
ESTs 
ESTs 
ESTs 

hypothetical protein LOC57187 
heteroduomatln-fike protein 1 
hypothetical protein MGC4251 
nuclear mitotic apparatus protein 1 
H1 hlstone family, member 0 
sex comb on mkfleg (DrosophHaHIke 1 
S100 calctonvbtndlng protein A9 (cafgranuiln 8) 

ESTs, Weakly similar to TSP2JHUMAN THROMBOSPONDIN 2 PRECURSOR [H.sapiens] 
ESTs 

hypothetical proteh MGC2749 

ESTs, Weakly similar to ALUS J4UMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 

I25225 RC.W76540 W74169 Hs.16492 DKFZP564G2022 protein 

I25228 RC W79397 AA033982 Hs.110059 ESTs, Weakly similar to I33022 hypothetical protein [H.sapiens] 

132393 RC.WB5888 AL135094 Hs.47334 hypofneflcal protein FU 14495 

125238 RCJV86038 N99713 Hs. 10951 4 ESTs 

I25247 RC W86881 AA694191 Hs.163914 ESTs 

129296 RCW87804 AI051967 Hs.1 10122 ESTs 

I25263 RC W88942 AA098878 gban45g10.r1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA done 5\ mRNA „ 

I25266 RC~W90022 W90O22 Hs. 186809 ESTs, Highly similar to LCT5LHUMAN LEUKOCYTE CELL-DERIVED CHEMOTAXIN 2 
PRECURSOR [H.sapiens] 

131321 RC.W92272 U91543 Hs.25601 chronxxiomaln heScase DNA binding protein 3 

131601 RCW92764_s NMJJ07115 Hs^9352 tumor necrosis factor, alpha-Induced protein 6 

131677 RC.W93040 H05317 Hs.283549 ESTs 

I20837 RC W93092 BE149656 Hs3Q6621 Homo sapiens cDNA FU11963 fis, clone HEMBB1001051 

125277 RC W932Z7 W93227 Hs.103245 EST 

125278 RC W93523 AI218439 Hs.1 29998 enhancer of pofycombl 
125280 RC_W93659 AI123705 Hs.106932 ESTs 

131856 RC W940O3.S W93949 Hs33245 ESTs v 

131844 RC W94401 s AM 19294 Hs.324342 ESTs 

125284 RC W94688 NA/L002666 Hs, 103253 perlfipln 

113447 RC W94787js AW016321 Hs.82306 destrto (adln depolymenzing factor) 

; 30799 RCJ08294_s AB028945 Hs.12696 cortacun SH3 ctomain-blnding protein 

125289 RC_£38311 T34530 Hs.4210 Homo sapiens cDNA FU 13069 fis, done NT2RP3001752 

I28874 RC_Z38465_s H06245 Hs.106801 ESTs, Weakly similar to PC4259 ferritin assodatBd protein [H.sapiens] 



125120 


RCJT97775 


T97775 


Hs.100717 


134160 


RCJT98152 


T98152 


Hs.79432 


125136 


RCJW31479 


AW962364 


Hs.129051 


125144 


RC W37999 


AB037742 


Hs.24336 


125150 


RCJW38240 


W38240 




104180 


RCJW4O150 


AA247778 


Hs.119155 


131987 


RC_W45435 


AW453069 


Hs.3657 


125178 


RC.W58202 


W93127 


Hs31845 


125180 


RC_W58344 


W58469 


Hs.103120 


125182 


RC.W58650 


AA451755 


Hs.263560 


130586 


RCW68736 • 


AL030996 


Hs.16411 


125197 


RC W69106 


AFO86270 


Hs^78554 


133497 


RC W69111 


BE617303 


Hs.74266 


100562 


RC W69385JJ 


NM_006185 


Hs.301512 


125639 


RCJV69399_s 


Z97630 


Hs226117 


129232 


RCJW69459 


R98881 


Hs.109655 


101495 


RC W72424 


W72424 


Hs.1 12405 


125209 


RC.W72724 


W72724 


Hs.103174 


125212 


RCJV72834 


AA746225 


Hs.103173 


129132 


RC.W73955 


BE383436 


Hs.108847 


125223 


RC_W74701 


AI916269 


Hs.109057 


WARNING ENTRY [H.sapiens] 
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130966 RCjB8525_s AW971018 Hs^1659 

128875 RC.Z38538J AB040923 Hs.108808 

133200 RCJ38551_s AB037715 Hs. 183533 

130158 RC_Z38783_s AB032947 Hs,151301 

5 125295 RCJZ39113 A8022317 Hs.25887 
donah, (sentaphorin) 4F 

12533 RCJ39255J AW972542 Hs.289008 

125300 RCJ29591 239591 Hs.101376 

323122 RC_Z39783_S BE622770 Hs.264915 

10 311463 RCJZ39920 R55344 Hs.22142 

130882 RC_Z40166J AA497044 Hs.20887 

128888 RC_740388j3 AI760853 Hs^41558 

125310 RCLZ40646 R59161 Hs.124953 

125315 RCLZ41697 R38110 Hs. 106296 

15 125317 RCJ99349 Z99348 Hs.112461 

135096 RCJ99394_s AA081258 Hs.132390 

104786 RC_AA027168 AA027167 Hs.10031 

132837 O58024.S AA370362 Hs.57958 

120456 RCJW251113 AA488750 Hs.88414 

20 132459 RCJW347573 AL120071 Hs.48998 

101545 M31210 BE246154 Hs.154210 

133505 C01527 AI630124 Hs.324504 

132360 RC_N6294B_s AW893660 Hs.46440 

132738 RC W42674 AKD00738 Hs.264636 

25 119586 RCJW43000_s AF088033 Hs.159225 

129914 RC_N31750_s NR.012421 Hs.13321 

130839 AP009301 AB011169 Hs.20141 

132813 L37347 BE313625 Hs.57435 

134342 M99564 NM.000275 Hs.82027 

30 131878 RCJVW30673 AA083764 Hs.6101 

105426 RCJ\A251297 W20027 Hs23439 

132968 RCJW620722 AF234532 Hs.61638 

132173 RC_W46577_s X89426 Hs.41716 

113932 RC.W81237 AA256444 Hs. 125485 

35 114452 RCJWQ20825 AI369275 Hs.243010 
PROTEIN TC10 

115243 RCJW278766 AA806600 Hs.116665 

134403 RC_H93708_s AA334551 Hs.82767 

129647 RCJW9394 AB018259 Hs.118140 

40 11142B RC_H56559LS AL031428 Hs.174174 

115967 RC J AA446887 AI745379 H3.42911 

120726 RC.AA293656 AA293655 Hs.97293 

114995 RCJ\A251152 AA769266 Hs.193657 

303876 RCJW233334J! U64820 Hs.66521 

45 dominant, ataxtn 3) 

311463 RC.239920 R55344 Hs.22142 

120302 RC.M192173 AA837098 Hs.269933 

133071 RC_AA455044 BE384932 Hs.64313 

121032 RCJVA398504 AA393037 Hs.161798 

50 129829 U41813 AF010258 Hs.127428 

120245 RC.AA166965 AW959615 Hs.111045 

120985 RCJW398222 AI219896 Hs.97592 

114184 RCJ39095 R56434 Hs.21062 

447503 RCJW284744J AA115496 Hs.336898 

55 cds 

132837 RC _AA428201 AA370362 Hs.57958 

121034 RCJW398507 AL389951 Hs.271623 

119718 RCJW69216 W69216 Hs.92848 

120455 RC.AA251083 AA251720 Hs.104347 

60 125280 RC.W93659 AM23705 Hs.106932 

132155 RC.AA227903 AK001607 Hs.41127 

120609 RCJVA283902 AW978721 Hs266076 

121278 RC.AA401631 AA037121 Hs,9851B 

109023 RCJW157293 AA157293 Hs.72168 

65 129815 RCJD60208J BE565817 Hs.26498 

108061 RCJW043979 AA043979 Hs.62651 

113287 RC.T66647 T66847 Hs.194040 

114082 RCJ38239 AK001612 Hs.26962 

116334 RCJ\A491457 AL038450 Hs.48948 

70 131486 RC.Z40Q71j> F06972 Ha.27372 

107860 RC.M024961 AA024961 Hs.50730 

131263 RC_AA443826 AU077002 Hs.24950 

132207 RCJM43294 BE206939 Hs.42287 

129183 RCJW155743 BE561824 Hs.273369 

75 408431 RCJ23708 AI338631 Hs.43266 

120575 RCJW280934 AW978022 Hs.238911 



ESTs 

ketch prosopMaHkel 
hypothefical proton FU10210 
Ca2+-dependent activator protein for secretion 

sema domain, Immunogtobulln domain (lg), transmembrane domain (TM) and short cytoplasmic 

Homo sapiens cDMA; FU21814 fis, done HEP01068 
EST 

Homo sapiens cONA RJ12908 fis, done NT2RP2004399 

cytochrome b5 reductase b5R.2 

hypothetical protein RJ 10392 

anadne (DrospphEa) hornotog 2 

ESTs 

ESTs 

ESTs, Weakly similar to I38022 hypothefical protein [H.sapiens] 
zinc finger protein 36 (KOX 18) 
WAA0955 protein 

EGF-W^traphi&Meiated protein 

BTB and CNC homology 1, basic leucine zipper transcription factor 2 

Ebronecflji leucine rich transmembrane protein 2 

endoftetial differentiation, sphingotlpid G-protetn-ooupted receptor, 1 

Homo sapiens mRNA; cDNA DKFZp586J0720 (from done DKFZp586J0720) 

solute carrier family 21 (organic anion transporter), member 3 

hypothetical protein RJ20731 

ESTs 

rearranged L-myc fusion sequence 
similar to S. cerevislae SSM4 

solute carrier family 1 1 (proton-coupled divalent metal too transporters), member 2 

oculocutaneous albinism II {pink-eye dilution (murine) homotog) 

hypothetical protein MGC3178 

ESTs 

myosin X 

endoflietel ceU-spedfic molecule 1 
hypothetical protein FU 12604; K1AA1692 protein 

Homo saptens cDNA FU14445 fis, clone HEMBB1001294, highly similar to GTP-BINDING 

K1AA1842 protein 

sperm specific antigen 2 

KIAA0716 gene product 

KIAA0601 protein 

ESTs 

ESTs 

ESTs 

Machadc-Joseph disease (spinocerebellar ataxia 3, olivopontocerebellar ataxia 3, autosomal 

cytochrome b5 reductase b5R,2 
ESTs 

ESTs, WeaWy similar to AF257182 1 G-pratein-coupted receptor 48 (H.saplensJ 
ESTs 

homeoboxA9 
ESTs 
ESTs 
ESTs 

Homo sapiens. Similar to R1KEN cDNA 1810038N03 gene, done MGC^890, mRNA, complete 

^GF-TM74atrophEirweIated protein 

nudeoporinSOkD 

ESTs 

ESTs, Weakly similar to ALUCJWMAN nil ALU CLASS C WARNING ENTRY II! (H^aplens] 
ESTs 

rrypotheticalprotehFU13220 

ESTs, Weakly similar to A46010 X4Inked retinopathy protein [Haptens] 

Homo sapiens cDNA FU11490 Ms, done HEMBA1001918 

ESTs 

hypothetical protein FLJ21657 
EST 

ESTs. Weakly similar to I38022 hypothetical protein fH .sapiens] 
Homo sapiens cDNA FU10750 fis, clone NT2RP3001929 
ESTs 

BMX non-receptor tyrosine kinase 
ESTs 

regulator tfG-protein signalling 5 
E2F transcription factor 6 

uncharactertzed hematopoietic stem/progenitor ceBs protein MOS027 
Homo sapiens cONA: FU22536 fis. clone HRC13155 
hypothetical protein DKFZp762E1511; KIAA1816 protein 
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132121 RC J AA443284_s NWL004529 

17657 RCJ*39074 N39074 

34922 RC W04507 s AI718295 

18523 RCJW182B_s Y077S9 

16845 RC_H64973 AA649530 

15291 RC_AA279943 BE545072 

i 20326 RCJUM96300 AA195300 

30174 M29550 M29551 

29131 RCJW36489 AB026436 

29868 RCJW287032 AW172431 

18661 RC_N70777 AL137554 

29829 RCJW496921 AF010258 

15985 RC.AA447709 AA447709 

34637 RC_AA369856_s U87309 

132714 RCJW252598 W39388 

129771 RC.H73237 AL096748 

123360 RCJU504784 AA532718 

32902 RC.AA490969 A1936442 

13716 RCJ97750 AA001356 

13825 RC.W48860 AW014486 

30367 RCJ38501 AL135301 

20541 RC.AA27B298 W07318 

16727 RCJM3684 R76472 

18219 RC.N62231 AA862391 

19767 RCJV72562 W72562 

28917 RCJ\A481252 A1365215 

51553 RC_M02O928 AA018454 

32716 RCJU251288 BE379595 

18525 RCJJ67861 N67861 

14618 RCJW084162 AW979261 

19743 RC_W70242 AA947552 

108154 RC. d AA425151_s NNL005754 

I22798 RC_AA460324 AW366286 

133746 U44378 AW410035 

19822 RCJV74471 Af 086409 

22186 RC_AA435842 AA398811 

14941 RCJW243017 AA236512 

18053 RCJJ53367 N53391 

23234 RC.AA490227 NM001938 

29280 M63154 M63154 

18995 RC_N94591 N94591 

16750 RC.H05960 AA760689 

I29026 M96833 AL12Q297 

105127 RCJVA158132 AA045648 

14513 RCJW044825 AA044873 

11856 RC.T35697 H67899 

132036 W01568 AL157433 

130091 RCJA/88999 W88999 
sequence 

14108 U09564 AI267592 

19881 RC.W81456 W81486 

17770 RCJM7953 AW957372 

19850 RC.W80447 AI247568 

15439 RCJVA284561 A1567972 

123107 RCJ\A486071 AA225048 

406698 M24364 XD3068 

21231 RCJVW00780 AA814948 

32074 AB002366 AA478486 

13670 AB00O115 AB000115 

25277 RCJV93227 W93227 

14056 RCJW186324 AA188175 

121153 RCJW399640 AA399640 

121609 RCJW16867 AA416867 

20661 RCJVA287556 AA287556 

20850 RCJW349647 AA349647 

24947 RC.T03170 T03170 

130529 RC_AA280886 AA178953 

repetitive element, mRNA sequence 

117683 RCJJ40180 N40180 
IMAGE:276387 3' similar to contains Ll.tl 

20745 RC - AA302a09 AA302809 

20936 RCJUU85934 AA385934 

12597 RC.R78376 R78376 

120183 RCJM0174 AW082866 

120644 RCJVA237038 AI869129 



Hs.404 ' myaloWlymphold or mixeo^neage leukemia (trithorax posophBa) homolog); translocated to, 3 

Hs.44933 ESTs 

Hs.91161 prefokfti4 

Hs. 1701 57 myosin VA (heavy polypeptide 12, myoxin) 

gb:ns44i05.8 1 NCI_CGAPJUv1 Homo sapfens cONA clone, mRNA sequence 

Hs.122579 hypotiBfical protein HJ10461 

Hs.21145 hypothetical protein RG083M05.2 

Hs.151531 protein phosphatase 3 (formerty 2B), catalytic subunlt, beta boform (caldneurtn A beta) 

Hs.177534 dual specificity phosphatase 10 

Hs.13012 ESTs 

Hs.49927 protein kinase NYD^P15 

Hs. 127428 homeoboxA9 

Hs.268115 ESTs, Weakly similar to T08599 probable transcription factor CA1 50 [J-UapJens] 

his. 180941 vacuolar protein sorting 41 (yeast homolog) 

Hs.55336 Homo sapiens, clone MGC: 17421, mRNA, complete cds 

Hs.102708 DKFZP434A043 protein 

Hs.178604 ESTs 

Hs.59838 hypothetical protein RJ 10808 

Hs.18159 ESTs 

Hs.22509 ESTs 

Hs.6768 hypothetical protein FU 10849 

Hs.240 M-phase phosphoprotein 1 

Hs.65646 ESTs 

Hs.48494 ESTs, Moderately similar to A46010 X-Dnked retinopathy protein [H.sapiens] 

Hs.58119 ESTs 

Hs.206097 oncogene TC21 

Hil269211 ESTs 

Hs283738 casein kinase 1, alpha 1 

Hs.49390 ESTs 

Hs.291993 ESTs 

Hs.58086 ESTs 

Hs.220639 Ras-GTPase*ac6vafing protein SH3-domaln-blnding protein 

Hs.145696 spDdng factor (CC1.3) 

Hs.75862 MAD (mottters against decapentapieglc, Drosophlla) homolog 4 

Hs.301327 ESTs 

Hs.104673 ESTs 

Hs-87331 ESTs 

Hs.47629 ESTs 

Hs.16697 down-regulator of transcription 1, TBP-binding (negative cofactor 2) 

Hs.1 10014 gastric intrinsic factor (vitamin B synthesis) 

Hs.323056 ESTs 

Hs.92418 ESTs 

Hs.108043 Friend leukemia virus integration 1 

Hs.301957 nudix (nucleoside diphosphate linked moiety X)-type motif 5 

Hs.103446 ESTs 

Hs.4190 Homo sapiens cONA: FU23269 fis, clone COL09533 

Hs.37706 hypotheOcal protein DKFZp434E2220 

gbah70h035l SoaresJetaUlver_spleen_1NRS_S1 Homo sapiens cDNA clone 3', mRNA 

Hs.75761 SFRS protein kinase 1 

Hs.58648 ESTs 

Hs.46791 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] 

Hs.58452 ESTs 

Hs.193090 ESTs, Highly similar to AF161437 1 HSPC319 [H^aplens] 

Hs.104207 ESTs 

Hs.73931 major histocompatibility complex, dass II, DQ beta 1 

Hs,96343 ESTs, WeaHy similar to ALUCJ1UMAN 111! ALU CLASS C WARNING ENTRY Hi [Hsapiens] 

Hs,3852 KIAA0366 protein 

Hs.75470 hypothetical protein, expressed In osteoblast 

Hs.103245 EST 

Hs.82506 KIAA1254 protein 

Hs.97694 ESTs 

Hs.98185 EST 

Hs,263412 ESTs. Weakly similar to ALU8 JiUMAN till ALU CLASS B WARNING ENTRY Ifl (HsapfensJ 

Hs.96927 Homo sapiens cONA FU 12573 fis, done NT2RM4000979 

Hs.100165 ESTs 

gb:zp39e03.s1 Stratagene musde 937209 Homo sapiens cDNA done 3* similar to contains AJu 

gb:yy44d02.s1 SoaresjnuWpte_sclerosis_2NbHMSP Homo sapiens cONA done 
L1 repetitive element mRNA sequence. 

gb£ST10426 Adipose tissue, white I Homo sapiens cDNA J end, mRNA sequence. 

Hs.97184 EST, HfcWy similar to (defline not available 7499803) (C.elegans] 

Hs29733 EST 

Hs.65882 ESTs 

Hs.96616 ESTs 
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119023 RC.N98488 
IMAG&310129 3', mRNA sequence. 

107582 RCJW002147 AA002147 

118249 RCJJS2580 K62S80 

115022 RCJW252029 AA252Q29 

117710 RCJM519B N45198 

115341 RCJW281452 AA281452 

118898 RC_N90680 N46213 

121121 RCJW399371 AA399371 

118329 RCJJ63520 N 63520 
3*. mRNA sequence. 

119498 RCJV35416 W35416 

118111 RCJ«5493 N55493 
mRNA sequence. 

119062 RCJU6898 AW444881 

116710 RC_F10577J F10577 

119261 RC_T15956 T15956 

122723 RCJW457380 AM57380 
similar to contains L1.b3 11 repetitive element 

117732 RCJJ46452 N46452 



flb2b82h01 31 Soaies_sen8ScenLriiOT)b!astsJ^bHSF Homo sapiens cDNA clone 
Hs.59952 EST 

HS322925 EST, Weakly similar to putative p150 (Haptens] 
Hs.87935 ESTs 

Hs.47248 ESTs. Highly similar to similar to Cdc14B1 phosphatase [H.saplens] 
Hs.88840 EST, Weakly similar to granule ceD marker protein [M.muscuius] 
Hs.54642 methionine adanosyttransferase 11, b8ta 
Hs.183095 similar to SAU1 (sai (DrosopnllaJ-DXe 

gb:yy62fD1 .si Soatesjnutttole^sdeioslsJJNbHMSP Homo sapiens cONA done IMAGE278137 

Hs.156861 ESTs, Moderately similar to A46010 X-Hnked retinopathy protein [Haptens] 

gb.yv50c02.s1 Scares fetal Over spleen 1NFLS Homo sapiens cONA clone IMAGE:246146 3*, 

ESTs 



Hs.77829 
Hs.306088 
Hs.65289 



v-crk avian sarcoma virus CT10 oncogene homoiog 
EST 

gb.aa86b10.s1 Stratagene fetal retina 937202 Homo sapiens cONA clone IMAGE A38171 3' 
mRNA sequence. 

gb:yy76h09 .si Soaresjnultiplejsclerosls JNbHMSP Homo sapiens CDNA clone 



1MAGE:279521 3* similar to contains L1.t2L1 repetitive element ;, mRNA sequence. 
104787 RCJ\A027317 AA027317 gb:ze97d11.8l SoaresjBteU«art_NbHH19W Homo sapiens cONA done IMAGE366933 3* 

similar to contains Alu repetitive element;, mRNA sequence, 



100071 A28102 
115819 RCJVA426573 
130882 RC_^40166J 
125225 RCJV76540 
108339 RCJW70801 



WARNING ENTRY [H sapiens] 



A28102 
AA486620 
AA497044 
W74169 
AW151340 



100338 D63483 
121636 RCJM17027 
103875 RCJW418387 
118716 RCJJ73460 
119763 RC_W72450 
121917 RCJW428218 
132806 M91488 
130949 Y10659 
108806 RCJW129933 
133276 RC.AA490478 
134760 RCJW58 
132867 AA121287 
132051 AA091284 
114208 RCJ39301 
104094 AA418187 
128718 AA426361 
302032 RC_N20407 
115501 RCJ\A291553 
101997 U01160 
103708 AA037206 
101899 S59184 
115839 RCJW429038 
409459 D50678 
103563 Z22534 
123233 RC_AA490225 
[Haptens] 

121305 RCJ\A402468 
114798 RCJW159181 
133145 RCJW96549 



D86S64 
AA379203 
T26379 
AI658908 
R54146 
AA406397 
AI699432 
AV656840 
AF070578 
AW978439 
NM_000121 
AF226667 
AA393968 
AL049466 
AA418187 
NMJW2959 
NM_001992 
AA291553 
AU076536 
AA430591 
S59184 



D86407 
L02911 
AW974175 

AA402468 
AA159181 
H94227 



131567 RC.AA291Q15JJ AF015592 

112300 RC_R54554 H24334 

129507 RCJW192099 AJ236885 

121033 RCJU398505 AA398505 

121151 RCJW399636 AA399636 

121402 RCJW06063 AA406O63 

123203 RCJVA489671 AA352335 

132271 RCJW236466 AB030034 

125197 RC W691Q6 AFQ86270 

114935 RCJUV242809 H23329 
WARNING ENTRY [H.saplensl 

125279 RC W93640 AW401809 

108778 RC.AA128548 AF133123 

108087 RCJW045709 AA045708 

132466 RCJJ66810.S AI597655 

133328 R36553 AW452738 

124057 RC_F13604 AA902384 

124800 RCJU5115 AW864086 



Human GABAa receptor alpha-3 subunlt 

Hs.41135 endomurin-2 

Hs.20887 hypothetical protein FU 10392 

Hs. 16492 DKFZP564G2022 protein 

Hs.51615 ESTs, Weakly similar to ALU7.HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 

Hs.57735 acetyl LDL receptor; SREC 

Hs.306654 Homo sapiens cDNA FU13574 lb, clone PLACE1008625 

Hs.48802 Homo sapiens clone 23632 mRNA sequence 

Hs.1 18722 (ucosyttransferase 8 (alpha (1 ,6) fucosyi transferase) 

Hs.10450 Homo sapiens cONA: FU22063 fis, clone HEP10326 

Hs.98038 ESTs 

Hs.2786 1 9 hypo&ietlcal protein FU 10099 

Hs.2851 1 5 intsrleukin 13 receptor, alpha 1 

Hs.71168 Homo sapiens done 24674 mRNA sequence 

Hs.69504 ESTs 

Hs.89548 erythropoietin receptor 

Hs.5&553 CTP synthase II 

Hs.180145 HSPC030 protein 

Hs.7859 ESTs 

Hs.330515 ESTs 

Hs^81706 sorfiDnl 

Hs.128087 coagulation tactor II (thrombin) receptor 

Hs.190086 ESTs 

Hs.50984 sarcoma amplified sequence 

Hs.72071 hypothetical protein FU20038 

Hs.79350 RYK receptor-like tyrosine kinase 

Hs.28935 transducin-like enhancer of split 1, homoiog of Drosophfla E(sp1) 

Hs.54481 low density lipoprotein receptor-related protein 8, apolipoprotein e receptor 

Hs.150402 Acfivin A receptor, type I (ACVR1) (ALK-2) 

Hs.188751 ESTs, Weakly similar to MAPB J1UMAN MICROTUBULE-ASSOCIATED PROTEIN 1B 

Hs.291557 ESTs 

Hs.54900 serologically defined colon cancer antigen 1 

Hs.6592 Homo sapiens, done IMAGE:2961368, mRNA, partial cos 

Hs.28853 CDC7 (cell dMsbn cyda 7, S. cerevisiae, homologHike 1 

Hs.26125 ESTs 

Hs.112180 zinc finger protein 148 (pHZ-52) 

Hs.97360 ESTs 

Hs.143629 ESTs 

Hs.98003 ESTs 

Hs.65641 hypothetical protein FU20073 

Hs. 1 15175 sterlle-etoha motif and leucine zipper containing kinase AZK 

Hs .27 8554 hetero chroma tin-Tike protein 1 

Hs.290880 ESTs, Weakly similar to ALU INHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Hs.4779 KIAA1150 protein 

Hs.90847 general transcription factor UlC, polypeptide 3 (102KD) 

Hs.40545 ESTs 

Hs.49265 ESTs 

Hs.265327 hypolhetkal protein DKFZp761l141 

Hs.73853 bone morphoganetic protein 2 

Hs.138617 thyroid hormone receptor interactor 12 
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121029 RCJVA39S482 AA398482 Hs.97641 

120663 RC AA287627 AA827793 Hs. 105089 

102133 U15173 AU076845 Hs.155596 

108246 RCJVA062855 AK23132 Hs.146343 

5 125226 RC_W78134 AA782536 Hs.122647 

120260 RCJW71739 AK000061 Hs.101590 

124906 RC_R87647 H75964 Hs.107815 

109406 RCJVA226877 AA199883 Hs.67624 

109271 RCJWV185668 AW137422 H-LB6022 

10 125052 RC_T8G174_s TB5104 Hs.222779 

109101 RCJW167708 AW608930 Hs^2184 

115241 RC JW278723 AA648278 Hs. 193859 

117163 RCJH97909 N36861 Hs.42344 

113530 RC.T90313 T90313 Hs.16732 

15 120375 RCJVA227260 AF028706 Hs, 111227 

129435 AA314256 AF151852 Hs.111449 

114864 RC.AA235256 AA135332 Hs.71608 

103988 AA314369 AA314389 Ha.42500 

131006 RCJ\A242763 AF064104 HsJ22116 

20 106781 RCJ\A478474 AA330310 Hs.24181 

106141 RCJW424558 AF031463 HSJ9302 

116213 RCJ\A476738 AA292105 Hs.326740 

135266 AB002326 R41179 Hs.97393 

135058 RCJW430152 AI379720 Hs.93814 

25 119908 RCJV85844 AA524470 Hs.58753 

103695 AA018758 AW207152 Hs.186600 

103978 AA307443 NM.O16940 Hs,34136 

109485 RCJW233472 BE619092 Hs.28465 

129574 AA458603 AA026815 Hs.11463 

30 115347 RCJVA281528 AA356792 Hs.334824 

120765 RCJW338735 AW961026 Hs.96752 
WARNING ENTRY tH.sapians] 

121059 RCJVA398628 AA393283 

131887 AA046548 • W17064 Hs.332848 

35 member 1 

112064 RCJW3812 AU049390 Hs.22689 

115606 RCJW400465 AI025829 Hs.86320 

131750 RCJ*94855j l NNL004349 Hs.31551 

102123 U14518 NM_001809 Hs.1594 

40 129847 RCJV46767 N64025 Hs.296178 

133809 RCJ\A235275 AV649326 Hs.76359 

132210 RC_N51499_s NMJ0O72Q3 Hs.42322 

122356 RCJ\A443794 AA443794 Hs.98390 

114958 RCJ\A243708 N20912 Hs.42369 

45 103951 AA287840 AU53944 Hs.50115 

134703 RCJVA280704 AF117065 Hs.88764 
128727 AA287864 AI223335 Hs.50651 
105743 RCJ\A293300_s BE246502 Hs.9598 
domain, (sBmaphorIn)4B 

50 103744 AA076003 AA079267 
sequence 

114348 N80402 AL050321 Hs.301532 

114009 RCJV900&7 AI248544 Hs.103000 

134704 RCJ\A28Q849 AA837124 Hs.88780 
55 128629 AA399187 AU096748 Hs.1027Q8 

104410 H65925 A1807519 Hs.104520 

110200 RC_H21075 H21075 Hs.31802 

124483 RCJK3976 AI821780 Hs.179864 

101391 M14648 NM_002210 Hs.295726 

60 109657 RCJ=04826 R60900 Hs.26814 

117140 RCJTO13 H96813 Hs.42241 

132937 RCJ\A233706J AW952912 Hs.300383 

129799 R36410 AW967473 Hs.239114 

105077 RCJVA142919 W55946 Hs.234863 

65 100850 RC_N58561_s AA836472 Hs.297939 

131043 RCJ\A490925 AF084535 Hs.22464 

118417 RC_N66048J AF080229 

129254 RCJ\A243695 AA252468 Hs.1098 

119149 RC_R58910 BE304701 Hs.65732 

70 133996 AA091367 AA380267 Hs.78277 

110223 RC_H23747 H19836 Hs.31697 

117626 RC_N36090 AK001757 Hs.281348 

135286 RC JkA424469_s AW023482 Hs.97849 

122967 RCJ\A478521 AA806187 Hs.289101 

75 131236 AA282640 AF043117 Hs.24594 

128568 AA463380 H12912 Hs.274691 



EST 
ESTs 

BCL2faaenovirus E1B 19kD-hteracflng protein 2 
ESTs 

N-myrfstoyltransferase 2 

hypothetical protein 

ESTs 

ESTs 

ESTs 

ESTs, Moderate)/ similar to similar to NEDO-4 [H sapiens] 

hypothefical protein FLI20618 

ESTs 

ESTs 

ESTs 

2c family member 3 {odd-paired Drosophfla homotog, heterotaxy 1) 

CGI-94 protein 

ESTs 

ADP-rtbosytaflon factor-like 5 

CDC 14 (cell division cycle 14, S. cerevfelae) homotog B 

ESTs 

phcsdudn-fike 

hypotheecal protein MGC10947 

K1AA0328 protein 

hypothetical protein 

ESTs 

ESTs 

chromosome 21 open reading frame 6 

Homo sapiens cONA* FU21869 8s, etone HEP02442 

UMP-CMP Wnase 

hypothetical protein FU 14825 

ESTs, WeaWy similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 

gb:zt74e03j1 SoaresJestlsJIHT Homo sapiens cONA done 5", mRNA sequence 
SWI/SNF related, matrix associated, acfin dependent regulator of chromatin, subfamily e, 

Homo sapiens mRNA; cONA DKF2p58601318 (from clone DKFZp58601318) 
ESTs 

core-binding factor, runt domain, alpha subunit 2; translocated to, 1; cydin O-retated 
centromere protein A (17kO) 
hypothefical protein FU22637 
catalase 

A kinase (PRKA) anchor protein 2 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA 0KFZp761J1112 (from clone DKFZp761 J1112) 
male-specific lethaJ-3 (Drosoph!la)-iIke 1 
Janus kinase 1 (a protein tyrosine kinase) 

sema domain, Immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic 

gbzm97e10.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA done 2f, mRNA 

CRP2 binding protein 
WAA0831 protein 
ESTs 

DKF2P434A043 protein 

Homo sapiens cDNA FU 13694 (is, done PLACE2000115 

ESTs, Highly similar to A59266 unconventional myosin-15 IH.sapIens] 

ESTs 

integrin, alpha V (vitronectin receptor, alpha polypeptide, antigen C051) 

ESTs 

ESTs 

hypothetical protein MGC3032 

mannosidase, alpha, class 1A, member 2 

Homo sapiens cONA FU 12082 lis, clone HEMBB1002492 

cathepslnB 

epOepsy, progressive myoclonus type 2, Lafora disease (lafortn) 

gb^uman endogenous retrovirus K done 10.1 polymerase mRNA, partial cds 

DKFZp434J1813 protein 

ESTs 

DKFZP434F2021 protein 
ESTs 

hypothetical protein FU 10395 
ESTs 

glucose regulated protein, 58kD 

ubiqultination factor E4B (homologous to yeast UFD2) 

adenytateWnase3 
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i2688 RCJ03872 AW195317 
15192 RCJ\A261920 AA741024 
18688 RC_N71484 AK000708 
22264 RCJW436837 AA436837 
128981 AA135452 AA927177 
31042 RC R42457 
03704 AA028171 
21341 AA233107 
I06593 RCJW456826 AW296451 
15195 RCJW262156 AW968619 
15425 RC.AA284071 AA811895 
17258 RCJJ21299 
20209 RC_Z40892 
sequence 
34082 L16991 

04774 RCJVA026068 AW959755 
15625 RC.M401630 AA059459 
04469 N28707 
07401 W20054 
11686 RC.R21510 
15300 RCJW280026 AA280095 
15378 RCLAA282292 AA282292 
32224 RC_H97819 
13791 M95767 
29144 AA004987 
i04448 L44574 
32084 RC_T26981_s 
11831 RCJH6083 
14765 RCJW252163 AA463550 
15029 RC J AA252219 AL137939 
00457 H81492 
104536 R24011 
PROTEIN 91 

16167 RCJUW61562 AI091731 
103889 AA236771 R85350 
31978 RC_H48459_s AA355925 
18843 RC_N80181 
20837 RC_W93092 
33647 D21852 
29521 U41815 
03746 AA081876 
sequence 

32019 RCJ\A134965J H56995 
32310 RC.AA284107 AA173223 
17367 RCJJ24954 AI041793 
03743 AA075998 AA075998 



AA028171 
AF035528 



AF086041 
F02951 

L16991 Hs.79006 
Hs. 288896 
Hs.62592 
Hs.154304 
Hs.102987 
H523217 
Hs,88689 
Hs.279841 
HS.2S5410 
Hs.135578 
HS.20137 
NM.007331 Hs.110457 
NM-002267 Ha.3886 
R36095 Hs.268695 
Hs.337532 
Hs.40096 
Hs.285176 
Hs.158101 

Hs.87293 
Hs.101368 
Hs.36232 
Hs.221498 
Hs.306621 



N2B707 
N91453 
R22039 



N41549 

AI269096 
AL137275 



BE246400 
R24024 



N80181 
BE149656 

NMJJ15361 Hs.268053 

AF071076 Hs.112255 
AA075000 



gb:M15887 ACYL-CQA-BINDING PROTEIN (HUMAN);, mRNA sequence 



Hs.107716 hypothetical protein FU22344 

Hs.88378 ESTs 

Ks.169764 hypothetical protein FU20701 

gbzv57g07.s1 SoaresJesfe__NHT Homo sapiens cONA done 3\ mRNA sequence 

Hs£6041 CGG triplet repeat binding protein 1 

Hs.171637 hypotheBcal protein MGC2628 

Ks.151258 hypothetical protein RJ21062 

Hs. 1 53863 MAD (mothers against decapentapteglc, Drosophiia} homolog 6 

Hs.24605 ESTs 

Hs.155849 ESTs 

Hs.160680 ESTs, WeaWy stmflai to 154374 gene NF2 protein ptsapiens] 

Hs.42975 ESTs 

gb:HSC1 HB082 normalized infant brain cONA Homo sapiens cONA done c-1hb08 3*. mRNA 

deoxythymkJytate kinase (ihymidylatB kinase) 
Homo sapiens cONA FU12977 lis, done NT2RP20Q6261 
ESTs 

Homo sapiens chromosome 19, BAC 282485 (CfT-*344H19) 
ESTs 
ESTs 
ESTs 

hypothetical protein FU 10335 
ESTs 

chtobiase, di-N-acety}- 
hypothefical protein OKFZp434P0116 
Wolf-Hlrschncm syndrome candidate 1 
karyopherln alpha 3 (imporfin alpha 4) 
ESTs 

ESTs, Weakly similar to A47582 B-ceD growth factor precursor [H^apiens] 
ESTs 

906tyV*Co©nzy rnc A trsnsportsr 

Homo sapiens cDMA RJ14673 lis, clone NT2RP2003714, moderately similar to ZINC FINGER 

hypothetical protein FU20045 
ESTs 

K1AA0186 gene product 
ESTs 

Homo sapiens cONA FU11963 Ms, done HEMBB1001051 
WAA0029 protein 
nucleoportn 98kD 

gb:zm83c07.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA done 3\ mRNA 

Hs.37372 Homo sapiens ONA binding peptide mRNA, partial cds 
Hs.289044 Homo sapiens cONA FU12048 fis, clone HEMBB1001990 
Hs.42502 ESTs 

gb^m89b09/t Stratagene ovarian cancer (937219) Homo sapiens cDNA clone 5' similar to 



103761 AA085138 AA765163 
BINDING PROTEIN (HUMAN);, mRNA sequence 



AA913909 
AA504428 
AI187925 
BE219898 



I30237 U9060 

28752 RC_N72879 

35162 AA045930 

131386 AA096412 

29021 RCJW599244 AUJ44675 

>24274 AA293634 W73933 

129913 H06583 

131888 U79298 
23803 mRNA 

18612 RCJJ69466 

322026 AA203138 

10892 RCJJ38882 

11429 RCJW1245 

13334 RC.T76962 

104091 AA417310 



Hs.153088 
Hs.10487 
Hs.95667 
Hs.173135 
Hs.173081 
Hs.283738 
NM_001310 Hs.13313 
AW294659 Hs.34054 



gb:nz79b10.s1 NCL.CGAPJ3CB1 Homo sapiens cDNA done J similar to gb:M34539 FK506- 

TATA box binding protein (TBP)-associated factor, RNA polymerase I, A, 48kD 

Homo sapiens, done IMAGE:3954132, mRNA, partial cds 

F-box protein 30 , 

duakpedfidty tyrostne-OO-phosphorytatlon regulated kinase 2 

K1AA0530 protein 

casein kinase 1, alpha 1 

cAMP responsive element binding protein-like 2 

Homo sapiens cDNA: FU22488 fis, done HRC10948, highly similar to HSU79298 Human done 



AB037788 Hs.224961 

AW024973 Hs.283675 

AL035301 Hs.97375 

AI038052 Hs.19162 

AW974666 Ha.293024 

8E465093 Hs.106101 
I05246 RCJVA226879 AA226879 

IMAG£:663656 3* similar to contains AJu repetitive element mRNA sequence. 

13300 RCJ67448 T67448 Hs.13101 ESTs 
AW901347 



cleavage and polyadenytation specific factor 2, 100kD subunit 
NPD009 protein 

Haptens gene from PAC 106H8 
ESTs, WeaWy similar to 154374 gene NF2 protein JHsaplensl 
ESTs 

hypothetical protein FU22557 

gbw19c09.s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA clone 



17147 RC_H97225_s 

21349 RCJW405205 AA405205 

I00294 049396 AA331681 

33999 M28213 AA535244 

33259 AA278548 BE379646 

29423 AA371418 AA204686 

31098 RCJW459668 U66669 

35272 AA399391 AJ828337 

29155 AA046865 A1952677 



Hs.38592 hypothetical protein RJ23342 

Hs.97960 ESTs, WeaWy similar to T51 146 ring-box protein 1 (H .sapiens] 

Hs.75454 permdredoxin 3 

Hs.78305 RAB2, member RAS oncogene family 

Hs.6904 Homo sapiens mRNA full length insert cDNA done EUROIMAGE 2004403 

Hs.234149 hypothetical protein FU20647 

Hs.236642 3^draxyisobutvryl-Coenzyme A hydrolase 

Hs.97591 ESTs 

Hs.108972 Homo sapiens mRNA; cDNA DKFZp434P228 (from done DKFZp434P228) 
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311291 AA056319 
120750 RCJVA310499 
101002 J04058 
133012 AAD93241 
103879 AA228148_s 
131281 RC_ J AA443212 
115109 RCJW256383 
118502 RC_N673t7 
134100 L07540 
131869 AA484944 
115398 RC^AA282985 
103650 AA203742 
135089 N75611..S 
129938 U79300 
107508 W90095 
103685 AA005190 
125170 AA203147 
129179 RCJW504125.S AW969Q25 
116262 AA477046 AI936442 
123009 RCJW479949 
131004 D29833 
103317 XB3441 
132814 RC_C15251J 
103992 U77718 
109258 X59710 
110754 RCJ120814 
132727 AA136382.S 
100341 D63506 
134664 AA256106 
103826 AA165564 
111678 RC.R20628 
101341 L76159 
115455 RC.AA285068 
111192 RCJ\A477748 
129385 RCJW235604 
125050 RCJ79951 
122105 RC.M432278 
121324 RCJ\A404229 
120938 RCJ\A386260 
115001 RCJVA251376 
sequence. 

124799 RC.R45Q88 
sequence. 

122724 RCJW457395 
117791 RCJM8325 
121895 RCJVA427396 



AA782601 

A1191410 

AV855843 

AA847843 

BE543269 

AA251716 

AJ249977 

AL157488 

AA460085 

AW968547 

AA810854 

AW976877 

A1918035 

AW003668 

N74925 

AA158008 

AL020996 



AA535244 
029833 
X83441 
D60730 
BE018142 
AL044818 
AW302200 
N27495 
AF032922 
AA256106 
AW162998 
R38487 
NM_004477 
AA876O02 
AW021968 
AA172106 
AW970209 
AW241685 
AA404229 
AA386260 
AA251376 

R45088 

AA457395 
N48325 
M427396 



HSJ19817 ESTs 



Hs.169919 

Hs.62711 

Ks.50252 

Hs.25227 

Hs.88049 

Hs.50150 

Hs.171075 

Hs.33540 

Hs.89081 

Hs.36057 

H&301198 

Hs.135587 

Hs.38761 

Hs.292444 

Hs.8518 

Hs.109154 

Hs£9838 

HS78305 

Hs2207 

Hs.166091 

Hs.57471 

Hs.300954 

Hs.84928 

Hs.6336 

Hs.5565 

Hs.8813 

Hs.87507 

Hs.24684 

Hs.169927 

Hs.203772 

Hs.120551 

Hs.109438 

Hs.110950 

Hs.111805 



Hs.97842 
Hs.104632 



Hs.99457 
Hs.93956 



ESTs, Moderately stmflar to 2109260A B cell growth factor [Rsapiens) 
elecbw-transfef-Eavopfoteh, alpha polypeptide (gtutaric aciduria II) 
Homo sapiens, done IMAGE3351295, mRNA 
mitochondrial ribosomal protein L32 
ESTs 

protein kinase, AMP-acfivated, gamma 3 non-cataiyfc subunft 

Homo sapiens mRNA; cDNA DKFZp564B182 (from clone DKFZp564Bl82) 

replication (actor C (activator 1) 5 (36.51(D) 

ESTs, WeaWy similar to dJ309K20.4 pisaptens] 

ESTs 

ESTs 

roundabout (axon guidance receptor, DrasophOa) homotog 1 

Human done 23629 mRNA sequence 

Homo sapiens cDNA: FU21564 fis, done COL06452 

ESTs 

selenoprotein N 
ESTs 

hypothetical protein RJ 1 0808 
RAB2, member RAS oncogene tamtty 
salivary profine-rich protein 
BgaselV.rmATP-dependent 
ESTs 

Hunting tin interacting protein K 
nuclear transcription factor Y, beta 
KIAA0672 gene product 
hypothetical protein FU22626 
syntaxln binding protein 3 
ESTs 

KIAA1376 protein 
ESTs 

FSHD region gene 1 
tol-fike receptor 10 

Homo sapiens done 24775 mRNA sequence 

Rag C protein 

ESTs 

ESTs 

EST 

EST 

gb:zs10a06.s1 NCIjCGAPjGCBI Homo sapiens cDNA done !MAGE:6B4754 3*. mRNA 

gb:yg38g04.s1 Scares Infant brain 1NIB Homo sapiens cDNA done IMAGE34896 3*. mRNA 

ESTs 
EST 

gfczw33a02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE771O50 3' 



similar to contains Alu repetitive etementjcontalns MER12.t2 MER12 repetitive element ;, mRNA sequence. 



108244 RCJW062839 
3', mRNA sequence. 
117852 ROJW9408 
109298 RCJW205432 
122432 RCLAA447400 
124627 RCJI74625 



AA062839 gbzm05c09.s1 Stratagene corneal stroma (937222) Homo sapiens cDNA clone IMAGE:513232 

AW877787 Hs.136102 K1AA0853 protein 

R77854 Hs.250693 Kroeppeketaled zinc finger protein 

AA447400 Hs.187684 ESTs, Weakly similar to B34087 hypothetical protein [Ksaplens] 

N74625 gb:za55c03.s1 Soares fetal Bvef spleen 1NFLS Homo sapiens cDNA done IMAGE296452 3 



similar to gb:M14338 VITAMIN K-OEPENDENT PROTEIN S PRECURSOR (HUMAN);contatns OFRJtf OFR repetitive element mRNA sequence. 



115141 RCJW258071 AA465131 
128636 U49065 U49065 
115373 RCJVV282197 AA664862 
114651 RCJW1Q1400 AA101400 
132796 RCJW180487 NM_006283 
103749 RC_N35583 AL135301 
107328 T83444 AW959891 
115349 RCJU281563 AF121176 
111490 RCJW6862 R06862 
similar to contains L1 repetitive element ; 
103763 AA085354 AA085291 
contains Alu repetitive element:, mRNA sequence 



Homo sapiens done 25218 mRNA sequence 
mterleuWn 1 receptor-ilka 2 
CGl-07 protein 
ESTs 

transforming, acidic coOed-coa ccmtamiig prater 1 
hypothetical protein FU10849 
WAA0887 protein 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 16 

gb:yfl1e09.s1 Soares fetal Over spleen 1NFLS Homo sapiens cDNA done IMAGE:126568 J 
mRNA sequence. 

gb:zn01g08.s1 Stratagene colon KT29 (937221) Homo sapiens cONA clone 3' similar to 



Hs.64001 
Hs.102865 
Hs.181022 
Hs.189960 
Hs.173159 
Hs.8768 
HSJ6591 
Hs.12797 



118791 RC_N75520 N75520 

116644 RC.F03032 F03032 

116823 RC.H56485 AW204742 
(H^aplensl 

108940 RC_AA148603 AA148603 
IMAGE:567198 3\ mRNA sequence. 

112218 RC.R50057 R50057 

116557 RC.D20572J D20572 

133649 U25849 U25849 

131745 RC_C20746 AI82B559 



Hs.261003 ESTs, Moderately similar to B34087 hypotheflcal protein (H .sapiens} 

Hs.290278 ESTs, WeaWy similar to B34087 hypothetical protein [H.sapiensJ 

Hs.143542 ESTs, Highly similar to CSAJttJMAN COCKAYNE SYNDROME WD-REPEAT PROTEIN CSA 

gfczo09e04.s1 Stratagene neuroepitheiium NT2RAMI 937234 Homo sapiens cDNA done 

Hs.272251 Homo sapiens mRNA; cDNA DKFZp586M1418 (from done DKFZp586M1418) 

Hs.90171 EST 

Hs.75393 add phosphatase 1, soluble 

Hs.31447 ESTs, Moderately similar to A46010 X-Onked reSnopathy protein {H-sapiens] 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



AA251548 

H29882 

AA347919 

AA018298 

AF025771 

N67946 

AB020700 



116801 RC.H43879 H43879 
sequence. 

115006 RCJ\A251548 
123424 RCJUV598500 
120831 RCJ*A347919 

103691 AA018298 
121555 RCJVA412491 
111193 RC_N67946 
132061 RCJW058946 
134575 RCJW94568J AA194568 
115050 RCJVA252794 AA252794 
420208 031799 
133735 AC002O45jcpt1 
128546 221305 
111946 RC.R40697 
124879 RC_R735B8 
115683 AA410345 

103692 AA018418 



gb.7069hG9.s1 Scares breast 3NbHBst Homo sapiens (DMA clone IMAGE183233 3\ mRNA 



Hs.87866 
Hs.162614 



Hs.103332 
Hs50123 
Hs. 117569 
Hs.3830 



EST 
EST3 
EST 
ESTs 

2bw finger protein 189 
ESTs 

WAA0893 protein 
EST 
ESTs 

sflver (mouse homotog) Ute 
K1AA0220 protein 
cut0n5 

C9orf10 protein 
ESTs 

Junctional adhesion molecule 2 

Homo sapiens chromosome X map Xp1 1.23 L-type calcium channel alpha-1 subunit 
(CACNA1F) gene, complete cds; HSP27 pseudogene, completa sequence; and JM1 protein, JM2 protein, and Hb2E genes, complete crjs 
103767 AA089888 BE244667 Hs.296155 CGM0O protein 

ESTs. Highly similar to LCT2_HUMAN LEUKOCYTE CELL-DERIVED CHEMOTAXIN 2 



BE276055 
R66740 
NM.003478 



HsJ8009 
Hs.95972 
Hs.1 10813 
Hs.101299 



R73588 

AF255910 

AW137912 



Hs.101533 
Hs.54650 
Hs.227583 



Hs.166809 



125266 W90022 W90Q22 
PRECURSOR [Rsapiensl 

135235 AA435512 AW298244 Hs.293507 

134497 RCJ\A404494 BE25B532 HsJ251871 

426754 RC_AA278529J NM_014264 Hs.172052 

412177 RCJW342828_s Z23091 Hs.73734 

132000 RCJW044644 AW247017 Hs.36978 

124738 RC_AA044644 T07568 Hs.137158 

324000 RC_AA196729J AA604749 Hs.190213 

106896 RCJW196729J AW073202 Hs.334825 

132000 RCJWJ25858 AW247017 Hs.36978 

129577 RC_AA025858 N75346 Hs.82906 

107091 RCJW233519 AI949109 Hs.246885 

130296 RC_N52271 D31139 Hs.154103 

102855 RCJJ68399 NMJJ03528 Hs.2178 

113689 RCJWJ98874 AB037850 Hs.16621 

100939 RCJVA279667_s L04288 Hs.297939 

130430 RC_H22556 W27893 Ha. 150580 

106734 RC_N45979_s BE296690 Hs.288173 
intersectin 2 long isoform (ITSN2) mRNA 

135148 RC_AA43128B_s AA306478 Hs.95327 

134221 RC.AA609862 BE280456 Hs.80248 

105376 RCJ&5583 AW994032 Hs.8768 

124541 U77718 AF112222 Hs.44499 

134546 AA203147 AL020996 Hs.8518 

134000 RCJV93092 AW175787 Hs.334841 

125656 RC.W93092 AW516428 Hs.78687 

100939 RC_N58561_s L04288 Hs.297939 

125656 RCJ/V93092 AW516428 Hs.78687 

101779 RC_W69385_s BE543412 rK250505 

332489 RC.R22947 R23053 NA 

133000 RC_N38959J AL042444 Hs.62402 

125905 RC_N38959_f AI678638 Hs.6456 

129000 RC_H73050jb AA744902 Hs.107767 

100920 RC_H73050ji X54534 Hs.278994 



ESTs 

OTP synthase 
serlne/BuBonine kinase 18 
glycoprotein V (platelet) 
melanoma antigen, family A, 3 
ESTs 
ESTs 

Homo sapiens cDNA FU14752 lis, clone NT2RP3003071 

melanoma antigen, family A, 3 

CDC 20 (cell division cycle 20, S. cerevisiae, homolog) 

hypothetical protein FU20783 

UM protein (similar to rat protein kinase C-blndlng enigma) 

H2B histone family, member Q 

DKFZP434M 16 protein 

cathepsInB 

putative translation Initiation factor 

Homo sapiens cDNA: FU21747 fs, clone COLF5160, highly similar to AF182198 Homo sapiens 

C030 antigen, delta polypeptide (TJT3 complex) 
RNA-bindirtg protein gene wffii multiple splicing 
hypothetical protein FU10849 
pinin, desmosome associated protein 



N 

selenium binding protein 1 

neutral sphingomyelinase {N-SMase) activation associated factor 
cathepstnB 

neutral sphingomyelinase (N-SMase) activation associated factor 

reiinoic acid receptor, alpha 

HuOIChlpRedos 

p21/Cdc42/Rad -activated kinase 1 (yeast Ste20-retated) 
chaperonln containing TCP1, subunit 2 (bete) 
hypothetical protein PR01489 
Rhesus blood group, CcEe antigens 



107 



WO 02/079492 



PCT/US02/04915 



TABLE 1 A 

Table 1A shows the accession numbers for those pkeys lacking unlgenelD's for Tables 1. The pkeys in Table 7 lacking urugeneJPs are represented within 
Tables 1-6A. For each probesei we have listed (he gene duster number from which the oDgonudeoSdes were designed. Gene dusters were complied 
5 using sequences derived from Genbank ESTs and mRNAs. These sequences were dustered based on sequence similarity using Ctustenng and 

Alignment Tools (DoubleTwtst Oakland California). The Genbank accession numbers for sequences oompnsmg each cluster are feted in me , Accession , 
column. 



1 0 Pkey: Unique Eos probeset Identifier number 

CAT number Gene duster number 

Genbank accession numbers 



15 Pkey CAT Number Accession 



108469 
124106 
108501 
20 108562 
125008 
125020 
125066 



25 



30 



125104 
124575 
125263 
116845 
118417 



35 

116584 
103743 
103744 
103746 

40 103761 
103763 
120209 
120284 
112540 

45 111904 
121059 
121094 
114106 
130091 

50 122264 
108280 
129961 
130529 
108309 

55 107832 
123731 
116571 
132225 
125017 

60 125083 
125064 
100964 
125118 



116761J AA079487 M128547AA128291 AA079587 AA079600 
125446J H12245AA094769R14576 
13684 -12 AAD83S6 

36375~1 AA100796 AFO20589 AA074629 AA075946 AA100849 AA085347 AA 126309 AA07931 1 AA079323 AA085274 

1802095 J T91251 TB4691 T85665 

116017 J T69981 T69924 AA078476 

1814993 1 T86284T81933 

1532859J R61504F04247 

413347 J T95590AA703278HS2764 

1666649 1 N68168 N69168N9045Q 

1547_2 AA098878W88942 

393481.1 AA649530AA659316 H64973 

37186J AF080229 AF080231 AF030230 AF080232 ATO80233 AF080234 BE550833 AI636743 AW614951 BE467547 AI680833 
AI633818 N29986 U87592 U87593 U87590 U37591 S46404 U87587 AA463992 AW206802 A1970376 A158371 8 A1672574 
N25695 AW665466 A1818326 AA126128 AI480345 AW013827 AA248638 AJ214968 AA204735 AA207155 AA206262 AA204833 
AW003247 AW496808 AJ080480 A1631703 AI651023 AJ867418 AW818140 AA502500 A1206199 AI871282 A1352545 BE501030 
AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048 
AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 A1624817 BE46661 1 AI206344 AA574397 
AA348354 A1493192 

AW136928 A1685655 BE218584 BE465078 N68963 AA975338 BE147199 N76377 
AA075998 AA075999 AA070986 AA070896 AA 129207 AA078942 AA070783 AA078941 
AA079267 AA076003 
AA075000AA081876 

M765163AW298222M126126AA065138AA076058 
AA085291AA085354 
F02951 Z40892 F04711 
AA179656 AA1 82626 AA182603 
R69751 R70467 H69771 H80879 H80878 
241572 R39330 
AA393283AA398628 
AA4O2505AA398900 
AW602528BE073859Z38412 



65 125150 
116801 
118111 
118129 
118329 

70 118475 
111490 
111514 
104534 
12)340 



532052.1 
112194J 
114161J 
113452J 
114208J 
48290.6 
1531817J 
158963J 
1605263.1 
1719336 J 
273450,1 
275729 1 
1182096J 
23961.-3 
296527.1 
110682.1 
1706092J 
158447.1 
111495J 

genbankJ\A021473 
genbank^A609839 
genbanK_D45652 
genbank_AA128980 
genbank.T68875 
genbanK_T85352 
genbanR_T85373 
entre?J00212J00212 
149288 J R10606T97620AA576309 
entre^U30245U30245 

NOTJ : OUND.entre^W38240 W38240 



AA436837AA442594 
AA065069AA085108 
R23053R79884 R76271 
AA178953AA192740 

AA069818 AA069971 AA069923 AA069908 
AA021473 
AA609839 
D45652 
AA128980 
T68875 
T85352 
T85373 



genbank_H43879 
genbankJ455493 
genbanleN57493 
genbankJJ 63520 
genbankj^66845 
genbankj*06862 
gertbankj*07998 
R22303_at R22303 
genbankJ\A206828 



H43879 
N55493 
N57493 
N63520 
N66845 



R07998 
AA206828 
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10 



15 



20 



25 



30 



35 



40 



45 



120376 genbanlLAA227469 
104787 genbanlLAA027317 
120409 genbank^AA235050 
120745 genbankJ\A302809 
120809 genbanlLAA345495 
120839 genbanKJVA348913 
113702 genbanK_T97307 
115001 genbanJLM251376 
122562 genbankJ\A452156 
122635 genbank_AA454QB5 
108244 genbanlLAA062839 
108277 ganbanU\A064859 
122723 genbanLAA457380 
124028 genbank_P04112 
108403 8*^)^75374 
122860 genbanlLAA464414 
108427 gonbanKJW076382 
108439 genbanLAA07&986 
131353 231290J 
AA969360 

108533 genbanlLAA084415 
117031 genbanlLH88353 
124254 genbankJI69899 
101447 entrezjrf21305 
101458 entrezJA22092 
124577 genban!LN68300 

108940 genbanLAA148603 

108941 genbanKJ\A148650 
124627 genbankj>l74625 
124720 144582J 

124793 genbanlm44519 
124799 genbankJM5068 
117683 genbanK_N401BQ 
117732 genbank_N46452 
124991 genbanK_T50116 
119023 genban)LN9S488 
119239 95573_2 



AA227469 

AA027317 

AA235050 

AA302809 

AA346495 

AA348913 

T97307 

AA251376 

AA452156 

AA454085 



AA064859 

AM57380 

F04112 

AA075374 

AM64414 

AA076382 

AA078986 

AW411259 H23555 AW0 15049 AI684275 AW015886 AW068953 AW014O85 AI027260 R52686 AA918278 AJ129462 

N34869 AI94841B AA534205 AA702483 AA705292 

AA084415 

H88353 



M21305 
M22092 

N68300 
AA148603 
AA148650 
K74625 

R05283R11056 

R44519 

R45088 

N40180 

N46452 

T50116 



T11483T11472 



119558 NOT_FOUNDjBntrez_W38194 W38194 

119654 genbankJA/57759 W57759 

105246 genbankJVA226879 AA226879 

121350 genbanJLAA405237 AA405237 

121558 genbankJ\A412497 AA412497 

105985 genbanKJ\A406610 AA406610 

100071 eritoUW\Q2miQI2 

114648 genbanKuAA101056 AA101056 

121895 genbanlLAA427396 AA427396 

100327 entreUM5640D55640 

123315 714071J AA496369 AA496646 
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TABLE 2: 



Pkey. Unique Eos probesetUenfifier number 

Accession: Accession number used for previous patent filings 

ExAocn: Exemplar Accession number, Genbank accession number 

UnkjeneiD: Unkjene number 

UnjgeneTOe: Unkjene gene file 



100420 
100484 
100991 
101168 
101261 
101447 
101543 
101560 
101714 
101838 
102012 
102164 
102283 
102564 
102759 
102804 



10 



15 



20 



25 



30 



35 



40 



45 



50 109768 
110906 
111006 
111133 
113073 

55 113923 
115061 
115145 
. 115947 
116339 

60 116589 
117023 
117563 
118475 

^ 119073 

65 119174 
119416 
121335 
123160 

™ 123523 
70 123964 

124315 

124669 

124875 

125103 

75 125565 



103036 
103095 
103166 
103280 
103850 
104592 
104786 
104865 
104952 
105178 
105330 
105729 
105977 
106031 
106155 
106423 
107174 
107295 
108756 



100420 
100484 
100991 
101168 
101261 
101447 
101543 
101560 
101714 
101838 
102012 
102164 
102283 
102564 
102759 
102804 
102898 
103036 
103095 
103166 
103280 
103850 
104592 
104786 
104865 
104952 
105178 
105330 
105729 
105977 
106031 
106155 
106423 
107174 
107295 
108756 



109168 
109768 
110906 
111006 
111133 
113073 
113923 
115061 
115145 
115947 
116339 
116589 
117023 
117563 
118475 
119073 
119174 
119416 
121335 
123160 
123523 
123964 
124315 
124669 
124875 
125103 
125565 



ExAccn 


UnlgenefD 


086983 


Hs.1 18893 


NNL005402 


Hs. 288757 


J03836 


Hs.62085 


NM.005308 


Hs-211569 


D30857 


Hs.62353 


M213Q5 




M31166 


Hs.2050 


AW958272 


Hs. 347326 


M68874 


Hs.211587 


BE243845 


Hs.75511 


BE259035 


Hs.1 18400 


NM_000107 


Hs.77602 


AW161552 


Hs.83381 


U59423 


Hs,79067 


NM_005100 


Hs.788 


NM_002318 


Hs.83354 


NMJW2205 


Hs.149609 


M13509 


Hs.83169 


NM.005424 


Hs.78824 


AA1 59248 


Hs. 180909 


U84722 


Hs.76206 


AA187101 


Hs.213194 


AW630488 


Hs.25338 


AA027167 


Hs.10031 


T79340 


Hs.22575 


AW076098 


Hs.345588 


AA3 13825 


Hs.21941 


AW338625 


Ha 221 20 


H46612 


Hs.293815 


AK001972 


Hs.30822 


X64116 


Hs,171844 


AA425414 


Hs.33287 


AB020722 


Hs.16714 


BE122762 


Hs.25338 


AA1 86629 


Hs,80120 


AA127221 


Hs.1 17037 


AA135606 


Hs.189384 


AA219691 


Hs.73625 


F06838 


Hs.14763 


AA035211 


Hs.17404 


BE387014 


Hs.166146 


AW580939 


Hs.97199 


N39342 


Hs.103042 


AW953484 


Hs.3849 


AI751438 


Hs.41271 


AA740907 


Hs.88297 


R47479 


Hs.94761 


AK000290 


Hs.44033 


A1557212 


Hs.17132 


AW070211 


Hs.102415 


AF055634 


Hs.44553 


N66845 




BE245360 


Hs.279477 


R71234 




T97186 




AA404418 




AA488687 


Hs.284235 


AA608588 




C13961 




N(VLQ054Q2 


Hs.288757 


AI571594 


HS.102943 


A1887664 


Hs-285814 


AA570056 


Hs.1 22730 


R20840 





UnlgeneTttto 

Melanoma associated gene 
v-ral simian leukemia viral oncogene hom 
serine (or cysteine) proteinase InWbfto 
G protein-coupled receptor kinase 5 
protein C receptor, endothelial (EPCR) 
gbiHuman afcha sataHfe and satellite 3 
pentaxfrwebted gene, rapidly induced b 
WerceOutar adhesion molecule 2 
ptospholpase A2, group IVA {cytosolic 
connective tissue growth factor 
singed (Drosopnlla}-Iike (sea urchin fas 
damage«pecifc DMA binding protein 2 (4 
guanine nucleotide binding protein 1 1 
MAD (mothers against decapentapleglc, Dr 
A kinase (PRKA) anchor protein (gravtn) 
tysyf oxWase-Dke 2 

tntegrin. alpha 5 (fibronectln receptor, 
matrix metanoprotelnase 1 (interstitial 
tyrosine kinase with immunoglo buOn and 
peroxtaedoxin 1 

cadherin 5, type 2, VE-cadherin (vascula 
hypothetical protein MGC10895 
protease, serine, 23 
KIAA0955 protein 

B-cell CUJIyrnphoma 6, member B (zinc fl 

desrnoplaWn{DPI,OPll) 

AO036 protein 

ESTs 

Homo sapiens HSPC285 mRNA, partial cds 
hypofrefcal protein FU1 11 10 
Homo sapiens cDNA: FU22298 fe, done H 
nuclear factor l/B 

Rho guanine exchange factor (GEF) 15 
ESTs 

UDP41-acfityVelpha^alaetD^ 
ESTs 

gb.Tl10a05.s1 SoaresjjregnanUiterus.NbH 
RAB6 Interacting, kinesin-llke (rabktnes 
ESTs 
ESTs 

Homer, neuronal immediate early gene. 3 
complement component C1q receptor 
mkrotubute-associated protein 1B 
hypothetical protein FU22041 similar to 
Homo sapiens mRNA full length insert cDN 
ESTs 

WAA1691 protein 



ESTs, Moderately similar to I54374 gene 
Homo sapiens mRNA; cONA OKFZp586N0 1 2 1 ( 
unc5 {C.elegans homolog) c 
gfrza46c1 U1 Scares fetal fiver spleen 
ESTs 

gb:y54c08.3l Scares placenta Mb2HP Homo 
gb.-ye50h09.s1 Scares fetal Over spleen 
gb2w37e0ls1 Soares_tDtal_fetus_Nb2Hf8_ 
ESTs, Weakly similar to 138022 hypotheti 
gbae54e06.s1 Stratagene lung carcinoma 
gfcC13961 Ctontech human aorta polyA+mR 
v-ral simian leukemia viral oncogene hom 
hypothetical protein MGC12916 
sprouty{Drosophlla) homolog 4 
ESTs, Moderately similar to K1AA121 5 pro 
gb:yg05c08/l Scares Infant brain 1NIB H 
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126511 
126649 
449602 
127402 



126511 
126649 
449602 
127402 
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129765 
129884 
10 130639 
130828 
131080 
131182 
131573 
15 131756 
131681 
132083 
132358 
132456 
20 132576 
132718 
132760 
132968 
133061 
25 133161 
133260 
133491 
133550 
133614 
30 133691 
133913 
133965 
134068 
134299 
116470 
134989 
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100114 
100143 
100208 
100405 
100455 
100618 
100658 
45 100718 
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100991 
101110 
101156 
50 101184 
101317 
101345 
101475 
101496 
55 101543 
101560 
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101634 
101662 
60 101720 
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101837 
101840 
101864 
65 101966 
102013 
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70 102460 



75 



116470 
134969 
135073 
100114 
100143 
100208 
100405 
100455 
100618 
100658 
100718 
100828 
100991 
101110 
101156 
101164 
101317 
101345 
101475 
101496 
101543 
101560 
101592 
101634 
101682 
101720 
101744 
101837 
101840 
101864 
101966 
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102283 
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T92143 

AA001860 

AA001660 

AA358669 

H04150 

NH.001078 



M86933 

AF055581 

AI557212 

AW631469 

NNL001955 

A1824144 

AA040311 

AA443966 

AW361018 



NMJJ03542 

AB011084 

N92589 

NM.004600 

AA1259S5 

AF234532 

AI186431 

AW021103 

AA403O45 

BE619053 

AI129903 

NM.003003 



AU076964 

L34657 

A1379954 

AW580939 

AI272141 

AW968058 

W55956 

X02308 

AU076465 

NH.002933 

AW291587 

AW886941 

AI752163 

U56725 

BE295928 

AL048753 

J03836 

AI439011 

AA340987 

NM_001674 

L42176 

NM_005795 

BE410405 

X12784 

M31166 

AW958272 

AF064853 

AV650252 

AF043045 

M69043 

A1679352 

M92843 

AA236291 



X96438 

BE616287 

A1752666 

AW161552 

AU076887 



102560 102560 

102589 102589 

102645 102645 

102693 102693 

102759 102759 



BE243877 

R97457 

AU076728 

AL119566 

AA532780 

ML005100 



Hs.57958 

Hs.279531 

Hs279531 

Hs.227949 

Hs.107708 

Hs.109225 

Hs.110802 

Hs.1238 

Hs.13131 

Hs.17132 

Hs303213 

Hs^271 

Hs.23912 



Hs31595 

K&3383 

Hs279663 

Hs.46423 

Hs.46924 

HS261038 

Hs.554 

Hs^6145 

Hs.61538 

Hs.296638 

Hs.6631 

Hs.6906 

Hs,1700Q1 

Hs.74669 

Hs.75232 

Hs211573 

Hs.7753 

Hs.76146 

Hs.79025 

Hs.97199 ' 

Hs.83484 

Hs.92381 

Hs.94030 

Hs.82962 

Hs.278441 

Hs.78224 

Hs.82733 

Hs.75789 

Hs. 114599 

Hs.180414 

Hs.75424 

Hs.303649 

Hs.82085 

Hs.86386 

Hs.75693 

Hs.460 

Hs.8302 

Hs.152175 

Hs.76288 

Hs.119129 

Hs.2050 

Hs^47326 

Hs.91299 

Hs.75765 

Hs.81008 

Hs.81328 

Hs.1 18625 

Hs.343586 

Hs.183583 

Hs.75777 

Hs,76095 

Hs.178452 

Hs.76669 

Hs.83381 

Hs.28491 

Hs.211582 

Hs.76941 

Hs.63984 

Hs.8867 

Hs.6721 

Hs.183684 

Hs.788 



EGF-TM7-tetrophlSiHBlatBd protein 

ESTs 

ESTs 

SEC13(S.cenevisiaeHike1 
ESTs 

vascular ceD adhesion molecule 1 
von Wfltebrand factor 
amelogenin (Y chromosome) 



ESTs, Moderately sinter to 154374 gene 
ESTs 



ESTs • 

ESTs 

ESTs 

upstream regulatory element binding prot 
PMn 

H4 hbtone family, member G 
KIAA0512 gene product ALEX2 
ESTs, Weakly similar to 138022 hypofhefl 
Sjogren syndrome antigen A2 (60VO, ribon 
thymosin, beta, identified in neuroblast 
myosinX 

prostate differentiation factor 
hypothetical protein FU20373 
Homo sapiens cDNA: FU23197 fis, clone R 
eukaryoflc translation initiation factor 
vesicle-associated membrane protein 5 (m 
SEC14(S.cerevfeiaeHke1 ■ 
heparan sulfate proteoglycan 2 (periecan 
catumenki 

platelet/ertdothelial ceS adhesion molec 
WAA0096 protein 

complement component C1q receptor 
SRY (sex determining region Y)-box 4 
nudix (nucleoside diphosphate Dnked moi 
Homo sapiens mRNA; cONA OKFZp586E1624 (f 
thymidyiata synthetase 
WAA0015 gene product 
ribonudease, RNase A family, 1 (pancrea 
nJdogen2 

N-myc downstream regulated 
collagen, type VIII, alpha 1 
heat shock 70kO protein 2 
Inhibitor of ONA binding 1, dominant neg 
small Inducible cytokine A2 (monocyte ch 
serine (or cysteine) proteinase tnhibito 
myeloid cell leukemia sequence 1 (BCL2-r 
protylcajboxypeptloase (angiotenslnase C 
activating transcription factor 3 
four and a half UM domains 2 
calcitonin receptor-like 
calpaln Z (m/II) large subunit 
collagen, type IV, alpha 1 
pemaxln-related gene, rapidly induced b 
intercellular adhesion molecule 2 
guanhe nucleotide bindhg protein (G or 
6R02 oncogene 

rUamin B, beta (adln-blndhg prtrtetn-2 
nuclear factor of kappa Dght polypeptid 
hexokinasel 

zinc finger protein homologous to 2fp-36 
serine (or cysteine) proteinase tnhibito 



immediate early response 3 
catenin (cadhertn-associated protein), a 
nicotinamide N-methyltransferase 
guanine nucleotide binding protein 11 
spermidine/spermine Nl-acetyltransferase 
myosin, fight polypeptide kinase 
ATPase, Na*/K+ transporting, beta 3 poly 
cadherfn 13. H-cadherin (heart) 
cysteine-rich, angiogenic Inducer, 61 
lysosomal 

eukaryotJc translation initiation factor 
A kinase (PRKA) anchor protein (gravin) 
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102382 1Q28S2 
102915 102915 
102960 102960 
103020 103020 
103036 103036 
103080 103080 
103138 103136 
103195 1031% 
103371 103371 
103471 103471 
104447 104447 
104783 104783 
104865 104865 
104894 104894 
105113 105113 
105196 105196 
105263 105263 
105330 105330 
105492 105492 
105594 105594 
105732 105732 
105882 105882 
106031 106031 
106222 106222 
106263 106263 
106366 106366 
106634 106634 
106793 106793 
106842 106842 



AI767736 

X07820. 

AI904738 

X53416 

M13509 

AU077231 



106974 106974 
107061 107061 
107216 107216 
107444 107444 
108507 108507 
108931 108931 
109195 109195 
109456 109456 
110411 110411 
110906 110906 
111091 111091 
111378 111378 
111769 111769 
112951 112951 
113195 113195 
113542 113542 
113847 113847 
113947 113947 
115061 115061 
' 115570 115870 
116228 116228 
116314 116314 
117023 117023 
117156 117156 
117280 117280 
119866 119866 
121314 121314 
121622 121822 
122331 122331 
123160 123160 
124059 124059 
124358 124358 
124726 124726 
125167 125167 
125307 125307 
107985 107985 
125598 125598 
413731 413731 
116024 116024 
418000 418000 
126399 126399 
127566 1Z7566 
128453 128453 
128515 128515 
128623 128623 
128669 128669 



AA351647 

X91247 

Y00815 

AW204145 

AA533513 

T79340 

AF065214 

AB037816 

W84693 

AW388633 

AW338625 

AI805717 

AB024334 

AW504170 

W46802 

X64116 

AA356392 

W21493 

AA186715 

W25491 

H94997 

AF124251 

AA489245 

AI817130 

BE147611 

D510G9 

W28391 

A1554545 

AA147186 

AF047033 

AW956580 

AW001579 

AA035211 

AA300067 

AW160993 

AW6294M 

AA307634 

K83265 

H43374 

NM_005032 

W84768 

AI751438 

NMJ1Q5985 

A1767947 

AJ799104 

AW070211 

W73853 

M18217 

AA496205 

W07343 

AI743860 

AL133437 

AA488687 

BE387335 

AW070211 

NMJJ03654 

AL137540 

AW580945 

T40064 

T40064 

BE243845 

AA088767 

AA932794 

AA088767 

AI051390 

X02761 



W28493 



Hs.290070 

Ha.2258 

Hs.76053 

Hs.195464 

Hs.83169 

Hs.82932 

Hs^642 

Hs.13046 

Hs.75216 

Hs.156044 

Hs.93659 

Hs.22575 

Hs.18858 

Hs.8982 

H&93Q5 

Hs.6682 

Hs.22120 

Hs.289112 

H325001 

Hs.274344 

Hs.81988 

Hs.171844 

Hs^1321 

Hs.26329 

Hs.336429 

Hsmm 

Hs.16450 

Hs.26054 

Hs.88500 

Hs.9195 

Hs.6354 

Hs.211579 

Hs343258 

Hs.68301 

Hs.132904 

Hs.42699 

H&9645 

Hs.17404 

Hs.33032 

Hs.326292 

Hs.24230 

Hs.6650 

HsJ881 

Hs.7890 

H3.4114 

H3.41271 

Hs.48029 

Hs.50841 

Hs.178705 

Hs.102415 

Hs.172129 
Hs.193700 
Hs.182538 

Hs.110771 

Hs.284235 

Hs.283713 

Hs.102415 

Hs.104576 

Hs.102541 

Hs.330466 

Hs.71968 

Hs.71968 

Hs.75511 

Ks-83883 

Hs.83147 

Hs.83883 

Hs.116731 

Hs.28782G 

Hs.10085 

Hs.105509 

Hs.180414 



gelsofin (amyloidosis. Finnish type) 
matrix metaltoprote'rase 10 (strometysin 
DEAQ/H (Asp-GluAb-Asp/His) box polypep 
fQamtn A, alpha {actirvblndlng pratein- 
matitx metaBoproteinase 1 (intsisfiUa) 
cyclln D1 (PRAD1: parathyroid adenomatos 
gbHsaptens SOD-2 gene for manganese su 
eukaiyofic transJailon elongation factor 
thforedoxin reductase 1 
prated tyrosine phosphatase, receptor t 
ESTs 

protein disulfide isomeiase related prat 

B^CLUlymphoma6, member B (zinc fi 

phosphoGpase A2, group tVC (cytosoDc, 

Homo sapiens, done IMAGE3506202. mRNA, 

angiotensin receptor-like 1 

solute canter family 7, (caflonfc amino 

ESTs 

CGW3 protein 

tyrosine 3-mor»oxygenase/tryptophan 5-mo 
hypothetical protein MGC12942 
disabled (DrosophDa) homo log 2 (mitogen 
Homo sapiens cDNA: FU22296 8s, clone H 
Homo sapiens done FLB9213 PR02474 mRNA, 
hypothetical protein FU 14005 
RIKEN cDNA 9130422N19 gene 
hypothetical protein RJ22471 
ESTs 

novel SH2-contalning protein 3 
mltogeivacfjvated protein kinase 8 inter 
Homo sapiens cDNA FU13698 6s, clone PL 
stromal cell derived factor receptor 1 
melanoma ceil adhesion molecule 
proDferatton-associated 2G4, 38kD 
ESTs 

gbao38d01j1 Stratagene endothelial eel 
solute carrier famDy 4, sodium bicarbon 
ESTs 

Homo sapiens mRNA for WAA1741 protein, 
ESTs 

hypothetical proteh DKFZp434N185 
hypothetical gene DKFZp434A1114 
ESTs 

vacuolar protein sorting 45B (yeast homo 
ESTs, Weakly similar to S41044 chromosom 
Homo sapiens mRNA for K1AA1671 protein, 
piastin 3 (T Isoiomi) 

gbzh53dQ3.s1 Soares„fetaL(iver_spleen. 
Homo sapiens mRNA full length insert cDN 
snail 1 (drosophQa homotog), zinc hnge 
ESTs 

Homo sapiens cDNA FU1 1333 fis, clone PL 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs 

Homo sapiens cDNA: FU21409 fis, clone C 
Homo sapiens mRNA; cDNA DKFZp586l0324 (f 



metallothtonein 1E (functional) 

Homo sapiens cDNA: FU21904 hs, clone H 

ESTs, Weakly similar to 138022 hypotheti 

ESTs, Weakly similar to SS4054 hypotheti 

Homo sapiens mRNA; cDNA DKFZp586N01 21 (f 

carbohydrate (keratan sulfate Gal-6) sul 

netrin4 

ESTs 

Homo sapiens mRNA; cDNA OKF2p564F053 (fr 
Homo sapiens mRNA; cONA OKFZp564F053 (fr 
connective tissue growth factor 
transmembrane, prostate androgen induced 
guanine nucleotide binding proteUe 
transmembrane, prostate androgen induced 
ESTs 

GbronecSnl 

type I transmembrane protein Fn14 
CTL2gene 

healshock70kD protein 8 
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128914 128914 AW867491 Hs.107125 plasnralemma vestete associated protein 

129188 129188 NM.001078 Hs.109225 vascular cefl adhesion molecule 1 

129265 129265 AA530892 Hs. 171695 dual spedOcfly phosphatase 1 

129468 129468 AW410538 Hs.111779 secreiBd protein, acidic, cystetoe-rfch 

5 101838 101838 BE243845 Hs75511 connective 8ssue growfo factor 

129619 129619 AA209534 Hs.284243 tsbaspan NET-6 protein 

129762 129762 AA453694 Hs. 12372 tripartite motif protein TRIM2 

130018 130018 AA353093 metatothfanein 1L 

130178 130178 U20982 Hs.1516 insufovGke growth factor-binding prote 

10 130431 130431 AW505214 Hs.155560 caJnexln 

130553 130553 AF0 62649 Hs.252587 pituitary tumor-transforming 1 

130639 130639 AI557212 Hs.17132 ESTs, Moderatety similar to 154374 gene 

130686 130686 BE548267 Hs.337986 Homo sapiens cDNA RJ 10934 fis, done OV 

130818 130818 AW190920 Hs.19928 hypothetical protein SP329 

15 130699 130899 AI077288 Hs.296323 serorrtgiucooorticold regulated kinase 

131080 131080 NM_001955 Hs.2271 endotheto 1 

131091 131091 AJ271216 Hs.22880 oTpeplldytpeptidase (il 

131182 131182 A1824144 Hs.23912 ESTs 

131319 131319 NM-003155 Hs.25590 starmfecaWnl 

20 131328 131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco 

131328 131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco 

131555 131555 T47364 Hs.278613 Interferon, alpha*dudbte protein 27 

131573 131573 AA040311 Hs.28959 ESTs 

131756 131756 AA443966 HsJ1595 ESTs 

25 131909 131909 NM.016558 Hs.274411 SCAN dcmairKXintahlng 1 

132046 132046 A1359214 Hs.179260 chromosome 14 open reading frame 4 

132151 132151 BE379499 Hs.173705 Homo sapiens cDNA: FU22O50 fe, done H 

132187 132187 AA235709 Hs.4193 DKFZP58601624 protein 

132314 132314 AF 11 2222 Hs.323808 pinin, desrnoscme associated protein 

30 132398 132398 AA876616 Hs.16979 ESTs, Weakly sirdar to A43932 mucin 2 p 

132490 132490 NMJ501290 Hs.4980 UM domain binding 2 

132546 132546 M24283 Hs.168383 Intercellular adhesion molecule 1{C054) 

132716 132716 BE379595 Hs.283738 casein kinase 1, a^na 1 

132883 132883 AA373314 Hs.5897 Homo sapiens mRNA; cDNA DKFZp5B6P1622 (f 

35 132989 132989 AA480074 Hs.331328 hypoflietjcal protein RJ1321 3 

133071 133071 BE384932 Hs.64313 ESTs, Weakly similar to AF257182 1 G-pro 

133099 133099 W16518 Hs.279518 amyloid beta (M)pracuisor^ 

133149 133149 AA370045 Hs.6607 AX1N1 up-regulated 

133200 133200 AB037715 Hs. 183639 hypoftetJcal protein FU10210 

40 133260 133260 AA403045 Hs.6906 Homo sapiens cDNA; FU23197 fis, clone R 

133349 133349 AW631255 Hs.8110 L-3^yaraxyacy^Coenzyme A dehydrogenase 

133398 133398 NMJJ00499 Hs.72912 cytochrome P450, subfamily I (aromatic c 

133454 133454 BE547647 Hs.177781 hypoSieticalprote&i MGC5618 

133491 133491 BE619053 Hs.170001 eukaryotic translation initiation factor 

45 133517 133517 NM_00Q165 Hs.74471 gap Junction protein, alpha 1,43kO {con 

133538 133538 NM_003257 Hs.74614 tight junction protein 1 (zona ocdudens 

133584 133584 090209 Hs.181243 activating transcription factor 4 (tax-r 

133617 133617 BE244334 Hs.75249 ADP-ribosylatlon factor-like 6 interact] 

133671 133671 AW503116 Hs.301819 zinc finger protein 146 

50 133681 133681 AI352558 tyrosine 3-rnoncoxygenase/tryptoprian 5-mo 

133730 133730 0E242779 Hs.179526 unregulated by 1 254ihydroxyvttamln 

133802 133802 AW239400 Hs.76297 G proieirvcoupled receptor kinase 6 

133838 133838 BE222494 Hs.1 80919 inhibitor of DNA btrefing 2, dominant neg 

133889 133889 U48959 Hs.211582 myosin, Bght polypeptide kinase 

55 133975 133975 C18356 Hs.295944 tissue {actor pafliway Inhibitor 

134039 134039 NM_CO2290 Hs.78672 teminin, alpha 4 

134081 134081 AL034349 Hs.79005 protein tyrosine phosphatase, receptor t 

134203 134203 M161219 Hs.799 diphtheria toxin receptor (hepartrhbind} 

134299 134299 AW580939 Hs.97199 complement component C1q receptor 

60 134339 134339 R70429 Hs.81968 dteaoled (Orosophita) homohag 2 (mitogen 

134381 134381 AJ557280 Hs. 184270 capping protein (actin filament) muscle 

134416 134416 X68264 Hs.211579 melanoma cell adhesion molecule 

134558 134558 NM.001773 Hs.85289 CD34 antigen 

134983 134983 028235 Hs. 196384 proslagiarrtn^paroxkte synthase 2 (p 

65 135052 135052 AL136653 Hs.93875 decidual protein induced by progesterone 

135069 135069 AA876372 Hs.93961 Homo sapiens mRNA; cDNA DKFZp667D095 (fr 

135073 135073 W55956 Hs.94030 Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

135196 135196 C03577 Hs.9615 myosin regulatory Dght chain I smoofli 

134404 134404 AB000450 Hs.82771 vaccinia related kinase 2 

70 100082 100082 AA130080 Hs.4295 proteasome (prosome, macropain) 25S subu 

130150 130150 BE094848 Hs.15113 homogentisate 1,2-dioxygenase {nomogent] 

130839 130839 AB011169 H&20141 simrTartoS.cerevtslaeSSM4 

100113 100113 NM.001269 Hs.84746 chromosome condensation 1 

1(30129 100129 AA469369 Hs.5831 tissue Inhibitor of metallopnoteinase 1 

75 100169 100169 AIX37228 HsJ2043 D123 gene product 

100190 100190 M91401 Hs.178658 RAD23 (S. cerevistae) homotog B 



113 



WO 02/079492 



PCT/US02/04915 



100211 100211 
130283 130283 
100248 100248 
100262 100282 
5 100281 100281 
100327 100327 
134495 134495 
135152 135152 
100372 100372 

10 100394 100394 
100418 100418 
134347 134347 
100438 100438 
100481 100481 

15 100591 100591 
100662 100662 
100905 100905 
100950 100950 
135407 135407 

20 131877 131877 
134786 134786 
134078 134078 
134849 134849 
101152 101152 

25 131687 131687 
421155 421155 
133975 133975 
130155 130155 
132813 132813 

30 101300 101300 
130344 130344 
101381 101381 
133780 133780 
101447 101447 

35 101470 101470 
101478 101478 
133519 133519 
134116 134116 
130174 130174 

40 132983 132983 
101543 101543 
101620 101620 
133595 133595 
101700 101700 

45 134246 134246 
133948 133948 
133948 133948 
133948 133948 
101812 101812 

50 133396 133396 
129026 129026 
134831 134831 
134395 134395 
101977 101977 

55 101998 101998 
102007 102007 
416658 416658 
135389 135389 
130145 130145 

60 420269 420269 
102123 102123 
102133 102133 
102162 102162 
427653 427653 

65 102200 102200 
102214 102214 
131319 131319 
132316 132316 
134365 134365 

70 102298 102298 
302344 302344 
102367 102367 
102394 102394 
129521 129521 

75 102251 102251 
133746 133746 
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MNL012288 


Hs.153954 


WI015156 


H3.78398 


038500 


Hs.278468 


AF091035 


Hs.184627 


D55640 




D63477 


Hs.84087 


M96954 


Hs.182741 


NMJM4791 


Hs.184339 


D84284 


Hs.66052 


086978 


Hs.84790 


AF164142 


Hs.82042 


AA013051 


Hs.91417 


X70377 


Hs.121489 


NNL004091 


Hs.231444 


AI368680 


H3.816 


L12260 


Hs.172816 


AF128542 


Hs.166846 


J04029 


Hs.99936 


J04088 


Hs.156346 


T29618 


Hs*9640 


108895 


Hs. 78995 


BE409525 


Hs.902 


AI984625 


Hs.9884 


BE297635 


H3J3069 


H87879 


Hs.102267 


C18356 


Hs.295944 


AA 101 043 


Hs.151254 


BE313625 


Hs.57435 


BH535511 




AW250122 


Hs.154879 


AW675039 


Hs.1227 


AA557660 


Hs.76152 


M21305 




NMJ100546 


Hs.1846 


NM_002890 


Hs.758 


AW583062 


Hs.74502 


R84694 


Hs.79194 


M29551 


Hs.151531 


M30269 




M31166 


Hs2050 


S55271 


HsJM7930 


AA393273 


Hs.75133 


D90337 


Hs.247916 


D28459 


Hs.80612 


X59960 


Hs.77813 


X59960 


Hs.77813 


X59960 


Hs.77813 


BE439894 


Hs.78991 


M96326 


Hs.72885 


AL120297 


Hs.108043 


AA853479 


Hs.89890 


AA456539 


Hs.8262 


AF112213 


Hs.184062 


U01212 


Hs.248153 


U02556 


Hs.75307 


U03272 


Hs.79432 


U05237 


Hs.99872 


U34820 


Hs.151051 


U72937 


Hs.96264 


NM.001809 


Hs.1594 


AU076845 


Hs.155596 


AA450274 


Hs.1592 . 


AA 159001 


Hs.180069 


AA232362 


Hs.157205 


U23752 


Hs.32964 


NM.003155 


HsJ5590 


U28831 


Hs.44566 


AA568906 


Hs.82240 


AA382169 


H3.54483 


BE303044 


Hs.192023 


U39656 


Hs.118825 


NNLQ03816 


Hs.2442 


AF071076 


Hs.112255 


NMJXM398 


Hs.41706 


AW410035 


Hs.75862 



OEAD/H (Asp£hMlfrAsp/His) box pofypep 
TRAM-Qce protein 
KIAA0071 protein 

postrndotk/ scQjtfjHpon oicrBSS&d 2~GRb 
WAA0118 protein 

gbHuman monocyte PABL (pseudoautosomal 
WAA0143 protein 

TTA1 cytotodc granule-associatBd RNA-bi 
K1AA0175 gene product 
CD38 antigen (p45) 
K1AAD225 protein 

solute canter famfiy 23 (rtudeobase tra 
topobomerase (DNA) ti binding protein 
cystefinD 

Homo sapiens, Similar to hypothetical pr 

COV /mw jL.Lut.J-J— - mhUa VUuw O 

oKt (sexoetsnnBung region T}-oox l 
nsureguDn 1 

polymerase (DMA directed), epsifon 
kerafln 10 (epidermorytic hyperteratosis 
topdsomerase (DNA) II alpha (170kD) 
TEK tyrosine kinase, endothelial (venous 
MADS box transcription enhancer factor 2 
neurofibroma 2 (bilateral acoustic neur 
spindle pole body protein 
heat shock 70kD protein 9B (mortaGn-2) 
rysyt oxidase 

tissue factor pafoway inhibitor 2 
kaHikrein 7 (chyrnotryptic, stratum com 
solute carrier family 1 1 (proton-coupled 
transrnernbrane trafficking protein 
DiGeorge syndrome critical region gene D 
arntnotevuCnaia, delta-, dehydratase, 
deoorin 

gtoHuman alpha satellite and satellite 3 
tumor protein p53 (U-Fraumenl syndrome) 
RAS p21 protein activator (GTPase activa 
chymotr/psinogen B1 
cAMP responsive element binding protein 
prote'n phosphatase 3 (formerly 2B), cat 
nidogen (enacfin) 

pentaxirwelated gene, rapidly Induced b 
Epsilon , igE 

transcription factor 6-Cke 1 (mitochond 
natriuretic peptide precursor C 
ubiquifin-conjugafing enzyme E2A (RAD6 h 
sphingomyelin phosphodiesterase 1 , acid 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, acid 
DNA segment numerous copies, expressed 
azurocidin 1 (cauonic antimicrobial pro 
Friend leukemia virus Integration 1 
pyruvate carboxylase 
lysosomal 

putative Rab5-interacting protein 
olfactory marker protein 
tKX>nipJex-assocfated48stis-expfessed 1- 
fDbrUQn 2 (congenital contractural ara 
fetal Alzheimer antigen 
mttogen-ac&vated, protein kinase 10 
alpha thalassemia/mental retardation syn 
centromere protein A (17kD) 
BCL2/adenoviru3 E1B 19kD^teracting pro 
CDC16 (cell division cycle 16, S. cerevi 
nuclear respiratory factor 1 
branched chain aminotransferase 1, cytos 
SKY (sex determining region Y>box 1 1 
stanniocaicrn 1 
KIAA1641 protein 
syntaxin3A 

N-myc (and STAT) interactor 
eukaryooc translation initiation factor 
rnitogen-activated protein kinase kinase 
a dislntegrin and metaitoprotelnase doma 
nudeoporin 98kD 

DEAD/H (Asp^Iu^Asp/Hls) box polypep 
MAD (mothers against decapentapleglc, Dr 
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132628 132828 AB014615 Hs.57710 fibroblast growth tactor 8 (andr^en-lnd 

132828 132828 AB014615 Hjl57710 fibroblast growth factor 8 (androgervhd 

130441 130441 U63630 Ha 155637 protein kinase, DNA-acfivated, catalytic 

129350 129350 U50535 Hs.110630 Human BRCA2 region, mRNA sequence CG006 

5 130457 130457 AB014595 Hs. 155976 cuffin4B 

102560 102560 R97457 Hs.63384 cadhertn 13, H-cadherin (heart) 

134305 134305 U61397 Hs.81424 ubtquiSn-Uke 1 (sentrin) 

132736 132736 AW031883 Hs311578 Homo sapiens cDNA: FU23037 fis, done L 

102663 102663 NM.002270 Ha. 168075 karyopherin (Imporfln) beta 2 

10 102735 102735 AF111106 Hs.3382 protein phosphatase 4, regulatory subunl 

101175 101175 U82671 Hs.36980 melanoma antigen, family A, 2 

132164 132164 AI752235 Hs.41270 proconagen-fystne, 2-cxoglutarate 5-dto 

102826 102826 NH.007274 Hs.8679 cytosoOc acyl coenzyme A thloester hydr 

102646 102846 BE264974 Hs.6566 thyroid hormone receptor Interactor 13 

15 134161 134161 AA634543 Hs.79440 tGF-ll mRNA-blndirg protein 3 

302363 302363 AW1S3799 Hs.198365 2>bbphosphogtyceratemutase 

125701 125701 T72104 Hs.93194 apoprotein A4 

134656 134656 AI750878 Hjl87409 thrornbcspondin 1 

102968 102968 AU076611 Hs. 154672 methylene tetrahydrofolate derrydrogenase 

20 134037 134037 AI808780 Hs^2773Q integrin,alpha6 

103023 103023 AW500470 Hs.117950 multifunctional polypeptide simBar to S 

130282 130282 BE245380 Hs.153952 5* nucleotidase (CD73) 

128568 128568 H12912 Hs.274691 adenylate kinase 3 

103093 103093 S79876 Hs.44926 dipeptidytpeptidase IV (C026, adenosine 

25 129063 129063 X63094 Hs.283822 Rhesus blood group, D antigen 

133227 133227 AW977263 Hs.68257 general transcription factor OF, polype 

103184 103184 U43143 Hs.74049 fmswelated tyrosine kinase 4 

103208 103208 AW411340 Hs.31314 retmoblastoma-binding protein 7 

131486 131486 F06972 Hs.27372 BMX non-receptor tyrosine kinase 

30 103334 103334 NM_001260 Hs.25283 cyc^epwdent kinase 8 

135094 135094 NM_003304 Hs.250687 transient receptor potential channel 1 

103352 103352 H09366 Hs.78853 uraclU3NA glycosylate 

132173 132173 X89426 Hs.41716 endothelial celkpedflc molecule 1 

131584 131584 AA598509 H^29117 purine-rteh element binding protein A 

35 103378 103378 AU1969Q Ms.153618 HCGVUI-1 protein 

103410 103410 AA158294 H3.2953B2 DR1 -associated protein 1 (negative cofac 

103438 103438 AW175781 Hs.1 52720 M-phase phosphoprotein6 

103452 103452 NNL006936 Hs.85119 SMT3 (suppressor of mff two 3, yeast) ho 

135185 135185 AW404908 Hs.96038 Rfc(Drosophlla}^, expressed In many 

40 134662 134662 NM.007048 Hs.284283 butyrophilin, subfamily 3, member A 1 

103500 103500 AW408009 Hs.22580 alkyigtycerone phosphate synthase 

132084 132084 NrM_002267 Hs.3886 karyopherin alpha 3 (Importin alpha 4) 

133152 133152 Z11695 Hs.324473 maogerrtcfivated protein kinase 1 

103612 103612 BE336654 Hs.70937 H3 histone family, member A 

45 103692 103692 AW137912 Hs.227583 Homo sapiens chrornosomeX map Xp1 1.23 L- 

129796 129796 BE218319 Hs.5807 GTPaseRab14 

132683 132683 BE264633 Hs. 143638 WD repeat domain 4 

103723 103723 BE274312 Hs.214783 Homo sapiens cONA FU 14041 fis, ctone HE 

133260 133260 AA403045 Hs.6906 Homo sapiens cDNA: RJ23197 fis, clone R 

50 103766 103766 A1920783 Hs.191435 ESTs 

132051 132051 AA393968 Hs.180145 HSPC030 protein 

135289 135289 AW372569 Hs.9788 hypotheflcal proteh MGC10924 similar to 

103794 103794 AF244135 Hs.30670 hepatocellular carcmoma-associated and 

134319 134319 BE304999 Hs.285754 fumarate hydratase 

55 119159 119159 AF142419 Hs.15020 homoiog of mouse quaking OKI (KH domain 

103850 103850 AA187101 Hs.213194 hypothetical protein MGC10895 

322026 322026 AW024973 Hs.283675 NPD009 protein 

103861 103861 AA206236 Hs.4944 hypothetical protein FU12783 

447735 447735 AA775268 Hs.6127 Homo sapiens cONA: FU23020 fis, clone L 

60 131236 131236 AF043117 Hs.24594 ublquitinatton factor E4B (homologous to 

129013 129013 AA371156 Hs.107942 OKFZP564M1 12 protein 

103988 103988 AA314389 Hs.342849 ADP-ribosytation factor-like 5 

425284 425284 AF155568 Hs.348043 NS1 -associated protein 1 

133281 133281 AK001601 Hs.69594 high-mobility group 20A 

65 108154 108154 NM.005754 Hs.220689 Ras-GTPase-activatlng protein SH3-domain 

135073 135073 W55956 Hs.94030 Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

129593 129593 AI338247 Hs.98314 Homo sapiens mRNA; cONA DKFZp566L0120 (f 

132064 132064 AA 121 098 Hs.3838 semnvmducible kinase 

131427 131427 AF151879 Hs.26706 CGM21 protein 

70 104282 104282 C14448 Hs.332338 EST 

130443 130443 025216 Hs.155650 WAA0014 gene product 

132637 132837 AA370362 Hs.57958 EGF-TM7-iatrophHliMelated protein 

104334 104334 D82614 Hs.78771 phosphoglyoerate kinase 1 

134731 134731 D89377 Hs.89404 msh (DrosophOa) homeo box homoiog 2 

75 131670 131670 H03514 Hs.15589 ESTs 

104402 104402 H56731 Hs.132956 ESTs 
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129077 129077 
134927 134927 
134498 134498 
104488 104488 
5 129214 129214 
104530 104530 
104544 104544 
104557 104567 
129575 129575 

10 104599 104599 
104667 104667 
104764 104764 
104787 104787 
104604 104804 

15 130828 130828 
104943 104943 
105024 105024 
105038 105038 
105096 105096 

20 105169 105169 
130401 130401 
130114 130114 
105337 105337 
105376 105376 

25 131962 131962 
128658 126658 
105508 105508 
135172 135172 
132542 132542 

30 105659 105659 
105674 105674 
105722 105722 
115951 115951 
105985 105985 

35 131216 131216 
113689 113689 
130839 130839 
130777 130777 
106196 106196 

40 133200 133200 
106328 106328 
106423 106423 
439608 439608 
106503 106503 

45 106543 106543 
106589 106589 
106596 106596 
106636 106636 
131353 131353 

50 131710 131710 
131775 131775 
106773 106773 
106817 106817 
106848 106848 

55 418699 418699 
130638 130638 
107059 107059 
107115 107115 
107156 107156 

60 130621 130621 
132626 132626 
131610 131610 
107295 107295 
107315 107315 

65 107328 107328 
134715 134715 
129938 129938 
130074 130074 
132036 132036 

70 113857 113857 
130419 130419 
132616 132616 
132358 132358 
125827 125827 

75 107609 107609 
107714 107714 



K74724 Hs.108479 

L36531 HS31296 

AW246273 Hs.84131 

N56191 Hs.106511 

AL044335 Hs. 109526 

AK001676 Hs.12457 

AI091173 Hs.222362 

AA040620 H&5672 

F08282 H&278428 

AW815036 Hs.151251 

A1239923 Hs.63931 

A1039243 Hs.278535 
AA027317 

AI858702 H3.31803 

AW631469 Hs.203213 

AF072873 Hs.114218 

AA126311 Hs.9879 

AW503733 Hs.9414 

AL042506 Hs.21599 

BE245234 Hs.160789 

BE396283 Hs.1 73987 

AA233393 Hs.1 4992 

AI468789 Hs.347187 

AW994032 Hs.8768 

AK000046 Hs.343877 

BE397354 Hs.324830 

AA173942 Hs.326416 

AB028956 Ks.12144 

AL137751 Hs.263671 

AA283044 Hs.25625 

AI609530 Hs.279789 

AI922821 Hs.32433 

BE546245 Hs.301048 
AA406610 

AIB15466 Hs.243901 

AB037850 Hs.16621 

AB011169 Hs.20141 

AW135049 Hs.26285 

AA525993 Hs.173699 

AB037715 Hs.183639 

AL079559 Hs.28020 

AB020722 Hs.16714 

AW864696 Hs.301732 

AB033042 Hs.29679 

AA676939 Hs.69285 

AK000933 Hs.28661 
AA452379 

AW958037 Hs.286 
AW754182 

NMJ)15368 Hs.30985 

AB014548 Hs.31921 

AA478109 Hs.188833 

D61216 Hs.18672 

AA449014 Hs.121025 

BE539639 Hs.173030 

AW021276 Hs.17121 

BE614410 Hs.23044 

BE379623 Hs.27693 

AA137043 Hs.9663 

AW513087 Hs.16803 

AW504732 Hs.21275 

AA357879 Hs.29423 

AA186629 Hs.80120 

AA316241 Hs.90691 

AW959891 HS76591 

U48263 Hs.69040 

AW003668 Hs.135587 

AL038596 Hs.250745 

AL157433 Hs.37706 

AW243158 Hs.5297 

AF037448 Hs.155489 

BE262677 Hs.283558 

NNL003542 Hs.46423 

NM.003403 Hs.97496 

R75654 Hs.164797 

AA015761 Hs.60642 



ESTs 

Integrin. alpha 6 

IhreonyMRNA synthetase 

protocadherh 17 

zinc linger protein 198 

hypothetical protein HJ 10814 

ESTs, Weakly similar to p40 [H^aplans] 

hypothetical protein AF140225 

progesfin induced protein 

ESTs 

ESTs 

ESTs 

gb3B97d1U1 Scares_fetaLheait_NbHH19W 
ESTs, Weakly similar to N-WASP [H^aplen 
ESTs 

frizzled (Drosophila) homotog 6 
ESTs 

K1AA1488 protein 
Kruppet^ke factor 7 {ubiquitous) 
S164 protein 

eukaryotlc translation Inifiatlon factor 
hypothetical protein RJ11151 
myotubularin related protein 1 
hypolhefical protein FU10849 
hypothefcal protein RJ 20039 
diptheria toxin resistance protein requl 
Homo sapiens mRNA; cDNA DKFZp564H1916 {f 
WAA1033 protein 

Homo sapiens mRNA; cDNA DKFZp434l0812 (f 
hypothetical protein RJ 11323 
hJstone deacetytase 3 
ESTs 

sec13-£ks protein 

gb:zv15b10.s1 SoaresJfoHMPu_S1 Homo sapi 

Homo sapiens cONA FU20738 tis, clone HE 

DKFZP434I116 protein 

similar to S. cerevisiae SSM4 

Homo sapiens cDNA RJ 10643 tis, clone NT 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

hypothetical protein RJ10210 

KIAA0766 gene product 

Rho guanine exchange factor (GEF) 15 

hypothetlcat protein MGC5306 

cofactor required for Sp1 transcriptiona 

neuropHtn 1 

Homo sapiens cONA FU10071 fis, clone HE 
ESTs, Moderately similar to ALU7_HUMAN A 
ribosomal protein L4 

gb:RC2-CT0321-131199411-c01 CT0321 Homo 

pannexin 1 

WAA0648 protein 

ESTs 

ESTs 

chromosome 11 open reading frame 5 
ESTs, WeaHy simitar to ALU8_HUMAN ALU S 
ESTs 

RAD51 (S. cerevisiae) homobg (E coll Re 
peptidytorolyl isomerase (cyctophtlinH 
programmed cell death 6-interactIng prat 
LUC7 (S. cerevlslaeHlke 
hypothetical protein RJ11011 
scavenger receptor with C-type lectin 
UDP^^cetyWpha-f>galactosamine:potyp 
nudeophos min/nudeopiasmln 3 
KIAA0887 protein 
prepronocfoeptin 

Human done 23629 mRNA sequence 
polymerase (RNA) 111 (DMA directed) (62k 
hypothetical protein DKFZp434E2220 
DKF2P564A2416 protein 
NS1-assodated protein 1 
hypothetical protein PR01655 
H4 hlstone family, member G 
YY1 transcription factor 
hypothetical protein FU13693 
ESTs 
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107832 107832 


AA021473 




124337 124337 


N23541 


Hs.281561 


129577 129577 


N75346 


Hs.308121 


132000 132000 


AW247017 


H3.36978 


107935 107935 


AA029428 


Ks.61555 


131461 131461 


AA992841 


Hs.27263 


108029 103029 


AA040740 


Hs.62007 


108084 108084 


AA058944 


Hs.116602 


108168 108168 


AM53137 


Hs.63176 


108189 108189 


AW376081 


Hs.63335 


108203 108203 


AW847814 


H&289005 


108217 108217 


AA058686 


Hs.62588 


108277 108277 


AA064859 




108309 108309 


AA089818 




108340 108340 


AA069820 


Hs. 180909 


108427 108427 


AA076382 




108439 108439 


AA078986 




108469 108469 


AA079487 




108501 108501 


AA063256 




106562 108562 


AA100796 




130890 130890 


AI907537 


Hs.76698 


130385 130385 


AW067800 


Hs.155223 


108807 108807 


A1652236 


Hs.49376 


108833 108833 


AF 188527 


Hs.61661 


108846 108646 


All 17452 


Hs.44155 


131474 131474 


L46353 


Ha.2726 


108941 108941 


AA148650 




108996 108996 


AW995610 


Hs.332436 


131183 131183 


A1611807 


H5L285107 


109022 109022 


AA157291 


Hs.21479 


109068 109068 


AA164293 


Hs.72545 


129021 129021 


AL044675 


Hs.173081 


109146 109146 


AA176589 


Hs.142078 


131080 131080 


NM_001955 


Hs.2271 


109222 109222 


AA192833 


Hs.333512 


109481 109481 


AA878923 


Hs.289069 


109516 109516 


AI471639 


Hs.71913 


109556 109556 


A1925294 


Hs.87385 


109578 109578 


F02208 


Hs.27214 


109625 109625 


H29490 


Hs.22697 


109648 109648 


H17800 


Hs.7154 


109699 109699 


H18013 


Hs.167483 


109933 109933 


R52417 


Hs.20945 


110039 110039 


H11938 


Hs.21907 



gbze66d 1 js\ Soares retina N2b4HR Homo 
Homo sapiens cDNA: FU23582 0s, done L 
CDC20 (cell division cycle 20, S. cerevl 
melanoma anflgen, family A, 3 
ESTs 

KIAA1458 protein 
ESTs 

Homo sapiens, clone IMAGB41 54008, mRNA, 
ESTs 

ESTs, Moderately similar to A46010 X-Bn 
Homo sapiens cDNA: FU21532 fls, done C 
ESTs 

gfczm50103 .si Stratagene fibroblast (937 
gbzm57e03 j1 Stratagene rteuroepitheiium 
peroxfredoxin 1 

gb:zm91g08.$1 Stratagene ovarian cancer 
gb:zm92h0U1 Stratagene ovarian cancer 
gbzm97108.s1 Stratagene colon HT29 (937 
gb2n08g1Zs1 Stratagene hNT neuron {937 
gb3m26c06 M Stratagene pancreas (93720 
stress-associated endoplasmic reticulum 
stanniocaloin 2 
hypoftefical protein RJ20644 
ESTs, WeaWy similar to AF174605 1 F-box 
DKFZP586G1517 protein 
high-mobfUty group (nonhistone chromoso 
gb2o09e06.s1 Stratagene neuroepithefium 
EST 

hypofheucal protean FU 13397 

ubinucteinl 

ESTs 

KIAAO530 protein 
EST 

endotheCn 1 

similar to rat myomegalln 

hypothetical protein FU21016 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens done 24993 mRNA sequence 
histone acetyl transferase 
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TABLE 2A 



Table 2A shows (he accession numbers for those pkeys lacking unlgeneiD's for Table 2. The pkeys In Tabfa 7 lacking unigenelTJs are represented within 
Tables 1-6 A. For each probesei we have listed the gene duster number from which lha oEgonudeolkJes were designed Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAa, These secruerwes were dustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each duster are fisted In the 'Accession" 



10 Ptey. Unique Eos probeset Identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



15 Pkey CAT Number Accession 



116761J 
13684.-12 
36375.1 
4669J 



108469 
108501 
108562 
20 101300 



25 



30 117156 145392.1 



132983 



133681 13893J 



35 



40 



45 



50 



55 



121335 
60 130018 
121822 



65 

108309 
107832 
123523 

70 123964 
118475 
104787 
106596 
113947 

75 108277 



1704098J 
11922.1 



AA079487 AA128547 AA128291 AA079587 AA079600 
AA083256 

AA100796 AF020589 AA074629 AA075946 AA100849 AA085347 AA126309 AA079311 AA079323 AA085274 
BE53551 1 M62098 AA306767 AW891766 AA348998 AA338869 AA344013 AW956561 AW389343 AW403607 L40391 
AW408435 AA121738 AI568978 H13317 R20373 AW948724 AW948744 AA335023 AA436722 AA448690 C21404 
AW884390 AA345454 AA303292 AA174174 BE092290 T90614 AA035104 R76028 AA126924 AA7410B6 AW022056 
AW118940 AA121666 AI832409 AA683475 AI140901 AI623576 AW519064 AW474125 AI953923 AI735349 AW150109 
AJ436154 AW1 18130 AW270782 A1804073 N27434 AA876543 AA937815 AI051 166 AA505378 A1041975 AI335355 
AI089540 AA662243 Al 127912 AI925604 AI250880 AJ366874 AI564386 A1815196 AI683526 AM35885 All 50934 H79030 
AJ801493 AA448691 AI673767 AKJ76042 AJ804327 AA813438 AA680002 A1274492 T16177 A1287337 AI935050 
AA907805 AA91 1493 AI58941 1 AJ371358 AW576236 AI078866 AW5161 68 AA346372 AI560185 AA471009 R75857 
AA296025 AA523155 AA853168 A1696593 AJ658482 A1566601 AW072797 AA128047 AA035502 AW243274 AA992517 
R43760 

W73853 AA9281 12 W77887 AW889237 AA148524 AI749182 AI754442 AI338392 AI253102 AI079403 AI370541 AI697341 
H97538AW188021A]927663V\ff2716AI0514(HA)1^ 
A1144448 W73819 AA604358 N28900 W95221 AI868132 H98465 AA148793 
R20840 R20839 

M30269 NNL002508 X82245 AI078760 AW957003 D78945 M27445 AA650439 AL048816 AV660256 AV660347 
AA333052 BE295257 T60999 AA383049 AW369677 Z26985 AW175704 AA343326 AW747957 AI818389 W17308 
W17302 H15591 AA371284 AA370412 W94966 BE384365 T28498 R80714 R16959 H21723 AW835154 D56097 D56381 
W21 232 AA1 90565 AW379755 AW067895 

AI352558 Z82248 X78138 NMJXJ3405 AU077248 AA223125 S80794 D78577 AI124697 AW403970 BE614089 BE296713 
BE621334 120422 X80536 D54224 D54950 X57345 N 29226 AA127798 AA340253 F08031 AA192540 H67636 AA321827 
AW950283 AA084159 BE538808 AW401377 AA2S6774 C03366 W46595 W47608 AA305009 H69431 H69456 AL120082 
H1 1706 AA303717 AA361357 H22042 H78020 AW999584 AA134368 AA32291 1 AA322961 H60980 N85248 N31547 
H79624 T1 1718 W85828 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151067 BE002724 AA608688 H82692 BE155392 AW383636 BE155394 AA487004 AW383504 AI342365 R82553 
W16498 BE155344 AI143938 R69901 AA322873 AW340648 R25364 AA367935 AI559406 AA033522 AA374252 
AW835019 AI922133 A1697089 N99662 AW1 89078 Al 199076 AW151598 W59944 AA662875 W94022 AA299055 
AI039008 AI829449 AA583503 A1635674 AW131665 AI473820 AW2731 18 AW900930 AA908944 AI688035 AW170272 
A1082545 AW468176 AI608761 A1082748 AI911682 AI248943 AI831016 AA192465 AI218477 AA938406 AA385288 
AI809817 AA905196 AI191245 AI470204 AI188296 AI421367 AI125315 AJ087141 AA629032 AA740589 AI554181 
AA150830 AI248541 AI077943 AA775958 AA864930 A1261476 AI123121 A1310394 AA862331 AA872478 BE537084 
AJ 205605 AA720684 AI872093 AW150042 AL12053B AA219627 AA988608 C21397 AI359337 H25337 AI089749 
AA605146 A1359620 AA150478 AI359738 AW383642 AW995424 Ai766457 R56892 AKW9839 W61343 N69107 W46459 
AA565955 N20527 AI279782 W46596 AA776573 H23204 A(666231 AI083995 N21530 AA126874 D82630 W65437 
AI086917 AW382095 AI086877 H69844 AW340217 W85827 L08439 AA262704 AA505380 W47413 W94135 AA223241 
AW0891 53 AA084101 BE538000 AA096126 T28031 AA491574 R84813 AA774536 AW383522 AA155615 AW383529 
AA491520 AW028427 AA171496 AI469689 AW664539 AI81 1102 A181 1 1 16 BE464590 BE350791 H78021 T15405 H21979 
AA219489 H13301 AA505883 AI864305 AI423963 AW084401 F04963 R69858 H67097 A1917740 AI655561 H69864 
AA033631 AW383484 AI886261 H25293 AA513281 AW271187 H11617 N79982 AI174338 AI904207 AI904208 BE614558 
W94127 W65436 AI272249 AA700018 AJ579932 AI085941 AW152629 
AA404418AI217248 

AA353093 AW957317 AW872498 AI560785 AI289110 AW135512 X97261 T68873 

AI743860 N49543 AW027759 BE349467 A1656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 AI61 1424 
AL079362 AI969290 AI928016 BE394912 BE504220 BE467505 AI61 161 1 AI61 1407 AI61 1452 W56437 A1284566 
A1583349 AW183058 A1308085 AI074952 AA437315 AA628161 AW301728 AI150224 AA400137 AA437279 AI223355 
AA639462 AI261373 AJ432414 A1984994 A1539335 AA401550 AA358757 AI609976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055 AI580502 AI832944 A1038530 AA425107 AI014986 AI148349 AW237721 AW779756 
AW137877 AI125293 AA400404 R2B554 
AA069818 AA069971 AA069923 AA0699O8 
genbank_AA021473 AA021473 
genbankJW608588 AA608588 
genbank.C13961 C13961 
genbankJI66845N66845 
genDankJWOTtf AA027317 

304084.1 A1583948 AA578212 AW303715 AA653450 AA456981 AI400385 W88533 AI2241 33 AW272145 AA088686 R94698 

genbank.W84768W84768 

genbankJW)64859 AA064859 



279548.1 
18986.1 
244391.1 



111495J 
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108427 genbanKJ\A076382 AA076382 

108439 genbankJW>78986 AA078986 

131353 231290J AW411259 H23555 AW015049 AJ684275 AW015886 AW068953 AW014085 AI027250 R52686 AA918278 All 29462 

AA969360 N34869 A1948416 AA534205 AA702483 AA705292 

101447 entre^M21305 M21305 

108931 genbanlLAA147186 AA147186 

108941 genbanKMI 48650 AA 148650 

103138 entreO<65965 X65965 

119174 genbanLR71234 R71234 

119416 genbanlLT97188 T97186 

105985 genban)LAA406610 AA406610 

100327 entre?_D55640 D55640 
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TABLE 3: 



Ptey: 
Acces 
ExAocn: 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Unique Eos probeset Identifier number 
Accession number used for previous patent filings 
Exemplar Accession number, Genbank accession number 
Unlgene number 
UnJgeneTife: Unlgene gene Alls 



Pkey Accession ExAccn UniGene Unigeneritte 



100405 086425 
100420 086983 
100481 HG1098-HT1098 
100484 HG1103-HT1103 
100718 HG3342-HT3519 
100991 J03764 
101097 106797 
101168 L15388 
101194 150971 
101261 135545 
101345 L76380 
101447 M21305 
101485 M24736 
101543 M31166 
101550 M31551 
101560 M32334 
101674 M61916 
101714 M68874 
101741 M74719 
101838 M92934 
101857 M94856 
102012 U03057 
102024 U03877 
102164 U18300 
102241 U27109 
102263 U31384 
102303 U33053 
102564 U59423 
102663 U70322 
102759 U81607 
102778 U83463 
102804 U89942 
102887 X04729 
102898 X06256 
102915 X07820 

103036 X54925 

103037 X54936 
103095 X60957 
103158 X67235 
103166 X67951 
103185 X69910 
103280 X79981 
103554 Z18951 
103850 AA167101 
104465 N24990 
104592 R81003 
104764 AA025351 
104786 AA027168 
104850 AA040465 
104865 AA045136 
104694 AA054087 
104952 AA0710S9 
104974 AA085916 
105178 AA1 87490 
105263 AA227926 
105330 AA234743 
105376 AA236559 
105729 AA292694 
105826 AA398243 
105977 M406363 
106006 AA411465 
106031 AA412284 
106124 AA423987 



AW291587 Hs.82733 
086983 Hs.1 18893 
X70377 Hs.1 21489 
NMJX)5402Hs.288757 
BE295928 Hs.75424 
J03836 Hs.82085 
BE245301 Hs.89414 
NM.005308HS.211569 
L20971 Hs.188 
030857 Hs.82353 
NM_005795Hs.152175 
M21305 

AA296520 Hs.89546 
M31166 Hs.2050 
Y00630 Hs.75716 
AW958272 Hs.347326 
NIvL002291Hs.82124 
M68874 Hs.211587 
NM.003199HS.326198 
BE243845 Hs.75511 
BE550723 Hs.153179 
BE259035 Hs.118400 
AA301867 Hs.76224 
NM_000107Hs.77602 
NM.007351HS.268107 
AW161552 Hs.83361 
U33053 Hs.2499 
U59423 Hs.79067 
NM-002270HS.168075 
NM_005100Hs.788 
AFO00652 Hs.8180 
NMO02318HS.83354 
J03836 Hs.82085 
NMJ)02205Hs.149609 
X07820 Hs.2258 
M13509 Hs.83169 
88)18302 Hs.2894 
NM_005424Hs.78824 
BE242587 Hs.1 18651 
AA159248 Hs.180909 
NM_006825Hs.74368 
U84722 Hs.76206 
A1878826 Hs.74034 
AA187101 Hs.213194 
Z44203 Hs.26418 
AW630488 Hs.25338 
AI039243 Hs.278585 
AA027167 Hs.10031 
AL133035 Hs.8728 
T79340 Hs.22575 
AFQ65214 Hs.18858 
AW076098 Hs.345588 
Y12059 Hs.278675 
AA313825 Hs.21941 
AW388633 Hs.6682 
AW338625 Hs.22120 
AW994032 Hs.8768 
H46612 Hs.293815 
AA478756 Hs.194477 
AK001972 Hs.30822 
AB033888 Hs.8619 
X64116 Hs.171844 
H93366 Hs.7567 



nkJogen 2 

Melanoma associated gene 
cystaflnO 

v-ral simian leukemia viral oncogene horn 
Inhibitor of DNA binding 1 , dominant nsg 
serine (or cysteine) proteinase inhttrfto 
chemoWne (C-X-C motif), receptor 4 {fus 
G proteln-coupted receptor kinase 5 
phosphodiesterase 4B, cAMP-spedflc (dun 
protein C receptor, endothelial (EPCR) 
calcitonin receptor-tike 
gbihluman alpha satellite and satellite 3 
selecfin E (endothelial adhesion motecul 
pentexin-reiated gene, rapidly induced b 
serine (or cysteine) proteinase inhiblto 
intercellular adhesion molecule 2 
laminin, beta 1 

pnospnoCpase A2. group IVA (cytosoUc, 
transcription factor 4 
connective tissue growth factor 
fatty add binding protein 5 (psoriasis- 
singed (Drosophla)-Gce (sea urchin fas 
EGF-conbintng fibulin-fflce extraceMa 
damage-specific DNA binding protein 2 (4 
multirnerin 

guanine nucleotide binding protein 1 1 

protein kinase C-Eke 1 

MAD (mothers against decapentapleglc, Dr 

karyopherin (importin) beta 2 

A kinase (PRKA) anchor protein (gravln) 

syndecan binding protein (syntenin) 

tysyi oxkJase-Cte 2 

serine (or cysteine) proteinase inhliite 
integral, alpha 5 (fibronectln receptor, 
matrix metaHoprotelnase 10 (stromelysin 
matrix metaDoprotelnase 1 (interstitial 
placental growth factor, vascular endoth 
tyrosine kinase with Immunoglobulin and 
hematopoieijcat!y expressed homeobox 
peroxiredoxin 1 

transmembrane protein (63kD), endoplasml 
cadherin 5, type 2, VE-cadherin (vascula 
caveoHn 1, cavectee protein, 22kD 
hypothetical protein MGC10895 
ESTs 

protease, serine, 23 
ESTs 

WAA0955 protein 

hypothetical protein DKFZp434G171 
B-ceP (XUlymphoma 6, member B (zinc fi 
phosphoBpase A2, group IVC (cytosofic, 
desmopfaMn (DPI, DPI!) 
bromodornatn-oontaining 4 
AD036 protein 

solute carrier family 7, (cation Ic amino 
ESTs 

hypothetical protein FU10849 

Homo sapiens HSPC285 mRNA, partial cds 

E3 ublquitJn ligase SMURF2 

hypothetical protein FU11110 

SRY (sex determining region Y>box 18 

Homo sapiens cDNA: FU22296 fis, done H 

Homo sapiens cDNA RJ21962 Rs, clone H 
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106155 
106302 
06423 
06793 
107174 
107216 
I07295 
107385 
108756 
108846 
108888 
109001 
109166 
109456 
09768 
10107 
10908 
10984 
11006 
11018 
11133 
11760 
13073 
13195 
13923 
14521 
15061 
15096 
15145 
15819 
15947 
16314 
16339 
16430 
16589 
16733 
17023 
17186 
17563 
17997 
18475 
18581 
19073 
19155 
19174 
19221 
19416 
19866 
21335 
21381 
23160 
23473 
23523 
23533 
23964 
24006 
24315 
24659 
24669 
24847 
24875 
25091 
25103 
25355 
25565 
25590 
26511 



I26872 
27402 
127651 
27759 
28062 



M425309 
AA435896 
AA448238 
M478778 
AA621714 
O510S9 
T34527 
U97519 
AA127221 
AA132983 
AA135606 
AA156125 
AA179845 
AA232645 
F10399 
H16772 
N39584 
N52006 
N53375 
N54067 
N64436 
R26892 
T33637 
T57112 
W80763 
AA046808 
AA253217 
AA255991 
AA258138 
AA426573 
AA443793 
M490588 
AA496257 
AA609717 
D59570 
F13787 
H88157 
H98988 
N34287 
N52090 
N66845 
N68905 
R32894 
R61715 
R71234 
R98105 
T97186 
W80814 
AA404418 
AA405747 
AA488687 
AA599143 
AA608588 
AA608751 
C13961 
D60302 
H94892 
N93521 
N95477 
R60044 
R70506 
T91518 
T95333 
R45630 
R20839 
.R23858 
AI024874 
W26247 
AA856990 
AA136653 
AA358869 
A1123976 
AI369384 
AA379500 



M425414 Hs.33287 
AA3S8859 Hs.18397 
AB020722 Hs.16714 
K94997 Ha.16450 
BE122762 Hs.25338 
051069 Hs.211579 
AA186629 Hs.80120 
NMJM5397Hs. 16426 
AA127221 Hs.117037 
AL1 17452 Hs.44155 
AA135606 Hs.189384 
AI056548 Hs.72116 
AA219691 Hs.73625 
AW956580 Hs.42699 
{=06838 Hs.14763 
AW151660 Hs.31444 
AA035211 Hs.17404 
AW613287 Hs.80120 
BE387014 Hs.166146 
A1287912 HsJ3628 
AW580939 Hs.97199 
BE551929 Hs.268754 
N39342 Hs.103042 
H83265 Hs.8881 
AW953484 H&3849 
AW 139036 Hs.108957- 
AI751438 Hs.41271 
A1683069 Hs.175319 
AA740907 Hs.88297 
AA486620 Hs.41135 
R47479 Hs.94761 
AI799104 Hs.178705 
AKD00290 Hs.44033 
AKD01531 Hs.66048 
AI557212 Hs.17132 
AL157424 Hs.61289 
AW070211 Hs.102415 
H98988 Hs.42612 
AF055634 Hs.44553 
N52090 Hs.47420 
N66845 
N68905 

BE245360 Hs.279477 
R61715 Hs.310598 
R71234 

C14322 Hs.250700 
T97186 

AA49620S Hs.193700 
AA404418 

AW088642 Hs.97934 

AA488687 Hs.284235 

AA599143 

AA608588 

AA608751 

C13961 

AI147155 Hs.270016 
NMJ)05402Hs.288757 
AI680737 Hs.289068 



H3.102943 
Hs.304177 
Hs.285814 



129046 AA195678 



AI571594 
W07701 
AK87664 
T91518 

AA570056 Hs.122730 

R60547 Hs.170098 
R20840 

R23858 Hs.143375 

T92143 Hs.57958 

AA516391 HS.181368 

AA001B60 Hs.279531 
AW450979 

AA358869 Hs.227949 

AA362523 Ks.105689 

A1369384 Hs.292441 

AA379621 Hs.105547 

H04150 Hs.107708 

AB029290 Hs. 108258 



nuclear factor l/B 

hypotheScaJ protein FU23221 

Rho guanine exchange factor (GEF) 15 

ESTs 

ESTs 

melanoma ceH adhesion molecule 

UDP^^cetyi^ha-D-galacto3amlna;potyp 

podocalyxin-like 

ESTs 

DKFZP586G1517 protein 

gb:z]10a05.s1 Soaras_pregnanLuterus_N bH 

hypothetical protein FU20992 similar to 

RAB6 Interacting, Wnesin-Gke (rabktnes 

ESTs 

ESTs 

ESTs 

ESTs 

UDP^cetyt*toha^alactosarnine:polyp 
Homer, neuronal Immediate earty gene, 3 
mitogen-adlvated protein kinase kinase 
comptement component C1q receptor . 
Homo sapiens cDNA FU1 1949 fis. done HE 
microtibuJa-assodated protein 1B 
ESTs, Weakly similar to S41044 chromosom 
hypothetical protein FU 22041 similar to 
40S ribosomal protein S27 isoform 
Homo sapiens mRNA fuO length insert cON 
ESTs 
ESTs 

eadomudn-2 
WAA1691 protein 

Homo sapiens cDNAFUl 1333 fis, done PL 

dlpeptidyl peptidase 8 

hypofrefical protein FU10669 

ESTs, Moderately similar to I54374 gene 

synaptojanin 2 

Homo sapiens mRNA; cONA DKFZp586N0121 (f 
ESTs, Weakly similar to ALU 1_HUMAN ALU S 
unc5 (C.etegans homotog) c 
EST 

gfcza46ri IjI Scares fetal liver spleen 
gb:za69b09.s1 Soares.fetaUung_NbHL19W 
ESTs 

ESTs, Moderately similar to ALU1.HUMAN A 
gfcyl54c0a.s1 Scares placenta Nb2HP Homo 
trypiase beta 1 

gb:ye50h09.$1 Scares fetal liver spleen 
Homo sapiens mRNA; cONA DKFZp586l0324 (f 
gtnw37e02j1 Soares_totaLfetusJto2HF8_ 
hypothetical protein FU22252 similar to 
ESTs, Weakly similar to 138022 hypothetl 
gb:ae52d04.s1 Stratagene lung carcinoma 
gb:ae54eQ6.s1 Stratagene lung carcinoma 
gbae56h07.s1 Stratagene tung carcinoma 
gb:C 13961 Ctontech human aorta polyA+mR 
ESTs 

v-ral simian leukemia viral oncogene horn 

Homo sapiens cONA FU 1 1 918 As, clone HE 

hypothetical protein M6C12916 

Homo sapiens clone FLB8503 PR02286 mRNA, 

sprouty (Orosophila) homotog 4 

gb7e20O55l Stratagene lung (937210) H 

ESTs, Moderately similar to KIAA1 215 pro 

KIAA0372 gene product 

gb^g05c08 Jl Scares infant brain 1 NIB H 

Homo sapiens, done 1MAGE3840937, mRNA, 

EGF-TM74atrophffirwelated protein 

U5 snRNP-speclfic protein (220 kD), orth 

ESTs 

gfoUW«l3-ala-a-12-0-UU1 NCI_CGAP_Su 

SEC13(S.cerevislae)^e1 

MSTP031 protein 

ESTs 

neural proliferation, differentiation an 
ESTs 

actln binding protein; macrophin (mlcrof 
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129188 M30257 
129314 AAQ28131 
129371 M10321 
129488 J03O40 
129765 M86933 
129805 AA012933 
129884 AA286710 
130495 AA243278 
130639 059711 
130657 T94452 
130828 AA053400 
130972 AA370302 
131080 J05008 
131137 U85193 
131182 AA256153 
131486 X83107 
131573 AA046593 
131647 AA410480 
131756 045304 
131859 M90657 
131681 AA010163 
132050 AA136353 
132083 Y07867 
132164 U84573 
132358 X60486 
132413 AA132969 
132456 AA114250 
132490 F13782 
132676 AA283035 
132687 AB002301 
132718 AA056731 
132736 U68019 
132760 H99198 
132933 AA598702 
132968 N77151 
132994 AA505133 
133061 AB000584 
133147 D12763 
133161 AA253193 
133200 AA432248 
133260 AA083572 
133363 AA479713 
133491 L40395 
133517 X52947 
133550 W80846 
133607 M34539 
133614 D67029 ' 
133627 U09587 
133691 MB5289 
133696 010522 
133913 W84712 
133975 029992 
133985 L34657 
134039 S78569 
134088 043636 
134161 U97188 
134299 AA487558 
134416 M28882 
134453 X70683 
134656 X14787 
134989 AA236324 
135051 C15324 
135073 AA452000 
135349 083174 
100114 D00596 
100130 011428 
100143 013640 
100168 D14874 
100208 026129 
100224 028476 
100405 086425 
100420 086983 
100455 087953 
100529 HG1862-HT1897 

100618 HG2614-HT2710 

100619 HG2639+IT2735 



NM001078HS.109225 
BE622768 Hs.290356 
X06828 Hs.110802 
AW410538 Hs.111779 
M36933 Hs.1238 
AA012848 Hs.12570 
AF055581 Hs.13131 
AW250380 Hs.109059 
A1557212 Hs.17132 
AW337575 Hs.201591 
AW631469 Hs.203213 
D81866 Hs.21739 
NMJ»1955Hs,2271 
W27392 Hs.33287 
AI824144 Hs.23912 
F06972 Hs.27372 
AA040311 Hs.28959 
AA359615 Hs.30089 
AA443966 Hs.31595 
AW960564 
AW361018 Hs.3383 
AB67615 Hs.38022 
BE386490 Hs.279663 
AI752235 Hs.41270 
NMJM3542HS.46423 
AW361383 Hs.260116 
AB011084 Hs.46924 
NM.001290Hs.4980 
N92589 Hs361038 
AB002301 Hs.54985 
NM_004600H&554 
AW081883 Hs.211578 
AA1259B5 Hs.56145 
BE263252 Ha.6101 
AF234532 Hs.61638 
AA1 12748 Hs.279905 
AI186431 Hs.296633 
AA026533 Hs.66 
AW021103 Hs.6631 
AB037715 Hs.183639 
AA4O3045 Hs,6906 
AI866286 Hs.71962 
BE619053 Hs.170001 
NIVL000165HS.74471 
AI129903 Hs.74669 
BE273749 

NM_003003Hs.75232 
NM_002047Hs.75280 
M85289 Hs.211573 
AI878921 Hs.75607 
AU076964 Hs.7753 
C18356 Hs.295944 
L34657 Hs.78146 
NWL002290HS.78672 
AI379954 Hs.79025 
AA634543 Hs.79440 
AW580939 Hs.97199 
X68264 Hs.211579 
AI272141 Hs.83484 
AI750878 Hs.87409 
AW968058 Hs.92381 
AI272141 Hs.83464 
W55956 Hs.94030 
AA1 14212 Hs.9930 
X02308 Hs.82962 
NM.000304HS.103724 
AU076465 Hs.278441 
H73444 Hs.394 
NM_002933Hs78224 
AL121516 Hs.138617 
AW291587 Hs.82733 
D86983 Hs.118893 
AW888941 Hs.75789 
BE313693 Hs.334330 
AI752163 Hs.114599 
N24433 H&241567 



vascular ceo adhesion molecule 1 

mesoderm development candidate 1 

von WiDebrand factor 

secreted protein, addle, cystahe-rteh 

ameJogentn (Y chromosome) 

lubulin-spedlic chaperoned 

lysosomal 

mnocnonanai noosomai protein Liz 
ESTs, Moderately similar to 154374 gene 
ESTs 
ESTs 

Homo sapiens mRNA; cONA DKFZp586l1518 (f 
endoQie&i 1 
nuclear fader 1/B 
ESTs 

BMX rwrweceptor tyrosine kinase 

ESTs 

ESTs 

ESTs 

transmembrane 4 superfamfly member 1 
upstream regulatory element binding prot 
ESTs 
Pinn 

procollagen-lysine, 2-oxoglutarate 5-dto 
H4 histone family, member G 
metaUoprotease 1 (pitrQysin family) 
KIAA0512 gene product ALEX2 
UM domain binding 2 
ESTs, Weakly similar to 138022 hypothett 
WAA0303 protein 

Sjogren syndrome antigen A2 {60 kD, ribon 
Homo sapiens cONA; FU23037 fis, clone L 
thymosin, beta, identified In neuroblast 
hypothetical protein MGC3178 
myosin X 

doneHQ0310PRO0310p1 

prostate differentia Son factor 

Interleukin 1 receptor-tike 1 

hypothetical protein FU20373 

hypothetical protein FU10210 

Homo sapiens cONA: FU23197 fis, clone R 

ESTs, Weakfy similar to B36298 proline-f 

eukaryotfc translation initiation factor 

gap Junction protein, alpha 1, 43kD (con 

vesicle-associated membrane protein 5 (m 

FK506-binding protein 1A(12kO) 

SEC14(S.cerevisiaeHike1 

gycyURNA synthetase 

heparan sulfate proteoglycan 2 (perlecan 

myristoylated alanine-nch protein kinas 



tissue factor pathway inhibitor 2 
platatef endothelial cefl adhesion molec 
laminin, alpha 4 
WAA0096 protein 
IGF-llmRNA-blnding protein 3 
complement component C1q receptor 
melanoma cefl adhesion molecule 
SRY (sex determining region Y)-box 4 
Oirombospondin 1 

nudtx (nucleoside diphosphate linked moi 

SRY (sex determining region Y>box 4 

Homo sapiens mRNA; cONA DKFZp586E1624 (f 

serine (or cysteine) proteinase Inhbito 

thymldylate synlhetase 

peripheral myelin protein 22 

KIAA0015 gene product 

adrenornedullin 

ifbonuctease, RNase A family, 1 (pancrea 
thyroid hormone receptor interactor 12 
nidogen2 

Melanoma associated gene 

N-myc downstream regulated 

calmodulin 2 (phosphorytase kinase, dell 

collagen, type VIII, alpha 1 

RNA binding motif, single stranded Inter 
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100658 HG2855-HT2995 
100676 HG30444I73742 
100718 HG3342-HT3519 
100752 HG3543-HT3739 
100828 HG4069-HT4339 
100850 HG417-HT417 
100991 J03764 
101097 L08797 
101110 L08246 
101142 112711 
101156 L13977 
101168 L15368 
101184 L19871 
101192 L20859 
101317 L42176 
101336 L49169 
101345 L76380 
101400 M15990 
101475 M23254 
101485 M24736 
101496 M26576 
101505 M27396 
101543 M31166 
101557 M31994 
101560 M32334 
101587 M35876 
101592 M36429 

101633 M57730 

101634 M57731 
101667 M60858 
101682 M62994 
101714 M68874 
101720 M69043 
101741 M74719 
101744 M75126 
101793 M&4349 

101837 M92&43 

101838 M92934 
101840 M93056 
101857 M94856 
101864 M95787 
101931 S76965 
101966 S81914 

102012 U03057 

102013 U03100 
102024 U03877 
102059 U08021 
102121 U14391 
102283 U31384 
102300 U32944 
102378 U40369 
102395 U41767 
102460 U43959 
102491 U51010 
102499 U51478 
102523 U53445 
102560 U59289 
102564 U59423 
102589 U62015 
102600 U63825 
102645 U67963 
102687 U73379 
102693 U73824 
102709 U77604 
102759 U81607 
102804 U89942 
102882 X04412 
102907 X06985 
102915 X07820 
102927 X12876 
102960 X15729 
103011 X52541 
103020 X53416 
103029 X54489 
103036 X54925 
103056 X57206 



U56725 Hs.180414 
X02761 Hs.287820 
BE295928 HsJ5424 
T81309 

AL043753 Hs.303649 
AA836472 Hs.297939 
J03836 Hs.82085 
BE245301 Hs.89414 
AI439011 Hs.86386 
L12711 Hs.89643 
AA340987 Hs.75693 
NH_005308Hs^11569 
NMJ»1674Hs.460 
BE247295 Hs.78452 
L42176 Hs^302 
NM_006732Hs.75678 
NM_005795Hs.152175 
M15990 Hs.194148 
BE410405 Hs76288 
AA296520 Hs.89546 
X12784 Hs.119129 
AA307680 HsJ5692 
M31166 Hs.2050 
BE293116 Hs.76392 
AW958272 Hs.347326 
AI752416 Hs.77326 
AF064853 Hs.91299 
NM_004428Hs.1624 
AV650262 Hs.75765 
NIVL005381 
AF043045 Hs.81008 
M68874 H2L211587 
M69043 Hs.81328 
NM_003199Hs^26198 
AI879352 Hs.118625 
W01076 Hs.278573 
M92843 Hs.343586 
BE243845 Hs.75511 
AA236291 Hs.183583 
BE550723 Hs.153179 
BE392588 Hs.75777 
NM_006823HsJ5209 
X96438 Hs.76095 
BE259035 Hs.1 18400 
BE616287 Hs.178452 
AA301867 Hs.76224. 
A1752666 Hs.76669 
NM_004998Hs.82251 
AW161552 Hs.83381 
A1929721 Hs.5120 
AU076887 Hs.28491 
AU077005 Hs.92208 
U48959 Hs.211582 
U51010 

BE243877 Hs76941 
053445 Ha.15432 
R97457 Hs.63984 
U59423 HsJ9067 
AU076728 Hs,8867 
AI984144 Hs.66713 
AL119566 Hs.6721 
NM.007019HS.93002 
AA532780 Hs.183684 
AA122237 Hs.81874 
NM_005100Hs788 
NM.002318HS.83354 
A1767736 Hs.290070 
BE409861 Hs.202833 
X07820 Hs.2258 
BE512730 H3.65114 
AI904738 Hs.76053 
AJ243425 Hs.326035 
X53416 Hs.195464 
AW800726 Hs.789 
M13509 Hs.83169 
Y18024 HS78877 



heat shock 70© protein 2 
fibjonecfin 1 

Inhibitor of ONA binding 1, dominant neg 
InsulWilce growth factor 2 (somatomedl 
small Inducible cytokine A2 (monocyte ch 
cafhepsin B 

serins (or cysteine) proteinase tnhlbfto 

chemoktoe (C-X-C motif), receptor 4 (fus 

myeloid ceD leukemia sequence 1 (BCl2-r 

transketee (WemJcke-Korsatoff syndro 

prolytcarboxypepfidase (angtotenshase C 

G proteln-coup!ed receptor kinase 5 

activating transcription factor 3 

solute carrier family 20 (phosphate tran 

four and a half LIM domains 2 

FBJ murine osteosarcoma viral oncogene h 

calcitonin receptor-like 

v-yes-1 Yamaguchl sarcoma viral oncogene 

calpain 2, (m/ll) braesubunfi 

selectln E (endothelial adhesion molecut 

collagen, type IV, alpha 1 



pentaxin-retated gene, rapidly Induced b 
aJdehyde dehydrogenase 1 femfly, member 
Intercellular adhesion molecule 2 
insuCn-fike growth factor binding prate 
guanine nucleotide binding protein (G pr 
ephrin-A1 
GR02 oncogene 
nucfeoDn 

Ramin B, beta (acfJn-cinding proteln-2 
phosphoiipase A2, group I VA (cytosolic, 
nuclear {actor of kappa fight polypep&d 
transcription factor 4 
hexoWnase 1 

C059 antigen p1&-20 (antigen identified 
zinc finger protein homologous to Zfp-36 
connective tissue growth factor 
serine (or cysteine) proteinase Inhiblto 
fatty acid binding protein 5 (psoriasis- 
transgefin 

protein kinase (cAMP-de pendent, cataiyfi 
Immediate early response 3 
singed prosophllaHlke (sea urchin fas 
catenln (cadherin-assodated protein), a 
EGF-containing ffbuDn-Kke extra ceiluta 



myosin IE 

guanine nucleotide binding protein 1 1 
dynein, cytoplasmic, light polypeptide 
spermine/spermine Nl-acetyibransferase 
a disfntegrin and metafloproteinase doma 
myosin, light polypeptide kinase 
gb'Xuman nicotinamide N-methyttransferas 
ATPase, Na^+ transporting, beta 3 poly 
downregulated in ovarian cancer 1 
cadherln 13. H-cadherfn (heart) 
MAD (mothers against decapentaptegic, Or 
cystetne-rlch, angiogenic inducer, 61 
hepatitis delta anflgen-interacfing prat 



uWqultjn carrier protein E2-C 
eukaryotic translation Initiation factor 
microsomal glutathione S-transferase 2 
A kinase (PRKA) anchor protein (gravf n) 
lysyl oxidase4ike 2 
gefeo&n (amyloidosis. Finnish type) 
heme oxygenase (decycfing) 1 
matrix metalloprotetoase 10 (stromelysin 

DEAD/H (Asp^AraAsjVHis) box potypep 
earty growth response 1 
fiiamln A, alpha (acflrvbtndlng protein- 
GR01 oncogene (melanoma growth stimulati 
matrix metalbproteiiase 1 (Interstitial 
Inositol 1.4,5-trfsphosphate 3-klnase B 
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103080 X59798 
103095 X60957 
103138 X55965 
103176 X69111 
103195 X70940 
103347 X87838 
103371 X91247 
103432 X97748 
103471 Y00815 
103987 AA303711 
104447 L44538 
104764 AA025351 
104783 AA027050 
104798 AA029462 
104865 AA045136 
104877 AA047437 
104894 AA0540S7 
104952 AA071089 
105113 AA156450 
105178 AA187490 
105196 AA195031 
105215 AA205724 
105263 AA227926 
105271 AA227986 
105330 AA234743 
105461 AA253216 
105492 AA256210 
106493 AA256268 
105594 AA279397 
105727 AA292379 
105732 AA292717 
105767 AA346551 
105682 AA400292 
105936 AA404338 
106031 AA412284 
106124 AA423987 
106222 AA426594 
106241 AA430108 

106263 AA431462 

106264 AA431470 
106366 AA443756 
106454 AA449479 
106634 AA459916 
106724 AA465226 
106793 AA478778 
106799 AA479037 
106842 AA482597 
106868 AA487561 
106890 AA489245 
106961 AA504110 
106974 AA520989 
107030 AA599434 
107061 AA608649 
107086 AA609519 
107216 051069 
107385 U97519 
107444 W28391 
107985 AA035638 
108507 AA083514 
108695 AA121315 
108931 AA147186 
109001 AA156125 
109195 AA188932 
109390 AA219653 
109456 AA232645 
109737 F10078 
110411 H48032 
110660 H82117 
110906 N39584 
111018 N54067 
111091 N59858 
111356 N90933 
111378 N93764 
111741 R26124 
111769 R27957 
112318 R55470 



AU077231 Hs.82932 
KM_005424Hs.78824 
X65965 

AL021154 Hs.76884 
AA351647 Hs.2642 
AU077309 Hs.171271 
X91247 Hs.13046 
X97748 

Y00815 Hs.75216 
AL120051 Hs.144700 
AW204145 Hs.156044 
AI039243 Hs2785B5 
AA533513 Hs.93659 
AW952619 Hs.17235 
T79340 Hs22575 
AI138635 Hs.22968 
AF065214 Hs.18858 
AW076098 Hs.345588 
AB037816 Hs.8982 
AA313825 I&21941 
W84893 Hs5305 
AA205759 Hs.10119 
AW388633 Hs.6682 
AA807881 Hs.25329 
AW336625 Hsi2120 
BE539071 Hs.69388 
AI805717 Hs.289112 
AL047586 Hs.10283 
AB024334 H&25001 
AI1351S9 Hs^034Q 
AW504170 Hs.274344 
AW370946 Hs.23457 
W46802 Hs.81988 
AI678765 Hs.21812 
X64116 Hs.171844 
H93366 Hs.7567 
AA356392 HsJ21321 
BE019681 Hs.6019 
W21493 Hs^8329 
AL046859 Hs^407 
AA186715 Hs.336429 
NM_014038Hs.5216 
W25491 Hs.288909 
N48670 Hs.28631 
H94997 Hs.16450 
BE313412 Hs.7961 
AF124251 Hs.26054 
BE185536 H 3^0 11 83 
AA489245 Hs.88500 
AW243614 Hs.18063 
A1817130 Hs.9195 
AL117424 Hs.25035 
BE147611 Hs.6354 
NM_012331Hs.26458 
051069 Hs.211579 
NWL005397HS.16426 
W28391 Hs.343258 
T40064 Hs.71968 
A1554545 H3.68301 
AB029000 Hs.70823 
AA147186 

AI056548 Hs.72116 
AF047033 Hs. 132904 
AW007485 Hs.87125 
AW956580 Hs.42699 
AA055415 Hs.13233 
AW001579 Hs.9645 
AA782114 H328043 
AA035211 Hs.17404 
AI287912 Hs.3628 
AA300067 HS.33032 
BE301871 Hs.4867 
AW160993 Hs.326292 
AB020653 Hs.24024 
AW629414 Hs.24230 
AW083384 Hs.11067 



cycTm 01 (PRAD1; parathyroid adenomatos 

tyrosine kinase wflh Immwwglobulln and 

gb:H.sapians SOD-2 gene for manganese su 

Inhibitor of DMA binding 3, dominant neg 

eukaryotfc translation elongation factor 

catenin (cadhenn-associated protein), b 

thioredoxin reductase 1 

gbHsaptens PTX3 gene promotor region. 

protein tyrosine phosphatase, receptor t 

ephrin-B1 

ESTs 

ESTs 

Homo sapiens done TCCCIA00176 mRNA sequ 

B-ceO CUJymphoma 6, member B (zinc fi 

Homo sapiens done IMAGE:451939, mRNA se 

phosphoOpase A2, group IVC (cytosodc, 

desmopiakin (DPI, DPI!) 

Homo sapiens, clone IMAGE:3506202, mRNA, 

AD036 protein 

angiotensin receptor-like 1 

hypothetical protein RJ 14957 

solute carrier family 7, (cailonlc amino 

ESTs 

ESTs 

hypothetical protein FU20505 

CGW3 protein 

RNA binding motii protein 88 

tyrosine 3-monooxygenasa/tryptophan 5-mo 

WAA1002 protein 

hypothetical protein MGC12942 

ESTs 

disabled (DrosophOa) homobg 2 (mitogen 
ESTs 

Homo sapiens cDNA: FU22296 Us, done H 

Homo sapiens cONA; FU21962 fls, done H 

Homo sapiens done FLB9213 PR02474 mRNA, 

Homo sapiens cDNA: FU21288 fls, done C 

hypoflietical protein FU14005 

protein kinase (cAMP-dependent, catah/ti 

RIKEN cDNA 9130422^119 gene 

HSPC028 protein 

hypotheficaJ protein FU22471 

Homo sapiens cDNA: RJ22141 fis, done H 

ESTs 

Homo sapiens done 25012 mRNA sequence 
novel SH2<ontatntng protein 3 
molecule possessing ankyrtn repeats indu 
mrtogon-actlvated protein kinase 8 Inter 
Homo sapiens cONA FU 10768 fis, done NT 
Homo sapiens cDNA FU 13698 fis, done PL 
chloride intracellular channel 4 
stromal cell derived (actor receptor 1 
methionine sulfoxide reductase A 
melanoma celi adhesion motearfe 
pcdocalyxto-Cke 

proHferation-associated 2G4, 38kO 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 

ESTs 

WAA1077 protein 

gb:zo38d01.s1 Stratagene endomeiial eel 
hypothetical protein FU20992 similar to 
solute carrier family 4, sodium bicarbon 
EH-domain containing 3 
ESTs 

ESTs, Moderately similar to A47582 B-cel 
Homo sapiens mRNA for KIAA1741 protein, 
ESTs 
ESTs 

mitogervactivated protein kinase kinase 
hypothetical protein DKFZp434N185 
mannosyl (alpha-1 3-^lycoprotein beta- 
nypometfcalgene DKFZp434A1114 
K1AA0846 protein 
ESTs 

ESTs, Highly similar to T46395 hypothetl 
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112951 
113057 
113195 
113490 
113542 
113803 
113847 
113910 
113947 
114047 
115061 
115819 
115870 
115964 
116228 
116264 
116314 
116589 
117023 
117112 
117156 
117176 
117280 
119559 
119866 
120655 
121314 
121335 
121822 
121835 
122331 
122577 
123160 
123486 
124059 
124339 
124358 
124364 
124726 
124763 
125167 
125304 
125307 
125329 
125598 
125609 
418245 
127435 
127566 
127619 
128453 
12B495 
128515 
128580 
128623 
128642 
128669 
128903 
128914 
129087 
129188 
129226 
129265 
129345 
129468 
129488 
129498 
129557 
129619 
129627 
129762 
129884 
130018 
130147 
130178 
130282 



T16550 

T26674 

757112 

T88700 

T90527 

W42789 

W60002 

W78175 

W84768 

W94427 

AA253217 

AA426573 

AA432374 

AA446622 

AA478771 

AA482594 

AA490588 

D59570 

H88157 

H94648 

H97538 

H98670 

N22107 

W38197 

W80814 

AA287347 

AA402799 

AA404418 

AA425107 

AA425435 

AA442872 

AA452860 

AA488687 

AA599674 

F13673 

H99093 

N22495 

N23031 

R15740 

R39610 

W45560 

239833 

Z40583 

AA825437 

R66613 

AA866063 

AA1 28075 

N66570 

AI051390 

AA627122 

X02761 

AF010193 

AA149044 

U82108 

078676 

L35240 

AA598737 

R69417 

AA232837 

N72695 

M30257 

M96843 

X68277 

AA292440 

J03040 

AA228107 

AA4497B9 

W01367 

AA610116 

AA258308 

AA460273 

AA286710 

T68873 

063476 

M62403 

X55740 



AA307634 Hs.6650 
AW194301 Hs.339283 
H83265 Hs.8881 
BE178110 Hs.173374 
H43374 Hs.7890 
AW88O709 Hs^83683 
NM.005032Hs.4114 
AA1 13262 Hs.17901 
W84768 

AL035858 Hs.3807 
AJ751438 Hs.41271 
AA486620 Hs.41135 
NM.005985HS.48029 
AA987568 Hs.74313 
AI767947 HsJ0841 
051174 Hs.272239 
AT799104 Hs.178705 
AI557212 Hs.17132 
AWQ70211 Ha.102415 
AW969999 Hs.293658 
W73853 

H45100 Hs.49753 
M18217 Hs.172129 
W38197 

AA496205 Hs.193700 
AA305599 Hs .238205 
W07343 Hs.1 82538 
AA404418 
AI743860 

AB033O30 Hs.300670 
AL133437 Hs.1 10771 
AA829725 Hs.334437 
AA488687 Hs.234235 
BE019072 Hs.334802 
BE387335 Hs.283713 
H99093 Hs.343411 
AW07O211 Hs.102415 
AF265555 Hs.250646 
NM_003654Hs. 104576 
BE410405 Hs.76288 
AL137540 Hs.102541 
AL359573 Hs.124940 
AW580945 Hs.330466 
AA825437 Hs.58875 
T40064 Hs.71968 
AA868063 Hs.104576 
AA088767 Hs.83883 
X69086 Hs.286161 
AI051390 Hs.1 16731 
AA627122 Hs.163787 
X02761 Hs.287820 
NM_005904Hs.100602 
BE395085 Hs.10086 
U82108 Hs.101813 
BE076608 Hs.105509 
Z28913 Hs.102948 
W28493 Hs.180414 
AW150717 Hs.345728 
AW867491 Hs.107125 
AI348027 Hs.108557 
NM.001078Hs,109225 
BE222494 Hs.1 80919 
AA530892 Hs.171695 
R22497 Hs.110571 
AW410538 Hs.111779 
AW966728 Hs.54642 
AA449789 HsJ5511 
AL045404 Hs.46366 
AA209534 Hs.284243 
T40064 Hs.71968 
AA453694 Hs.12372 
AF055581 Hs.13131 
AA353093 

063476 Hs.172813 
U20982 Hs.1516 
BE245380 Hs.153952 



vacuolar protein sorfing 45B (yeast homo 
Human DNA sequence from done RP1-187J1 1 
ESTs, WeaMy sMtar to S41044 chromosom 
Homo sapiens cDNA RJ10500 (Is, done NT 
Homo sapiens mRNA for KLAA1671 protein, 
chjoroosome 8 open reading frame 4 
piastm 3 (T isofomt) 

Homo sapiens, done IMAGE3937015, mRNA, 
gb:zh53dQ3.s1 Scares jBtaUiver_sp!een_ 
FXYD domain-containing ion transport reg 
Homo sapiens mRNA U tengft Insert cDN 
endomucln-2 

snail 1 (drosophBa homoiog), zinc ftnge 

K1AA1265 protein 

ESTs 

lysosomal 

Homo sapiens cDNA RJ1 1333 fist, done PL 
ESTs, Moderately similar to 154374 gene 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs 
ESTs 

uveal autoanfjgen with rated col domai 
Homo sapiens cDNA: FU21409 fis, done C 
En^iricatly selected from AFFX single pr 
Homo sapiens mRNA; cONA OKFZp586l0324 (f 
hypoffieflcat protein PRO2013 
phospholipid scramblase 4 
gb:zw37e02,s1 Scares JotaJJetus_Nb2HF8__ 
metaOothlonein 1E (functional) 
WAA1204 protein 

Homo sapiens cONA: RJ21904 fis, done H 
hypotheflcal protein MGC4248 
ESTs, Weakly similar to 138022 hypotheti 
Homo sapiens cONA FU14680 fis, done NT 
ESTs, Weakly similar to S64054 hypotheti 
DEAO/H (Asp-Glu-Ala-Mp/HIs) box polypep 
Homo sapiens mRNA; cONA OKFZp586N0121 (f 
bacubviral IAP repeat-oontaWng 6 
carbohydrate (keratan sulfate Gai-6) sul 
calpain 2, (mfll) large subunlt 
neb1n4 

GTP-bindlng protein 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA OKFZp564F053 (fr 
carbohydrate (keratan sulfate GaW) sul 
transmembrane, prostate androgen Induced 
Homo sapiens cDNA RJ13613 fis, clone PL 
ESTs 
ESTs 

fibronedJn 1 

MAO (mothers against decapentapleglc Or 
type I transmembrane protein Fn14 
solute carrier family 9 (sooTunVhydrogen 
CTL2gene 

enigma (UM domain protein) 
heat shock 70kD protein 8 
STATInducedSTATInraWtor3 
ptasmaJemma vesicle associated protein 
hypothetical protein PP1057 
vascular ceO adhesion molecule 1 
innfcitorof DNA binding 2, dominant neg 
dual specificity phosphatase 1 
growth arrest and DNA-damage-indudbte, 
secreted protein, acidic, cystelne-rtch 
methionine adenosytiranaferase II, beta 
connective tissue growth factor 
KIAA0948 protein 
tetraspan NET-6 protein 

Homo sapiens mRNA; cONA OKFZp564F053 (fr 
tripartite motif protein TR1M2 



metaUofhioneln 1L 

PAK-lnteracting exchange factor beta 
InsuBn-t&e growth factor-binding prote 
5 , nudeotldase(CD73) 
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130431 L102S4 
130495 AA243278 
130553 AA430032 

130638 H16402 

130639 D59711 
130657 T94452 
130686 AA431571 
130776 R79356 
130818 AA280375 
130840 Z49269 
130899 Z41740 
131002 AA121543 
131060 J05008 
131084 AA101878 
131091 T35341 
131107 N87590 
131162 AA256153 
131207 W74533 
131319 U25997 
131328 V01512 
131328 V01512 
131328 V01512 
131328 V01512 
131509 X56681 
131555 AA161292 
131564 AA491465 
131573 AA046593 
131692 D50914 
131756 D45304 
131859 M90657 
131909 W69127 
131915 AA316186 
132046 AA384503 
132050 AA136353 
132151 AA044755 
132164 U84573 
132187 AA058911 
132303 AA620962 
132314 AA285290 
132358 X60486 
132398 R31641 
132421 AA489190 
132490 F13782 
132520 AA257993 
132546 M24283 
132610 AA443114 
132716 T35289 
132B40 N23817 
132883 AA047151 
132968 N77151 
132989 AA480074 
132999 Y00787 
133071 T99789 
133076 W84341 
133099 L09209 
133147 D12763 
133149 T16484 
133161 AA253193 
133200 AA432248 
133220 X82200 
133260 AA083572 
133295 L00352 
133349 N75791 
133391 X57579 
133398 X02612 
133436 H44631 
133454 AA090257 
133478 X83703 
133491 L40395 
133510 AA227913 
133517 X52947 
133526 M11313 
133538 L14837 
133562 M60721 
133584 090209 
133590 T67986 



AW505214 Hs.155560 
AW250380 Hs.109059 
AF062649 Hs.252587 
AW021276 Hs.17121 
AI557212 Hs.17132 
AW337575 Hs.201591 
BE548267 Hs337986 
AF167706 Hs.19280 
AW190920 Hs. 19928 
BE048821 Hs.20144 
A1077288 Kjl296323 
ALO 50295 Hs 22039 
NR-L001955H9J2271 
NM_017413H3J03084 
AJ271216 Hs22880 
BE620886 Hs.75354 
A1824144 Hs.23912 
AF104266 Hs.24212 
NMJM3155H&25590 
AW939251 H&25647 
AW939251 Hs.25647 
AW939251 Hs.25647 
AW939251 Ha .25647 
X56681 HsJ780 
T47364 Hs.278613 
T93500 Hs.28792 
AA040311 Hs.28959 
BE559681 Hs.30736 
AA443966 Hs.31595 
AW960564 

NMJ)16558Hs.274411 
AJ161383 HS.34549 
AI359214 Hs.179260 
AI267615 Hs.38022 
BE379499 Hs.173705 
AI752235 Hs.41270 
AA235709 Hs.4193 
BE177330 Hs.325093 
AF1 12222 Hs.323806 
NM_003542Hs.46423 
AA876616 Hs.16979 
AW163483 Hs.48320 
NMJX)1290Hs.49&0 
AA257992 Hs.50651 
M24283 Hs. 168383 
AA160511 H35326 
BE379595 Hs.283738 
BE218319 Hs.5807 
AA373314 Hs.5897 
AF234532 Hs.61638 
AA480074 Hs.331328 
Y00787 Hs.624 
BE384932 Hs.64313 
AW946276 Hs.6441 
W16518 Hs.279518 
AA026533 Hs.66 
AA370045 Hs.6607 
AW021103 Hs.6631 
AB037715 Hs.183639 
NM.006074HS.318501 
AA403045 Hs.6906 
AI147861 Hs.213289 
AW631255 Hs.8110 
AW103364 Hs.727 
NMJXJ0499HS.72912 
BE294068 Hs.737 
BE547647 Hs.177781 
X83703 Hs.31432 
BE619053 Ks.170001 
AW880841 Hs.96908 
NM_000165Hs.74471 
AU077051 Hs.74561 
NrVt_003257Hs.74614 
M60721 Hs.74870 
090209 Ka.181243 
T70956 Hs.75106 



cahaxin 

rnitocbondrfai rtbosomai protein L12 
pituitary tunw^ranstonrfng 1 
ESTs 

ESTs, Moderately stmflar to 154374 gene 
ESTs 

Homo sapiens cDNA FU10934 Gs, done OV 

cystetne-rlch motor neuron 1 

hypofiiefcaJ protein SP329 

small inducible cytokine subfamSy A (Cy 

serarttfgbcocorflooid regulated kinase 

K1AA0753 protein 

endoflie&il 

apelln; peptide Dgand foiAPJ receptor 

dipepfidytpepOdase III 

GCN1 (general control of amino-acid synt 

ESTs 

biropftffln 

stannbcaJdn 1 

v-fos FBJ murine osteosarcoma viral onco 
v^FBJrnurtnecsteosaro^viraJonco 
v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
Jun D protooncogene 
interferon, atoha-todudble protein 27 
Homo sapiens cDNA FU 11041 Gs, done PL 
ESTs 

WAA0124 protein 
ESTs 

transmembrane 4 superfamOy member 1 

SCAN domain-containing 1 

ESTs, Highly similar to S94541 1 done 4 

chromosome 14 open reading frame 4 

ESTs 

Homo sapiens cDNA FU22050 fis, done H 

procollagen-lysine, 2-oxoglutaratB 5-dio 

0KFZP586O1624 protein 

Homo sapiens cDNA FU21210fis. done C 

pinin, desmosome associated protein 

H4 htstone family, member G 

ESTs, Weakly slmflar to A43932 mucin 2 p 

double ring-finger protein, DortJn 

LiM domain binding 2 

Janus kinase 1 (a protein tyrosine Wnas 

intercellular adhesion molecule 1 (COM) 

amino add system N transporter 2; porcu 

casein kinase 1, alpha 1 

GTPase Rab14 

Homo sapiens mRNA; cDNA DKFZp586P1622 (f 
myosin X 

hypolhBJicai protein FU13213 
interieukin 8 

ESTs, Weakly similar to AF257182 1 G-pro 
Homo sapiens mRNA; cONA OKFZp586J021 (fr 
amytokl beta (A4) precursor-Gke protein 
interieukin 1 receptor-like 1 
AXJN1 up-regulated 
hypothetical protein FU20373 
hypothetical protein FU10210 
Homo sapiens mRNA fuD length Insert cON 
Homo sapiens cDNA FU23197 fls, clone R 
tow density lipoprotein receptor (famifi 
L^yorcjxyacyi-Coenzyme A dehydrogenase 
foWbta, beta A (acflvin A, activtn AB a 
cytochrome P450, subfamily I (aromatic c 
immediate early protein 
hypothetical protein MGC5618 
cardiac ankyrin repeat protein 
eukaryofJc translaSon Inrfiation factor 
p53-induced protein 

gap juncfion protein, alpha 1 , 43kD (con 

aJpha-2-macrogiobuDn 

tight junction protein 1 (zona oodudens 

H2.0 (DrosophJlaHike homeo box 1 

activating transcription factor 4 (tax-r 

dusterin (complement lysis Inhibitor, S 
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133617 
133651 
133671 
133678 
133681 
133722 
133730 
133750 
133802 
133825 
133836 
133859 
133889 
133960 
133975 
133977 
134039 
134075 
134081 
134164 
134203 
134238 
134299 
134332 
134339 
134343 
134381 
134403 
134416 
134493 
134558 
134817 
134983 
134989 
135052 
135062 
135069 
135071 
135073 
135170 
135196 
135348 
134404 
439561 
100082 
132817 
130150 
100104 
447973 
332613 
100113 
133980 
100129 
100154 
100169 
129718 
100190 
134742 
100211 
100238 
130283 
134237 
100248 
100256 
100262 
134329 
100281 
100294 
100327 
100335 
134495 
100338 
135152 
■134269 
100372 
134304 



AA148318 

U97105 

T25747 

K02574 

078577 

X53331 

S73591 

X95735 

L16862 

U44975 

M97798 

U86782 

AA099391 

M19267 

D29992 

L19314 

S78569 

U28811 

177886 

C14407 

M60278 

R81509 

AA487558 

D86962 

AM78971 

D50683 

U56637 

MS1199 

M28882 

X15183 

S53911 

U20734 

028235 

AA236324 

AA148923 

AA174183 

AA456311 

L08069 

AA452000 

AA282140 

J02854 

AA442054 

AB000450 

AB002380 

AB003103 

A6004864 

AF000573 

AF008937 

AF0093Q1 

AHJ09368 

D00591 

D00760 

011139 

D14657 

D14878 

D17716 

021090 

026135 

026528 

D30742 

D31762 

D31765 

D31888 

D38128 

038500 

D38551 

D42087 

049396 

D55640 

D63391 

D63477 

063483 

D64015 

079990 

D79997 

D80010 



BE244334 Hs.75249 
AJ301740 Hs. 173331 
AW503116 Hs.301819 
AW247252 
AJ352558 

AW969976 Hs.279009 
BE242779 Hs. 179526 
BE410769 Hs.75873 
AW239400 Hs.76297 
BE616902 Hs.285313 
BE222494 Hs.180919 
U86782 Hs,178761 
U48959 Hs.211582 
M19267 Hs.77899 
C18356 1&295944 
AI125639 Hs.250666 
NM_002290Hs.78672 
NMJD12201HS78979 
AL034349 Hs.79005 
AW245540 Hs.79516 
AA161219 Hs.799 
AA102179 Hs.160726 
AW580939 Hs.97199 
086962 Hs.61875 
R70429 Hs.81988 
050683 Hs.82028 
AJ557280 Hs.184270 
AA334551 

X68264 Ks.211579 
M30627 Hs.289083 
NWL001773Hs35289 
AU076592 Hs.198951 
D28235 Ks.196384 
AW968058 Hs.92381 
AL136653 Hs.93675 
AKD00967 Hs.93872 
AA876372 Hs.93961 
W27190 Hs.94 



W55956 
T53169 
C03577 
U80983 



Hs.94030 
Hs.9587 
Hs.9615 
Hs.268177 



AB000450 Hs.82771 
AF180681 Hs.6562 
AA 130080 Hs.4295 
N27852 Hs.57553 
BE094848 Hs.15113 
AFO08937 

AB011169 Hs.20141 
AF029674 Hs.173422 
NM.001269HS.84746 
AA294921 Hs.348024 
AA469369 Hs.5831 
H60720 Hs.81892 
AL037228 Hs.82043 
NM_002410Hs.121502 
M91401 Hs.178658 
NM_001346Hs.89462 
026528 Hs.123058 
L24959 Hs.348 
NM_012288Hs.153954 
D31765 H3.170114 
NM.015156HS.78398 
025418 Hs.393 
D38500 Hs.278468 
N92036 H&81848 
AF091035 Hs.184627 
AA331881 Hs.75454 
055640 

AW247529 Hs.6793 
063477 HsMm 
DB6864 Hs.57735 
M96954 Hs. 182741 
NMJM4737HS.80905 
NhL014791Hs.1B4339 
BE613486 Hs.81412 



AOP-ribosyfafion factor-fte 6 Irtteraca" 
dihydrapyrimidinase-Oke 2 
ztoc finger protein 146 



matrix Gta protein 
upregutated by l^dihydraxyvftamin 0-3 
zyxln 

G protein-oouptBd receptor kinase 6 

core promoter element binding protein 

tihfcltofofD^bhdlng^ctomlnantneg 

26S proteasome^assodated path homoJog 

myosin, Dg ht porypepfjde kinase 

tropomyosin 1 (alpha) 

tissue factor pathway Inhibitor 2 

hairy {Drosophfla>homolog 

tamlnin, alpha 4 

Golgl apparatus protein 1 

proteki tyrosine phosphatase, receptor t 

brain abundant, membrane attached signal 

diphtheria toxin receptor (tapanb-btndJ 

Homo sapiens cONA FU11680 lis, clone HE 

oornptemontoompooefttClq receptor 

growth factor receptor-bound protein 10 

disabled (DrosophSa) homotog 2 (mitogen 

transforming growth factor, beta recepto 

capping protein (ac&i filament} muscle 

sperm specific antigen 2 

melanoma ceS adhesion molecule 

heat shock 90kO protein 1, alpha 

CD34 antigen 

Jun B proto-oncogene 

prostagiandin-endoperaxide synthase 2 (p 

nudix (nucleoside diphosphate linked moi 

decidual protein Induced by progesterone 

K1AA1682 protein 

Homo sapiens mRNA; cONA DKFZp667D095 (fr 
DnaJ (Hsp40) homotog, subfamily A, membe 
Homo sapiens mRNA: cONA OKFZp586E1624 (f 
Homo sapiens cONA: RJ22290 fo, clone H 
myosin regulatory light chain 2, smooth 
phosphoSpase C, gamma 1 (formerly subty 
vaccinia related kinase 2 
Rho guanine exchange factor (GEF) 12 
proteasome (prosome, macropain) 26S subu 
tousted-ike kinase 2 

homogentlsate 1,2-aJoxygenase (homogenti 
syntaxln 16 

skrdtar to S. cerevisiae SSM4 

K1AA1605 protein 

ctoornosome condensation 1 

v-ral simian leukemia viral oncogene horn 

tissue Inhibitor of metaitoprotetnase 1 

KIAA0101 gene product 

0123 gene product 

mannosyt (abha-1,6-)^tyooprotefn beta- 
RAD23 (S. cerevlslae) homotog B 
diacytgfycerol kinase, gamma (90kO) 
OEADJH (Asp-GhKAIa-Asp/Hls) box poiypep 
catctuiivcafmc<yin-dependent protein kin 
TRAM#e protein 
KJAA0061 protein 
KIAA0071 protein 

prostaglandin (2 (prostacyclin) receptor 
postmdofc segregation increased 2-iike 
RA021 (S. pombe) homobg 
K1AA01 18 protein 



gbrHuman monocyte PABL (pseudoautosornal 

platelet-activating factor acetyihydrola 

WAA0143 protein 

acetyl LDL receptor; SREC 

TTA1 cytotoxic granule-associated RNA-fcl 

Ras association (RalGDS/AF-6) domain fam 

KIAA0175 gene product 

Spin 1 
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100394 
100405 
100418 
133154 
134347 
444099 
10043d 
134593 
100481 
100552 
100591 
100652 
100662 
100899 
100905 
100945 
100950 
100964 
135407 
130149 
131877 
101016 
134786 
134100 
134078 
101132 
134849 
332736 
101152 
135397 
432642 
101168 
421155 
101226 
415138 
134739 
130155 
440536 



101294 
101300 
101310 
130344 
101381 
101381 
415678 
133780 
101396 
101447 
101458 
101470 
134604 
101478 
133519 
131185 
134116 
133999 
130174 
129963 
132983 
133900 
101543 
101545 
101620 
134691 
133595 
101700 
101714 
134246 
101760 
415022 
415022 
415022 
415022 
415022 
101791 



D84276 
086425 
D86978 
087012 
087075 
087432 
087448 
087845 

HG1098-HT1098 
HG2167-HT2237 
HG241&HT2511 
HG282&HT2949 
HG2887-KT3031 
HG466OUT5073 
HG4704+IT5146 
HG884-HT884 
HG919-HT919 
J00212 
J04029 
J04031 
J04088 
J04543 
106139 
L07540 
108895 
L11239 
L11353 
L13773 
L13800 
L14922 
L15189 
L15388 
L16895 
L27476 
L27624 
132976 
L33404 
L35263 
L37347 
L40371 
L40391 
L41607 
L77566 
M13928 
M13928 
M14016 
M14219 
M15796 
M21305 
M22092 
M22898 
M22995 
M23379 
M24400 
M25753 
M27691 
M28213 
M29550 
M29971 
M30269 
M31158 
M31166 
M31210 
M55420 
M59979 
M62810 
M64710 
M68874 
M74524 
M80254 
M81780 
M81780 
M81780 
M81780 
M81780 
M83822 



D84284 Hsj66052 
AW291587 Hs-82733 
D86978 H3J4790 
D87012 Ks.194685 
AF164142 Hs.82042 
087432 Hs.10315 
AA013051 Ks.91417 
KM_000437H3234392 
X70377 Hs.121489 
AA019521 Hs.301946 
NM_004091Hs.231444 
BE613608 Hs.142653 
A1368680 Hs.816 
AL039123 Hs.103042 
L12260 Hs.172816 
AF002225 Ka.180686 
AF128542 Hs. 166846 
J00212 

J04029 Hs.99938 
AW067805 Hs. 172665 
J04088 Hs.156346 
J04543 Hs.78637 
T29618 Hs69640 
AA460085 Hs.171075 
L08895 Ha.78995 
L11239 Hs.36993 
BE409525 Hs.902 
Z83689 Hs.114765 
AI984625 Hs.9884 
L14922 Hs.166563 
BE297635 Hs.3069 
NM.00530BHs.21 1569 
H87879 Hs.102267 
AF083892 Hs.75808 
C18356 Hs.295944 
NNL002419HS.89449 
AA101043 Hs.151254 
W76332 Hs.79107 
BE313625 Hs.57435 
AF168418^Hs.1 16784 
BE535511 
L41607 Hs.934 
AW250122 Hs.154879 
AW675039 Hs.1227 
AW675039 Hs.1227 
AW005903 Hs.78601 
AA557660 Hs.76152 
BE267931 Hs.78996 
M21305 
M22092 

NM_000546Hs.1846 
NM.002884Hs.865 
NM.002890Hs.758 
AW583062 Hs.74502 
BE280074 Hs.23960 
R84694 Hs.79194 
AA535244 Hs.78305 
M29551 Hs.151531 
M29971 Hs.1384 
M30269 

M31158 Hs.77439 
M31166 Hs.2050 
BE246154 Hs.154210 
S55271 Hs.247930 
AW382987 Hs.88474 
AA393273 Hs.75133 
D90337 Hs.247916 



C038 antigen (p45) 

nkJogen 2 

WAA0225 protein 

topotsornerase (DMA) til beta 

solute earner family 23 (nudeobase tra 

solute carrier famKy 7 (caflonfc amino 

topoisomarase {DNA) tl binding protein 

platelet^acttvafing factor acetythydrola 

cystafinD 

lysosomal 

Homo sapiens, Skitter to hypothefical pr 
ret finger protein 

SRYIsexoetennWng region Y)-box2 
mterotiihule-associated protein 1B 
neureguEn 1 

ubJquffin protein Bgase E3A (human papi 
polymerase (DNA directed), epsiton 
EmpWcaDy selected from AFFX single pr 
keratin 10 (epkfermoty&c hyperkeratosis 



i (DNA) R alpha (170kO) 
annextn A7 

TEK tyrosine kinase, endothelial (venous 
replication factor C (acfivator 1) 5 (36 
MADS box transcription enhancer factor 2 
gastruiation brain homeo box 1 
rieurofibromh 2 (bilateral acoustic neur 
myeioioVlympnold or mixed-lineage teukem 
spindle pole body protein 
replication factor C (activator 1) 1 (14 
heat shock 70kD protein 9B (mortalln-2) 
G protein-coupled receptor kinase 5 
lysyl oxidase 

fight junction protein 2 (zona ocdudens 
tissue factor pathway inhibitor 2 
mftogen-acflvated protein kinase kinase 
kalfixreln 7 (chymotrypfic, stratum com 
mitogen-acttvated protein kinase 14 
sohjte carrier famDy 11 (protorvcoupled 
thyroid hormone receptor interactor 4 
transmembrane trafficking protein 
glucosamine (N-acetyl) transferase 2, 1 
DiGeorge syndrome critical region gene D 
aminotevutinate, delta-, dehydratase 
arninotevuflnate, delta-, dehydratase 
uroporphyrinogen decarboxylase 
decorin 

proliferating ceil nuclear antigen 
gfcHuman alpha satellite and satellite 3 
gtxHuman neural cell adhesion molecule ( 
tumor protein p53 (U-Fraumenl syndrome) 
RAP1A, member of RAS oncogene family 
RAS p21 protein activator (GTPase actrva 
cfiyinotrypsinogen B1 
cycOnBI 

cAMP responsive element binding protein 
RA82, member RAS oncogene family 
proteki phosphatase 3 (formerly 2B), cat 
O-6-rnethyiguanIne-ONA mefhyrtransf erase 
nldogen (enacfjn) 

protein kinase, cAMP-de pendent reguJato 
pentaxln-related gene, rapidly Induced *b 



M68874 
D28459 
M80254 
X59960 
X59960 
X59960 



X59960 
M83822 



Hs.211587 
Hs.80612 
Ha.173125 
Hs.77813 
Hs.77813 
Hs.77813 
Hs.77813 
Hs.77813 
Hs.62354 



EpsBon.lgE 

prostaglarclin-enaoperoxlde synthase 1 (p 
. transcrfption factor 6-Hke 1 (mitochond 
natriuretic peptide precursor C 
prKspholipase A2, group tVA (cytosoUc 
ubk^Uin-corifug ating enzyme E2A (RAD6 h 
pepSdytpcolyt isomerase F (cyctophffln 
sphingornyefln phosphodiesterase 1, add 
sphingomyelin phosphotfiesterase 1, acid 
sphingomyelin phosphodiesterase 1, add 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1. acid 
cell division cycle 4-fike 
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101812 
101813 
133396 
428161 
129026 
101901 
134831 
134039 
442355 
101975 
101977 
101978 
101998 
102003 
102007 
102009 
416658 
132951 
135389 
102048 
130145 
303153 



M86934 
M87338 
M96326 



M98B33 
S66793 
S72370 



S79873 



102095 
102123 
102126 
102133 
102139 
102162 
102164 
427653 
131617 
102200 
102210 
102214 
132811 
131319 
102256 
132316 



417526 
102293 
102298 
102325 
428734 
102361 
102367 
102388 
102394 
129829 
102409 
133746 
102423 
132828 
132828 
132828 
132828 
425322 
102450 
129350 
102534 
130457 
135065 
102560 
102567 
417173 
102638 
132736 
133070 
102663 
134660 
102735 
102741 
130564 
130564 
132164 



S83365 
U01212 
U01922 
U02556 
U02680 
U03272 
U04209 
U05237 
U07225 
U07620 
U09759 
U09820 
U11313 
U14518 
U14575 
U15173 
U15932 
U18291 
U18300 
U18383 
U20536 
U21551 
U23028 
U23752 
U25435 
U25997 
U28251 
U2B831 
U30245 
U32315 
U32439 
U32849 
U35139 
U36764 
U39400 
U39657 
U41344 
U41766 
U41813 
U43286 
U44378 
U44754 
U47011 
U47011 
U47011 
U47011 
U47077 
U462S1 
U50535 
U56833 



U58837 



U59863 
U67122 
U67319 
U68019 
U69611 
U70322 
U73524 
U79267 
U79291 
U82671 
U82671 
U84573 



BE439894 Hs.78991 
NM.002914HS.139226 
M96326 Hs.72885 
M96954 Hs.182741 
AL120297 Ks.108043 
H38026 Hs308 
AA853479 H&89890 
NM-002290H3.78672 
AA456539 H&8262 
AA079717 Hs^83664 
AF112213 Hs.184062 
BE561810 Hsi809 
U01212 Hs.248153 
U01922 Hs.125565 
U02556 Hs.75307 
BE245149 Hs.82643 
U03272 Hs.79432 
AW821182 Hs*1418 
U05237 Hs.99872 



U07225 
U34820 
U09759 
U72937 
U11313 



Hs.339 

Hs.151051 

Hs.246857 

Hs.96264 

Hs.75760 



NMJJ01809H8.1594 
AW950870 Hs.78961 
AU076845 Hs.155596 
NM_004419Hs.2128 
AA450274 Hs.1592 
NM_000107HsJ7602 
AA159001 Hs.180069 
U20536 Hs.3280 
AA232362 Ha. 157205 
BE619413 Hs.2437 
U23752 Hi32964 
U25435 Hs*7419 
NMjOQ3155HsJ5590 
U28251 Hs.53237 
U28831 Hs.44566 
U30245 

AA568906 Hs.82240 
AF090116 Hs.79348 
AA382169 Hs.54483 
A1815867 Hs.50130 
BE303044 Hs. 192023 
AA223616 Hs.75859 
U39656 Hs.118825 
AA362907 Hs.76494 
NNL003816HS.2442 
AF01Q258 Hs.127428 
BE300330 Hs.118725 
AW4 10035 Hs.75862 
Z47542 Hs.179312 
AB014615 Hs57710 
AB014615 Hs.57710 
AB014615 HS57710 
AB014615 Hs.57710 
U63630 Hs. 155637 
U48251 Hs.75871 
U50535 Hs.110630 
U96759 Hs.198307 
AB014595 Hs.155976 
AA019401 Hs.93909 
R97457 Hs.63984 
U63830 Hs.146847 
U61397 H&81424 
U67319 Hs.9216 
AW081883 Hs.211578 
U92649 Hs.64311 
NMJ»2270Hs.168075 
U73524 Hs.87465 
AF111106 Hs3382 
AW959829 Hs.83572 
U82671 Hs.36980 
U82671 Hs.36980 
A1752235 Hs.41270 



ONA segment numerous copies, expressed 
replication factor C (acfoatar 1 ) 2 (40 
azunxfafln 1 (caflonlc antimicrobial pro 
HA1 cytotoxic grartuk*assodated RNA-ti 
Friend teutemJa virus Integration 1 
arrestln 3, retinal (X-arrestin) 
pyruvate carboxylase 
lamlnin, alpha 4 

Ivsosornal-associated membrane protein 2 

aspartate beta-hydroxytase 

putafive RatxS-tnteracSng protein 

putaOve transmembrane protein; home leg 

olfactory marker protein 

translocase of inner mitochondrial membr 



protein tyrosine kinase 9 
fibrflEn 2 (congenita] contractural ara 
rrtaofibriDar-assodatBd protein 1 
fetal Alzheimer antigen 
purtoerglc receptor P2Y, G-protein coupl 
mitcgen-acfivated protein kinase 10 
mi togen-ecfivated protein kinase 9 
alpha thalassemi a/mental retardation syn 
sterol canter protein 2 
centromere protein A (17kD) 
protein phosphatase 1, regulatory (In nib 
BCL2/adenovirus E1B 19kD-Interacting pro 
dual specificity phosphatase 5 
CDC 16 (ceil dMsion cyde 16, S. cerevi 
damage-specific ONA binding protein 2 (4 
nuclear respiratory factor 1 
caspase 6, apoptosls-relatsd cysteine pr 
branched chain aminotransferase 1, cytos 
eukaryotlc translation initiation factor 
SRY (sex determining region Y)-bax 1 1 
CCCTC-binding factor (zinc finger protei 
stanntocaMn 1 

ESTs, Highly slmBar to Z169„HUMAN ZINC 
WAA1641 protein 

gb:Human rrryelomonocytlc specific protein 
syntaxin 3A 

regulator of G-protein signalling 7 
Nnmyc (and STAT) interactor 
necdin (mouse) hornotog 
eukaryotic translation InfflatJon factor 
dtfornosome 1 1 open reading frame 4 
mjtogen-actrvated protein kinase kinase 
proline arginhe-rich end ieudne-rich r 
a dlsintegrin and metaik) proteinase do ma 
homeoboxA9 

seleno phosphate synthetase 2 
MAD (mothers against decapentapleglc, Dr 
small nuclear RNA activating complex, po 
fibroblast growth factor 8 (androgerwnd 
fibroblast growth factor 8 (androgervind 
fibroblast growth factor 8 (androgen-ind 
fibroblast growth factor 8 (androgen-ind 
protein kinase, DNA-actfvated, catalytic 
prated kinase C binding protein 1 
Human BRCA2 regbn, mRNA sequence O3006 
von HippeMJndau binding protein 1 
cuffln 4B 

cydlc nucleotide gated channel beta 1 

cadhertn 13. H-cadherin (heart) 

TRAP family rnember-assoclated NFKBactjv 

ubiquitin-iike 1 (sentrtn) 

caspase 7, apoptosis-felated cysteine pr 

Homo sapiens cONA: FU23037 fis, done L 

a cBstntegrin and metaEoproteinase doma 

karyopherin (importln) beta 2 

ATP/GTP-birxfing protein 

protein phosphatase 4, regulatory subuni 

hypothetical protein MGC14433 

melanoma antigen, family A, 2 

melanoma antigen, famfly A, 2 

procoilagen-fysEne, 2-oxogIutarate 5-dto 
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X64037 



102971 
I34037 
34037 
103Q23 
103037 
130282 
I34542 
28568 
28568 
03093 
13076 
129083 
124460 
11077 
103181 
103184 
103194 
03208 
I29698 
131486 
130729 
103334 
132645 
35094 
103352 X89398 
103352 
103353 
132173 
03371 
31584 
03376 
103378 
128510 
103410 
133490 
332689 
I03438 
103440 
103452 
133536 
420234 



X70649 
X72841 
X74987 
XB3107 
X84194 
X85753 
X87870 



X89399 
X89426 
X91247 
X91648 
X92098 
X92110 
X94703 



02823 U90914 D85390 Ks.5057 cartxaypepttase 0 

U91316 NH_007274Hs^679 cvtosofcacvl coenzyme A fliioesterrtydr 

102831 U91932 AA262170 Hs.80917 adaptor-related protein complex 3, sigma 

I02846 106131 BE264974 Hs.6566 IhyraM hormone receptor Eitteractor 13 

129777 U97018 U97018 Hs.12451 ecNnoderm n^crotubule^ssoclated protel 

134161 U97188 AA634543 HsJ9440 K3F-0 mRNA-Wnding protein 3 

34854 VD0503 JQ3464 Hs. 179573 cofegen, type i, alpha 2 

29257 X04327 AW 163799 Hs, 198265 2,3-bisphosphoglycerate rnutase 

13985 X06389 AI0 18666 Hs.75667 synaptoptrysin 

19768 X07496 T72104 Hs.93194 apoHpoproteln A-l 

02915 X07820 X07820 Hs.2258 matrix metafoproteinase 10 (stromerysin 

34656 X14787 AJ750878 Ks.87409 thfombospondln 1 

X15525 NM.001610HS.75589 add phosphatase % lysosomal 

X16396 AU076611 Hs. 154672 methylene tetrahydrofo(ata dehydrogenase 

X16609 X16609 Hs. 183805 anxyrftl, erythrocytic 

X53586 A1808760 Hs.227730 Weg fin, alpha 6 

X53586 AI808760 Hs.227730 Integrin,a]pna6 

X53793 AW500470 Hs. 117950 mumncflonal polypepQoe similar to S 

X54936 BE0 18302 Ks.2894 placental growth factor, vascular endoth 

X56740 BE245380 Hs.153952 5* nudeotldase (CD73) 

X57Q25 M14156 Hs.65112 taWce growth factor 1 (somatomedl 

X60673 H12912 Hs.274691 adenylate kinase 3 

X60673 H12912 Hs.274691 adenylate kinase 3 

X60708 S79876 Hs.44926 dipepQdylpeptidase IV (CD26, adenosine 

X62048 U10564 Hs.75188 weel (S. pombe) homdog 

X63097 X63094 Hs^83822 Rhesus blood group, D antigen 

BE275979 Hs.296014 polymerase (RNA) II (DMA directed) polyp 

AW977283 Hs.68257 general transcription factor IIF, polype 

XB9636 Hs.334731 Homo sapiens, done IMAGE3448306, mRNA, 

U43143 Hs.74049 fnmelated tyrosine kinase 4 

NM_0O4939Hs.78580 DEAD/H (Asp-Glu-AIa-Asp/Hls) box poh/pep 

AW411340 Hs.31314 retinobtastoma-blndlng protein 7 

BE242144 Hs.12013 ATP-btnding cassette, suo-famfly E (QABP 

F06972 Hs.27372 BMX norweceptor tyrosine kinase 

Al 963747 Hs. 18573 acytphosphatase 1, erythrocyte (common) 

NM_001260Hs25283 cycfirwtependent kinase 8 

AI654712 Hs.54424 hepatocyte nuclear factor 4, alpha 

NM_003304Hs.250687 transient receptor potential channel 1 

H09366 Hs.76853 uradWNAgtycosytase 

H09366 Hs.76853 ura*ONAgrycosytase 

X89399 Hs.119274 RAS p21 protein activator (GTPase activa 

X69426 Hs.41716 endothelial cetepedfic molecule 1 

X91247 Hs.13046 thbredoxfn reductase 1 

AA598509 Hs.29117 punne-rich element binding protein A 

AU336166 Hs.323378 coated veside membrane protein 

AL119690 Hs.153618 HCGVIB-1 protein 

X94703 RAB28, member RAS oncogene family 

AA158294 Hs.295362 DR1«ssodated protein 1 (negaQve cofac 

X97230 AF022044 Hs 274601 klercdl irrtmunoglobunrviike receptor 

X97230 AF022044 Hs. 274601 kffisrcetl irnrnunoglobuMke receptor 

X96263 AW1 75781 Hs.152720 M-phase phosphoprotein 6 

X96296 X98296 Hs. 77578 ubiquito specific protease 9, X chromes 

X99584 NNL006936HS.651 19 SMT3 (suppressor of mil two 3, yeasQ ho 

Y00264 W25797.comp Ks 177486 amyloid beta (A4) precursor protBin (pro 

Y07566 AW404908 Hs.96038 Ric (DrosophibHke, expressed in many 

Y07759 Y07759 Hs,170157 ri^hVAfl>eavy polypeptide^ 

Y07827 NfVL007048Hs^84283 butyrophifin, subfamily 3, member A1 

32083 Y07867 BE386490 Hs.279663 Pirin 

03500 Y09443 AW408009 Hs.22580 alkylgfycerone phosphate synthase 

134389 Y09858 Y09858 Hs.82577 splndln-ttke 

32084 Y12394 NfVL002267Hs.3886 teryophertn alpha 3 (Importin alpha 4) 
103540 Z11559 NM_002197Hs. 154721 aconflase 1, soluble 

133152 Z1 1695 Z11695 Hs.324473 rnilogeivactlvated protein kinase 1 

103548 Z15005 Z15005 Hs.75573 centromere protein E(312kD) 

103612 Z46261 BE336654 Hs.70937 H3 hlstone family, member A 

29092 AA011243 D56365 Hs.63525 potytrC^bindlng piuteh 2 

AA018418 AW137912 Hs.227583 Homo sapiens chromosome X map Xp1 1.23 L- 

103695 AA018758 ' AW207152 Hs.186600 ESTs 

129796 AA018804 BE218319 Hs.5807 GTPase Rab14 

434993 AA031993 AA306325 Hs.4311 SUMO-1 activating enzyme subunft 2 

132683 AA044217 BE264633 Hs.143638 WD repeat domain 4 

131887 AA046548 W17064 Hs.332848 SYYUSNF related, matrix associated, actl 

103723 AA057447 BE274312 Hs.214783 Homo sapiens cONA FU1 4041 te, clone HE 

I53368 AA058376 W20296 Hs.288178 Hcmo sapiens cDNAFUl 1968 Cs, done HE 

133260 AA083572 AA4O3045 Hs.6906 Homo sapiens cDNA: FU23197 fis, done R 

AA085696 AA085696 Hs.169600 WAA0B26 protein 
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103766 


AA088744 


AI920783 Hs.191435 


103767 


AA089688 


BE244667 


132051 


AA091284 


AA393968 Hs.180145 


103773 


AA092700 


AI219323 Hs.101077 


135289 


AA092968 


AW372569 Hs.9788 


409659 


AA094800 


AW970843 Hs.55682 


103794 


AA100219 


AF244135 Hs.30670 


131471 


AA1 14885 


AA164842 Hs.192619 


134319 


AA129547 


BE304999 Hs.285754 


103607 


AA133016 


AW958264 Ha. 103632 


446392 


AA149507 


AF142419 Hs.15020 


129863 


AA151005 


BE379765 Ha. 129872 


103850 


AA187101 


AA187101 Hs.213194 


103855 


AA195179 


W02363 


103861 


AA208236 


AA206236 Ks.4944 


130634 


AA227621 


AI769067 Hs. 127824 


447735 


AA248283 


AA775268 Hs.6127 


103909 


AA249611 


AA249611 Hs.47438 


458928 


AA282640 


AF043117 Hs.24594 


415824 


AA287199 


D42039 Hs.78871 


129013 


AA313990 


AA371156 Hs.107942 


129435 


AA314256 


AF151852 Hs.111449 


103988 


AA314389 


AA314389 Hs.342849 


104000 


AA324364 


AI146527 Hs.80475 


425284 


AA329211 


AF155568 Hs.348043 


128829 


AA399187 


AL096748 Hs.102708 


133281 


AA421079 


AK001601 Hs.69594 


104104 


AA422029 


AA422029 H3.143640 


332455 


AA425230 


NM_005754Hs220689 


132091 


AA447052 


AW954243 


135073 


AA452000 


W55956 Hs.94030 


131367 


AA456687 


AI750575 Hs.173933 


129593 


AA487015 


AI338247 Hs.98314 


133505 


C01527 


AI630124 Hs.324504 


132064 


C01714 


AA121098 Hs.3838 


442351 


C01811 


W52642 Hs.8261 


131427 


C02352 


AF151879 Hs.26706 


433892 


C02375 


AJ929357 Hs.323966 


104282 


C14448 


C14448 Hs.332338 


134827 


016611 


BE314037 Hs.89866 


425330 


D25216 


D25216 Hs. 155650 


131742 


031352 


AA961420 Hs.31433 


456935 


058024 


AA370362 Rs.57958 


425218 


D80897 


NM_014909Hs.155182 


104334 


082614 


D82614 Hs.78771 


134593 


087845 


NMJXW437HS.234392 


134731 


D89377 


D89377 H3.89404 


445776 


H06583 


NM.001310H3.13313 


131670 


H40732 


H03514 Hs.15569 


104394 


H46617 


AA1 29551 Hs.172129 


104402 


H56731 


H56731 Hs.132956 


439130 


H75570 


AA3Q6090 Hs.124707 


129077 


H7S886 


N74724 Hs.108479 


104417 


H81241 


AI819448 Hs.320861 


134927 


L36531 


136531 H&91296 


129280 


M63154 


M63154 Hs.110014 


134498 


M63180 


AW246273 Hs.84131 


104460 


M91504 


AW955705 Hs.62604 


104488 


N56191 


N56191 Hs.106511 


131248 


N78483 


AJ038989 H3.332633 


130017 


R14652 


AK000096 Hs.143198 


104530 


R20459 


AK001676 Hs.12457 


104534 


R22303 


R22303 


104544 


R33779 


AI091173 Hs.222362 


133328 


R36553 


AW452738 Hjl265327 


104567 


R64534 


AA040620 Hs.5672 


129575 


R70621 


F08282 Hs.276428 


130776 


R79356 


AF167706 Hs.19280 


104599 


R84933 


AW815036 Hs.151251 


104660 


AA007160 


8E298665 Hs.14846 


104667 


AA007234 


AI239923 Hs.63931 


104718 


AA018409 


AI143020 Hs.36250 


104764 


AA025351 


AKJ39243 Ks.278585 


104786 


AA027168 


AA027167 Hs.10031 


104787 


AA027317 


AA027317 


134079 


AA029423 


AK001751 H3.171835 



ESTa 

CGMOO protein 
HSPC030 protein 

ESTa, WeaMy similar to T22363 hypothetl 

hypolhefical protein MGC10924 similar to 

euicaryotic translation WMon factor 

hepatDceButarcardnoma-assodatedanfl 

WAA1600 protetn 

fumarate hydratase 

similar to yeast UpD, variant B 

homoiog of mouse quaking OKI (KH domain 

sperm associated antigen 9 

hypofliefical protein MGC10895 

hypometfcal protein RJ10330 

hypothetical protein FU12783 

ESTs, Weakly similar to T28770 hypotheti 

Homo sapiens cDNA: FU2302Q fis, done L 

SH3 domain binding glutamic add-tkb pr 

ubiquiflnaflon factor E4B (homologous to 

mesodenn development candidate 2 

0KFZP564M112 protein 

CGI-94 protein 

ADP-ribosylation factor-like 5 

polymerase (RNA) II (DNA deeded) polyp 

NS 1 -associated protein 1 

DKFZP434A043 protein 

h^h-mobmty group 20A 

ESTs, Weakly similar to hyperpofarlzaflo 

Ras-GTPas^acUvaling protein SH3-dornaln 

WAA0251 protein 

Homo sapiens mRNA; cDNA DKFZp586E1 624 (f 
nuclear factor l/A 

Homo sapiens mRNA; cDNA DKFZp586L0120 (f 
Homo sapiens mRNA; cONA DKFZp586J072O (f 
serurrhinducibie kinase 
hypothetical protein FU22393 
CGI-121 protein 

Homo sapiens clone H63 unknown mRNA 
EST 

coproporphyrinogen oxidase (coproporprryr 

KIAA0014 gene product 

ESTs 

EGF-Tl^^trophiBrHelated protein 
KIAA1036 protein 
phosphoglycerato kinase 1 
platelet-activating factor acetylhydrola 
msh (Drosophila) homeo box homotog 2 
cAMP responsive element binding proteln- 
ESTs 

Homo sapiens cDNA: RJ21409 fis, clone C 

ESTs 

ESTs 

ESTs 

KruppeW&e factor 8 
tntegrtn, alpha 8 

gastric intrinsic factor (vitamin B synt 

ftreonykRNA synthetase 

Homo sapiens, clone IMAG&4299322, mRNA, 

protocadhenn 17 

Bardet-Biedl syridrome 2 

ir^tor of growfii family, member 3 

hypothetical protein FU10814 

gb7h26b09/1 Scares placenta Nb2HP Homo 

ESTs. Weak* similar to p40 [Ksapiens] 

Irypotfietica! protein DKFZp761H41 

hypothetical protein AF140225 

progestin induced protein 

cystetne-ncb motor neuron 1 

ESTs 

Homo sapiens mRNA; cDNA DKFZp5640016 (fr 
ESTs 

ESTs. Weakly similar to 138022 hypothetl 
ESTs 

WAA0955 protein 

gbae97d1U1 SoaresJetaLheartNbHH19W 
hypothetical protein FU10889 
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104604 
04865 
30828 
04907 
04943 
05013 
05024 
32592 
05038 
05077 
05096 
29215 
05169 
32796 
127210 
05200 
30114 
05330 
05337 
22040 
05376 
05397 
431679 
31991 
421305 
05489 
05508 
05539 
35172 
31569 
431129 
I05643 



05674 
! 05709 
I05722 
I05765 
15951 
30884 



06008 
457322 

34222 
446954 
106141 
447973 
106157 
128314 
446727 
106196 

57714 

33200 
,05302 
I06328 
450534 
I06423 
439608 
I06477 

06503 



I06543 
442007 
106589 
08593 
06596 
[23064 
06636 
06654 
31353 
06707 
52909 
06717 
153141 
06747 



AA031357 
AA045136 
AA053400 
AA055829 
AA065217 
AA116054 
AA126311 
AA 129390 
AA130273 
AA142919 
AA150205 
AA176867 
AA180321 
AA 180487 
AA187634 
AA195399 
AA234717 
AA234743 
AA234957 
AA235604 
AA236559 
AA242868 
AA251776 
AA251909 
AA252672 
AA256157 
AA256680 
AA258873 
AA262727 
AA281451 
AA281545 
AA282069 
AA283044 
AA283930 
AA284755 
AA291268 
AA291927 
AA343514 
AA398109 
AA398109 
AA405737 
AA406610 
AA411465 
AA416886 
AA424013 
AA424148 
AA424558 
AA424961 
AA425367 
AA425921 
AA426220 
AA427735 
AA4 30673 
AA432248 
AA435896 
AA436705 
AA446561 
AA448238 
AA449756 
AA450303 
AA452411 
AA454566 
AA454667 
AA456437 
AA456646 
AA456826 
AA456981 
AA458959 
AA459950 
AA460449 
AA463910 
AA464603 
AA464606 
AA465093 
AA465692 
AA476473 



A1858702 Hs.31803 
T79340 Hsi2575 
AW631469 Hi203213 
AA055829 Hs.198701 
AF072873 Hs.114218 
H63789 HS396288 
AA126311 Hs.9879 
AW803564 Hs-286650 
AW5C3733 Hs.9414 
W55946 Hs334863 
AU042506 Hai1599 
AB040930 Hs.126085 
BE245294 Hs.180789 
NM_006283Hs,173159 
BE396283 Hs.1 73987 
AA328102 Hs.24641 
AA233393 Hs. 14992 
AW338625 Hs.22120 
AJ4687B9 Hs.347187 
AA172106 Hs.1 10950 
AW994032 Hs.8768 
AA814807 Hs.7395 
AK000O46 Hs.343877 
AP053306 Hs.36708 
BE397354 Hs.324830 
AA256157 Hs.24115 
AA173942 Hs.326416 
A6040884 Hs. 109694 
AB028956 Hs.12144 
AL389951 Hs.271623 
AL137751 Hs.263671 
BE621719 Hs.173802 
AA283044 H3.25625 
AA426234 H3.34906 
AJ609530 Hs.279789 
AI928962 Hs.26761 
AJ922621 Hs.32433 
AA299688 Hs.24183 
BE546245 Hs.301048 
BE546245 Hs301048 
AW880358 Hs.339808 
AA406610 
AB033888 H3.8619 
AI815486 Hs.243901 
AW855861 Hs.8025 
AB037850 Hs.16621 
AF031463 Hs.8302 
AB011169 Hs.20141 
W37943 Hs.34892 
AW135049 Hs.26285 
AB011095 Hs.1 6032 
AA525993 Hs.1 73699 
AA083764 

AB037715 Hs,183639 
AA398859 Hs.18397 
AL079559 Hs.2S020 
AI570169 Hs35132 
AB020722 Hs.16714 
AW864696 Hs.301732 
R23324 Hs.41693 
A6033042 Hs. 29679 
AA151520 

AA676939 Hs.69285 
AA301116 Hs.142838 
AK000933 Hs28661 
AW296451 Hs.24605 
AA452379 
AF265208 Hs.8740 
AW958037 Hs^86 
AW075485 HsJ286G49 
AW754182 
AK000566 Hs.98135 
NM-.015368H830985 
AA600357 Hsi39489 
AB014548 HU1921 
NMJ0O7116HS.171957 



ESTs, Weakty similar to N-WASP Disapfen 
B-ceU CLUtymphoma 6, member B {zinc fi 
ESTs 

ESTs, Weakly similar to ALU 1 .HUMAN AW S 

frizzled (DrcsophUa) homo log 6 

ESTs, Weakly similar to WAA0638 protein 

ESTs 

Homo sapiens cDNA: FU22528 fis, dona H 
KIAA1488 protein 

Homo sapiens cDNA FU12Q82 fis, done HE 
KoippeMIka factor 7 (ubiquitous) 
K1AA1497 protein 
S164prot9tn 

trartsfcxmtng, addic oolled-cofl contain 
eukaryofic translation Initiation factor 
cytoskeleton associated protein 2 
hypothetical protein aJ 11 151 
ESTs 

nryotubtdarin related protein 1 
Rag C protein 

hypolheflca! protein RJ10849 
hypotheSca) protein FU23182 
hypothetical protein RJ20039 
budding uninhMed by benzfmidazoles 1 
dlpfheria toxto resistance protein requl 
Homo sapiens cONA RJ14178 fis, done NT 
Homo sapiens mRNA; cDNA DKFZp564H1916 (f 
KIAA1451 protein 
K1AA1033 protein 
nucteoportn SOU) 

Homo sapiens mRNA; cDNA DKFZp434l0812 (f 

KJAAQ603 gene product 

hypothetical protein FU1 1323 

ESTs, Wealdy similar to T17210 hypofoeS 

DKFZP586L0724 protein 

ESTs 

ESTs 

seel 3-fike protein 

sect 3-like protein 

hypothetical protein HJ10120 

gb2v15b1(Ls1 Soares_NhHMPu_S1 Homo sapi 

SRY (sex determining region Y>box 18 

Homo sapiens cONA FU20738 fis, done HE 

Homo sapiens done 23767 and 23782 mRNA 

DKFZP434I1 16 protein 

ptiosiudn-Bke 

similar to S. cerevtsiae SSM4 
WAA1323 protein 

Homo sapiens cDNA FU 10643 Ms, done NT 
WAA0523 protein 

ESTs, Wealdy simitar to ALU1_HUMAN ALU S 

hypothetical protein MGC3178 

hypothetical protein RJ 10210 

hypotheticaJ protein RJ23221 

WAA0766 gene product 

WAA0470 gene product 

Rho guanine exchange factor (GEF) 15 

hypothetical protein MGC5306 

DnaJ (Hsp40) homotog, subfamily 8, membe 

cotactorreQUbedforSpl trans crfpfiona 

hypometloal proteh MGC4485 

wuropEnl 

nudeoiar pnosphoprotein Nopp34 

Homo sapiens cDNA RJ 10071 fis, done HE 

ESTs 

ESTs, Moderately similar to ALU7 JfUMAN A 
SWVSNF related, matrix associated, act! 
ribosomai protein L4 
pnospnosenne aminotransferase 
gb*C2-CT0321-131199411-c01 CT0321 Homo 
hypothetical protein FU20559 
pannexin 1 

TIA1 cytotoxic granute-assodated RNA-fal 
K1AA0648 protein 

triple Junctional domain (PTPRF interact 
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100773 M478109 
108781 M478474 
108817 AA460889 
106848 AA485223 
106848 AA465254 
106856 AA486183 
418899 AA496336 
107001 AA598589 
442853 AA595831 
107054 AA600150 
107059 AA608545 
107080 AA609210 
107115 AA610108 
107130 AA620582 
107156 AA621239 
107174 AA621714 
130621 AA621718 
107190 019673 
132626 D25755 
107217 D51095 
332584 060272 . 
444655 T08879 
107295 T34527 
107299 T40327 

107315 T62771 

107316 T83174 
107328 T83444 
107334 T93641 
456340 048263 
128636 U49065 
129938 U79300 
107375 U88573 
130074 U93867 
107387 W01094 
132036 W01S68 
107426 W26853 
135388 W27965 
130419 W36280 
107469 W47063 
434203 W79060 
107506 W88550 
132358 X60486 
107522 X78931 
456495 Z14077 
107582 AA002147 
107609 AA004711 
107661 AA010383 
107714 AA015761 
107775 AA018772 
107832 AA021473 
107859 AA024835 
107914 AA027229 
107935 AA029428 
410196 AA035143 
131461 AA035237 
108007 AA039347 
108029 AA040740 
108040 AA041551 
108084 AA045513 
108088 AA045745 
108168 AA055348 
130719 AA056582 

108189 AA056697 

108190 AA056746 
108203 AA057678 

108216 AA058681 

108217 AA058686 
108245 AA062840 
108277 AA064S59 
108280 AA065O69 
108309 AA069923 
108340 AA070815 
108403 AA075374 
108427 AA076382 
108435 AA078787 
108439 M078986 



AA478109 Hs.188833 

AA330310 Hs.24181 

061216 Hs.18672 

AB037744 Hs.34892 

AA449014 Hs.121025 

W58353 Ks^85123 

BE539839 Hs.173030 

A1926520 Hs.31016 

AW021276 Hs.17121 

A1076459 Hs.15978 

BE614410 Hs.23044 

AL122043 Hs.19221 

BE379623 H8.27693 

AB033106 Hs.12913 

AA137043 Hs.9663 

BE122762 Ha.25338 

AW513087 Hs.16803 

AA836401 Hs.87860 

AW504732 Ha.21275 

AU080235 Hs.35861 

AA357879 Ha.29423 

AF088886 Hs.11590 

AA186629 H&80120 

BE277457 Hs.30661 

AA316241 Hs.90691 

T63174 Hs.193700 

AW959891 Hs-76591 

T93597 Hs.187429 

U48263 Hs.89040 

U49065 H3.102865 

AW003668 Hs.135587 

BE011845 Hs.251064 

A1038596 Hs.250745 

D86983 Hs.118893 

AL157433 Hs.37706 

W26853 Hs.291003 

W27965 Hs.99865 

AF037448 Hs.155489 

W47063 Hs.94668 

BE262677 H3.283558 

AB028981 Hs.8021 
NM.003542H3.46423 

X78931 Hs.99971 
NMJ)03403Hs.97496 

AA002147 Hs.59952 

R75654 Hs.164797 

AA010383 Hs.60389 

AA015761 Hs.60642 

AW008846 Hs.60857 
AA021473 

AW732573 Hs.47584 

AA027229 Hs.61329 

AA029428 Hs.61555 

AB36442 Hs.59833 

AA992341 H&27263 

AA039347 Hs.61916 

AA040740 H3.62007 

AL121031 Hs.159971 

AA056944 Hs.1 16602 

AA045745 Hs.62886 

AI453137 Hs.63176 

AA679262 Hs.14235 

AW376061 Hs.63335 

AA056746 Hs.63338 

AW847814 Hs.289005 

AA524743 Ks.44883 

AA056666 Ha .62583 

BE410285 Hs.89545 
AA064859 
AA065069 
AA069818 

AA069820 Hs.180909 

AA075374 

AA076382 

T82427 Hs.194101 
AA078986 



ESTs 
ESTfl 
ESTs 

K1AA1 323 protein 

chromosome 1 1 open reading frame 5 
Homo sapiens mRNA full length insert cDN 
ESTs, WeaWy similar to ALU8 JiUMAN ALU S 
putative ONA Undhg protein 
ESTs 

KIM1272 protein 

RAD51 (S. cerevfelae) Nemo log (EooflRe 
hypodudcal protein DKFZp566G1424 

■ i i.i till, il... nt I In nmrn-n nn /_ .>J..~VRT~\ I 

pepuoyipforyi comerase icycropruunjH 
WAA128Q protein 

programmed oeD death 6-lrtteractlng prot 
ESTs 

LUC7(S.cerevfelae^te 
ESTs 

hypothetical protein RJ1101 1 

OKF2P586E1621 protein 

ESTs; WeaMy similar b macrophage (ecfJ 

cafhepsinF 

UDP^cetyl^pha^aiactDsarT*e:po}yp 
hypothetical protein MGC4606 

Homo sapiens mRNA: cDNA OKFZp586l0324 (f 

WAA0887 protein 

ESTs 

prepronodcepfin 
tnterieukln 1 receptor-like 2 
Human done 23629 mRNA sequence 
high-mobfiity group {nonhistone chromoso 
polymerase (RNA) III (DNA directed) (62k 
Melanoma associated gene 
hypothetical protein DKFZp434E2220 
hypothetical protein MGC4707 
eplmorphln 

NS1-asso elated protein 1 
ESTs 

hypothetical proteh PR01 855 

WAA1058 protein 

H4 hlstone family, member G 

zinc finger protein 272 

YY1 transcription factor 

EST 

hypothetical protein RJ13693 

ESTs 

ESTs 

ESTs 

gb:ze66c1 1 .si Soares retina N2MHR Homo 
potassium voltage-gated channel, delayed 
ESTs, Weakly similar to T16370 hypottietl 
ESTs 

hypothetical protein FU10808 

WAA1458 protein 

EST 

ESTs 

SW1/SNF related, matrix associated, ad 
Homo sapiens, done IMAGE: 41 54008, mRNA, 
ESTs 
ESTs 

hypothetical protein FU20008; KIAA1839 
ESTs, Moderately similar to A46010 X-fin 
EST 

Homo sapiens cDNA: FU21532 fis, clone C 

ESTs 

ESTs 

proteasome (prosome, macropain) subunft, 
gb2m50f03.s1 Stratagem fibroblast (937 
gb2m12e1U1 Stratagene pancreas (93720 
gb2m67e03.r1 Stratagene rtairoepttheflum 
peroxtredoxln 1 

gb:zm87a01 M Stratagene ovarian cancer 
gb:zm91g08.s1 Stratagene ovarian cancer 
Homo sapiens cDNA FU20869 fis, clone A 
gb:zm92h0U1 Stratagene ovarian cancer 
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108465 AA079393 
108469 AA079487 

108500 AA083207 

108501 AA083256 
108533 AA084415 
108562 AA0S5274 
108589 AA0B8678 
130890 AA100925 
432645 AA101255 
130385 AA126474 
108749 AA127017 

108807 AA129968 

108808 AA130240 
108833 AA131866 
108846 AA132983 
108857 AA1 33250 
131474 AA133583 
108894 AA135941 
106941 AA148650 
108968 AA151110 
108998 AA155754 
109001 AA156125 
131183 AA156289 
109019 AA156997 

109022 AA157291 

109023 AA157293 
109068 AA164293 
109072 M164676 
426981 M167375 
130346 AA167550 
109146 AA176589 
109172 AA180448 
428438 AA187144 
129208 AA189170 
109222 AA192757 
109300 AA205650 
109481 AA233342 
109485 AA233472 
109516 AA234110 
109537 D80981 
109556 F01660 

109577 F02206 

109578 F02208 
109595 F02544 
109625 F03918 
426376 F04258 
109648 F04600 
109671 F08998 
109699 (=09605 
109820 F11115 
109933 H06371 
110014 H10995 
110039 H11938 
110099 H16568 
110107 H16772 
110155 H1B951 
110197 H20859 
110223 H23747 
110306 K38087 
110335 H40331 
110342 H40567 
110395 H46966 
110511 H56640 
110523 H57154 
110715 H96712 
110754 (420814 
428454 N25249 
431663 N27100 
134263 N39616 
110938 N48982 
110983 N51957 
111081 N59435 
111128 N64139 
431548 N66981 
111216 N68640 
437562 N69352 



AA079393 Hs.3462 
AA079487 

AA033207 Hs.68270 
AA083256 
AA084415 
AA100796 

AI732404 Ha.68846 
A1507537 Hs.76698 
D14041 Hs.347340 
AW067800 Hs.155223 
AA127017 Hs.71052 
A1652236 Hs.49376 
AA045088 Hs.62738 
AF188527 Ha.61661 
AL117452 H$.44155 
AK001458 H3.621S0 
L46353 Hs.2728 
AK001431 Hs-5105 
AA148650 

AI304870 Ks.188680 
AW995610 Hs.332436 
AI056548 Hs.72116 
AI611607 Hs.285107 
AA156755 Hs.72150 
AA157291 Hs^1479 
AA157293 Hs.72168 
AA164293 Hs.72545 
AT732585 Hs.22394 
ALQ44675 Hs.173081 
H05769 H8.188757 
AA176589 Hs.142078 
AA180448 Hs.144300 
NM_001955Hs.2271 
AI587376 Hs.109441 
AA192833 Hs.333512 
AA418276 Hs.170142 
AA878923 HS.289G69 
BE619092 Hs.28465 
AW71639 Hs.71913 
AI858695 Hs.34898 
AI925294 Hs.87385 
F02206 Hs.296639 
FO2208 Hs.27214 
AA078629 Hs.27301 
H29490 Hs.22697 
AF119665 Hs.184011 
H 17800 Hs.7154 
R59210 Hs.26634 
H18013 Hs.167483 
AW016809 Hs.119021 
R52417 Hs.20945 
AL109666 Hs.7242 
H11938 Hs.21907 
R44557 Hs.23748 
AW151660 Hs.31444 
M559626 Hs.93522 
AW090386 Hs.1 12278 
H19836 HSJ1697 
H38087 Hs.105509 
H65490 Hs.18845 
H40961 Hs.33008 
AA025116 HSJ3333 
K56640 Hs321460 
AI040384 Hs,19102 
H96712 Hs.269029 
AW3Q2200 Hs.6336 
U55936 Hs.184376 
NM_016569Hs.267182 
AW973443 Hs.8086 
N48982 Hs.38034 
NM_015367Hs.1O267 
AI146349 Hs.271614 
AW505364 Hs. 19074 
AI834273 Hs.9711 
AW139408 Ha152940 
AB001636 Hs.5683 



cytochrome c oxidase subunlt Vlic 
gb2m97fD8.8l Stratagene colon KT29 (937 
EST 

gfczn08g12*1 Stratagem hKTT neuron (937 
gb2n08g09 j1 Stratagene hNT neuron (937 
gb2m26c0&s1 Stratagene pancreas (93720 
ESTs 

stres3-assodatfid endoplasmic reflcuium 
H-2K binding tactor-2 
8tannbcaldn2 
ESTs 

hypothetical protein RJ20644 
ESTa 

ESTs, WeaWysimCarto AF174605 1 F4x» 

OKFZP586G1517 protein 

anfflin (Drosophfla Scraps homoiog), ad 

higtwnobiMy group (nonhistone chramoso 

hypothetical protein FU10569 

gbao09e06^1 Stratagene rreuroepitheBum 

ESTs 

EST 

hypothetical protein FU20992 similar to 
hypothetical protein FU 1 3397 
ESTs 

ubinudeln 1 

ESTs 

ESTs 

hypofoeticai protein FU10893 
WAA0530 protein 

Homo sapiens, done MGC5564, mRNA, comp 

EST 

EST 

endofheQn 1 
MSTP033 protein 
similar to rat rnyornegalin 
ESTs 

hypothetical protein RJ21016 

Homo sapiens cONA: FU21869 fis, done H 

ESTs 

ESTs 

ESTs 

Homo sapiens potassium channel subunit ( 

ESTs 

ESTs 

ESTs 

pyrophosphatase (Inorganic) 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens done 24993 mRNA sequence 

Homo sapiens mRNA full length insert cDN 

hfstone acetyitransferase 

ESTs 

ESTs 

Homo sapiens mRNA for KIAA1647 protein, 

arrestin, beta 1 

ESTs 

CTL2gene 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to organic anion tr 
ESTs 

WAA0672 gene product 
synaptosomakssodated protein, 23 kD 
TBX3-bo protein 

RNA (guanhe-7-) methyttransferase 
Homo sapiens cDNA RJ12924 fe, done NT 
M1L1 protein 
CGI-1 12 protein 

LATS {large tumor suppressor, Drosophlta 

novel protein 

ESTs 

DEAD/H (Asp-GiiMla-AspnHis) box polypep 
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111399 R00138 


AW270776 Hs.18857 


111514 R07998 


R07998 




428744 R0S929 


BE267033 Hs.192853 


111574 RKK07 


AI024145 


Ks.188526 


111804 R33354 


AA482478 Hs.181785 


111831 R36083 


R36095 


Hs368695 


426773 R37938 


NM.015556HS.172180 


111904 R39330 


Z41572 




428371 R40816 


AB012193 Hs.183874 


112033 R43162 


R49031 


Hs22627 


130987 R45698 


BE613269 


HsJM893 


112300 R54554 


H24334 


Hs.26125 


112513 R68425 


R68425 


Hs.13809 


112514 R68568 


R68568 


Hs.183373 


112522 R68763 


R68857 


Hs.265499 


112540 R70467 


R69751 




428655 R73565 


K05769 


Hs.188757 


129534 R73640 


AK002126 


Hs.11260 


112597 R78376 


R7B376 


Hs^9733 


112732 R92453 


R92453 


Hs34590 


451798 T03865 


BE297567 


H127047 


112888 1103672 


AW195317 Ha 107716 


131863 T10072 


AI656378 


Hs33461 


112911 T10080 


AW732747 Hs.13493 


132215 T10132 


AL035703 


Hs.4238 


112931 T15343 


T02966 


Hs.167428 


112984 T23457 


T16971 


Hs.289014 


112998 T23555 


H11257 


Hs.22968 


133376 T23670 


BE618768 Hs.7232 


113026 T23348 


AA376654 




113070 T33464 


AB032977 


Hs.6298 


410781 T34413 


A1375672 


Hs.165028 


113074 T34611 


AK001335 


Hs^1137 


113095 T40920 


AA828380 Hs.126733 


113179 T55182 


BE622021 


Hs.152571 


113337 T77453 


T77453 


Hs.302234 


113421 T84039 


A1769400 


Hs.189729 


113454 T86458 


AI022166 


Hs.16188 


113481 T87693 


T87693 


Hs.204327 


453345 T89350 


AA302862 


Hs.90063 


113557 T90945 


H66470 


Hs.16004 


113559 T90987 


T79763 


Hs.14514 


113589 T91863 


AI078554 


Hs.15682 


113591 T91881 


T91881 


Hs.200597 


113619 T93783 


R08665 


Hs.17244 


113683 T96687 


AB035335 


Hs.144519 


113692 T96944 


AL360t43 


Hs.17936 


113702 T97307 


T97307 




113717 T97764 


T99513 


Hs.187447 


113824 W48817 


AI631954 


H&34447 


113840 W58343 


R72137 


Hs.7949 


113844 W59949 


AI369275 


Hs243010 


113902 W74644 


AA340111 


Hs.100009 


113904 W74761 


AF125044 


Hs.19196 


113905 W74802 


R81733 


Hs.33106 


113931 W81205 


BE255499 


Ks.3498 


113932 W81237 


AA256444 


Hs. 126485 


131965 W90146 


W79283 


Hs.35962 


114035 W92798 


W92798 


Hs.269181 


114106 Z38412 


AW602528 




457308 Z38709 


AI416988 


Ha.238272 


114161 Z38904 


BE548222 Hs.299883 


424949 Z39103 


AF052212 Hs.153934 


457548 Z39930 


AW069534 H&279583 


128937 Z39939 


AA251380 Hs.10726 


432554 Z40012 


AI479813 


Hs.278411 


114277 Z40377 


AI052229 


Hs.25373 


114304 Z40820 


AI934204 


Hs.16129 


114364 Z41680 


AL117427 


Hs.172778 


432620 AA005112 


AA777749 


Hs^978 


129034 AA005432 


AA481157 Hs.108110 


131881 AA010163 


AW361016 HSJ383 


332421 AA026356 


A1909968 


Hs.108108 


114465 AA026901 


8E621056 


Hs.131731 


451271 AA036867 


AK001644 HS26156 


332498 AA044644 


AA303661 





ESTs 

gb7f16g1U1 Scares fetal Over spleen 

ubSquI&vconjugaCng enzyme E2G 2 (homo 

ESTs 

ESTs 

ESTs 

K1AA0440 protein 

gb:HSCZYB122 normalized infant brain cON 

cuflin4A 

ESTs 

hypothetical protein DKFZp761N0624 
ESTs 

hypothetical protein RJ 10648 

src homology 3 domain-containing protein 

ESTs 

gfcyM0a10.s1 Scares placenta Nb2HP Homo 

Homo sapiens, done MGC5564, mRNA, comp 

hypofiw8catprotehRJ11264 

EST 

ESTs 

hypofoetfcal protein FU20392 
hypothetical protein RJ 22344 
ESTs 

like mouse brain protein E46 
WAA0478 gene product 
ESTs 

ESTs, Weakly similar to A43932 mucin 2 p 
Homo sapiens clone IMAGE:451939, mRNA se 
acetyl-Coenzyme A carboxylase afcha 
eukaryotlc translation Inttiafion factor 
WAA1151 protein 
ESTs 

protein tyrosine phosphatase, receptor t 
ESTs 

ESTs, Highly similar to IGF-tt mRNA-Wnd 

ESTs 

ESTs 

ESTs 

EST 

neurocakin delta 
ESTs 
ESTs 
ESTs 

KIAA0563 gene product 
hypothetical protein FU13605 
T-ceS leukemiayiyrnphoma 6 
DKF2P434H132 protein 
gb:ye53h05.$1 Soares fetal Dver spleen 
ESTs 
ESTs 

OKFZP586B2420 protein 

Homo sapiens cDNA FU14445 fis, clone HE 

acyt-Coenzyme A oxidase 1 , palmitoyl 

ublquffirwonjugafing enzyme HBUCE1 

ESTs 

hypothetical protein MGC15749 
hypothetical protein aJ12604; WAA1692 
ESTs 
ESTs 

gb:RC5-B70562-260100-01 1-A02 BT0562 Homo 
inositol 1,4,5-tnphosphato receptor, ty 
hypothetical protein FU23399 
core-binding factor, runt domain, alpha 
CGM1 protein 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

NCK-asscctatsd protein 1 

ESTs, Weakly stmflar to T2041 0 hypotietl 

ESTs 

Homo sapiens mRNA; cDNA 0KFZp566P013 (tr 
UM domain only 7 
DKFZP547E2110 protein 
upstream regulatory element binding prot 
tra/iscnpflon factor 
hypoaietteal protein FU11099 
hypofliefcal protein FU 10782 
lyrrtphocyte-specfflc protein 1 
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431555 AAD46426 
132944 AA054515 
114618 AA084162 
332509 AA085749 
5 114648 AA101056 
114858 AA102746 
132455 AA1 14250 
450847 AA126561 
132225 AA128980 

10 437197 AA129757 
114709 AA129921 
456926 AA133331 
114750 M135958 
426806 AA136524 

15 114763 AA147044 
114767 AA148885 
114774 AA150043 
129388 AA151621 
457742 AA155743 

20 456200 AA156335 
130207 AA156336 
114798 AA159181 
114800 AA159825 
114828 AA234185 

25 114846 AA234929 
114848 AA234935 
114902 AA236359 
132271 AA236466 
114907 AA236535 

30 420170 AA236935 
132204 AA236942 
114928 AA237018 
132481 AA237025 
114932 AA242751 

35 314162 AA242760 
131006 AA242763 
114935 AA242809 
406908 AA243133 
437754 AA243495 

40 114957 AA243706 
114974 AA250848 
114977 AA250868 
114995 AA251152 
115005 AA251544 

45 417177 AA251792 
115026 AA252144 
115045 AA252524 
115068 AA253461 
133138 AA255522 

50 332668 AA255522 
115114 AA256468 
129584 AA256528 
115137 AA257976 
417187 AA258296 

55 115166 AA258409 
115167 AA258421 
436719 AA262077 
115239 AA276650 
115243 AA278766 

60 428419 AA280791 
115322 AA28C819 
413303 AA280828 
115372 AA282195 
409962 AA283127 

65 130269 AA284694 
456570 AA291137 
332675 AA291708 
407864 AA293495 
115536 AA347193 

70 408799 AA398474 
115575 AA398512 
115601 AM00277 
434428 AA40O8S6 
115683 AA410345 

75 115715 AA416733 
132952 AA425154 



AJ815470 Hs.260024 

T96641 Hs.6127 

AW979261 Hs.291993 

AA128376 Hs.153884 
AA101056 

AA102383 Hs^49190 

AB011084 Hs.48924 
NM_003155Hs^5590 
AA128980 
W38586 

AA397651 Hs301959 

AB018284 Hs.158688 

AA887211 Hs.129467 

T19228 Hs.172572 

AA810755 Hs.102500 

AJ859865 Hs.1 54443 

AV656017 Hs.184325 

AA662477 H3.110984 

BE561824 Hl273369 

AA768242 Hs.80618 

AF044209 Hs.144904 

AA159181 Hs.54900 

Z19448 Hs.131887 

AA252937 Hs.283522 

BE018682 Hs.168196 

BE614347 Hs.168615 

AW275480 Hs.39504 

AB030034 Hs.1 15175 

N29390 Hs.13804 

U43374 Hs.95631 

AA235627 Hs.42265 

AA237018 Hs.94869 

W93378 Hs.49614 

AA971436 Hs.16218 

BHM1820 Hs.38516 

AF064104 Hs.22116 

H23329 H&290880 

BE296227 Hs.250822 

R60366 Hs.5822 

AW17042S Hs.87680 

AW966931 Hs.302649 

AW296978 Hs.87787 

AA769266 Hs.1 93657 

AI760825 Hs.153042 
NM.004458HS.81452 

AA251972 Hs,188718 

AW014549 Hs,58373 

AW512260 Hs. 87767 

AV557594 Hs.181161 

AV657594 Hs.181161 

AA527548 Hs.7527 

AV656017 Hs.184325 

AW968304 Hs.56155 

AB011151 HsJ34659 

AF095727 Hs.287832 

AA749209 Hs.43728 

Y11192 Hs.5299 

BE251328 Hs.73291 

AA806600 Hs.1 16665 
U49436 

L08595 Hs.78995 

AW836130 Hs.75277 

AW014385 Hs.88678 

U82671 HsJ7698 

F05422 Hs.168352 

AA286914 Hs.1 83239 
BE439944 

AF069291 Hs.40539 

AK001468 Hs.62160 

AA059412 Hs.47986 

AA393254 Hs.43619 

AA148984 Hs.48849 

D14540 Hs.199160 

AF255910 Hs.54650 

BE395161 Hs.1390 

AI658580 Hs.61426 



Cdc42 eflectry protein 3 

Homo sapiens cONA: RJ23020 fis, done L 

ESTs 

ATP binding protein associated wtfli cell 
gb:zn25b03s1 Stratagene neuroepithellum 
tumor necrosis factor receptor superferrd 
WAA0512 gene product ALEX2 
. stannbcaUn 1 

gb2o09a11s1 Stratagene neuroepithellum 
guanfrie nudeotkte binding protein (G pr 
proline synthetase co-transcribed (bacte 
WAA0741 gene product 
ESTs 

hypothetical protein FLJ20093 
hypothetical protein d!511E16^ 
minkiirorncsorre maintenance deficient (S. 
CGt-76 protein 

hypothetical protein FU23471 

uncharacterized hematopoietic stem/proge 

hypothetical protein 

nuclear receptor co-repressor 1 

serologically defined colon cancer an tig 

ESTs, Weakly slmflar to T24396 hypoihstl 

Homo saptens mRNA; cDNA DKFZp434J1912 (f 

ATPase, Class I, type 8B, member 1 

hypothetical protein RJ20989 

hypotheflcal protein MGC4308 

sterile-alpha motif and leucine zipper c 

hypothetical protein dJ462023.2 

Human normal keratinocyte mRNA 

ESTs 

ESTs 

ESTs 

WAA0903 protein 

Homo sapiens, done MGC:15887, mRNA, com 

COC14 (cell division cycle 14, S. cerevl 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

senne/threonine kinase 15 

Homo sapiens cONA: FU22120 fis, done H 

ESTs 

nudeosome assembly protein 1-tke 1 

ESTs 

ESTs 

ESTs 

fatfy-acid-Coenzyme A Dgase, long-chain 

ESTs 

ESTs 

ESTs 

Homo sapiens cONA RJ14643 fis, done NT 
ESTs 

smaO fragment nuclease 
CGi-76 protein 
ESTs 

hypothetical protein MGC14139 
myelin protein zero-Ilka 1 
hypothetical protein 

aldehyde dehydrogenase 5 family, member 
hypothetical protein OJ 10881 
WAA1842 protein 
WAA1856 protein 

MADS box transcription enhancer factor 2 
hypothetical protein FU 13910 
ESTs, Weakly slmflar to Unknown [Ksaple 
Target CAT 

nudeoporin-Cke protein 1 

ESTs 

ESTs 

chromosome 6 open reading frame 1 
anflltn (OrosophBa Scraps homo log), act 
hypoftefical protein MGC10940 
ESTs 

ESTs, Weakty similar to AIU4_HUMAN ALU S 
myeloid/lymphoid or mixed-lineage leukem 
|unc8onal adhesion motecufe 2 
proteasome (prosome, macropaln) subunft, 
Homo sapiens mesenchymal stem cell prote 
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115819 AA426573 
409124 AA431418 
115895 AA438182 
458073 AA437099 
115982 AA448585 
115967 AA448887 
115974 AA447224 
115985 AA447709 
129254 AM53624 
446730 AA455044 
116095 AA456045 
426856 AA460454 
116210 AA476494 
116213 AA476738 
432645 AA481422 
116265 AA48259S 
129334 AA485084 
116274 AA485431 
426002 AA489638 
116331 AA491000 
116333 AA491250 
132994 AA505133 
418538 AA598447 
116391 AA599243 
116394 AA599574 
134531 AA600153 
116417 AA609309 
116429 AA609710 
116439 AA610068 
116459 AA621399 
427505 AA621752 
409633 C21523 
116541 D12160 
132557 D19708 
414964 025801 
116571 D45652 
451522 060208 
421919 080504 
116643 F03010 
116661 F04247 
116715 F10966 
116729 F13700 
318709 HO5063 
418999 H16758 
116773 H17315 
116780 H22566 
453884 H48459 
116819 H53073 
427278 H56559 
407833 H57957 

116844 H64938 

116845 H64973 
116892 H69535 
116925 H73110 
116981 H81783 
453133 H86259 
117031 H88353 
117034 H88639 
431129 H88675 
417861 H93708 
117280 N22107 
117344 N24046 
117422 N27028 
117475 N30205 
117487 N30621 
117937 N33258 
130207 N33258 
117549 N33390 
117683 N40180 
117710 N45198 
117791 N48325 
117822 N48913 
422544 N49394 
117895 N50656 
452259 N50721 
133057 N53143 



AA486620 Hs.41135 
AW292809 Hs£0727 
AB033035 Ks.51965 
AA192669 Hs.45032 
AI636361 Hs.179520 
AI745379 Hs.42911 
BE513442 Hs.238944 
AA447709 H&268115 
AA252468 Hs.1098 
BE384932 Hs.64313 
AA043429 Hs.62618 
R19768 HM72788 
8E622792 Hs.172788 
AA292105 Hs.326740 
014041 Hs.347340 
BE297412 Hs.55189 
AW157022 Hs.343551 
AI129767 Hs.182874 
BE514376 Hs.165998 
N41300 Hs.71616 
AF155827 Hs.203963 
AA112748 H1279905 
BE244323 Hs.85951 
T86558 Hs.75113 
NM_006033Hs.65370 
A1742845 Hs.110713 
AW499664 

AF191018 Hs.279923 
AA251594 Hs.43913 
R80137 Hs.302738 
AA361562 Hs.178761 
AW449822 Hs.55200 
D12160 Hs.249212 
AA1 14926 Hs.1 69531 
AA337548 Hs.333402 
D45652 H3211604 
BE565617 HsJ26498 
AJ224901 Hs.109526 
A1367044 Hs, 153638 
R61504 

AL1 17440 Hs.170263 
BE549407 Hs.1 15823 
R52576 H3.285280 
NMJ00121HS-89548 
AJ623410 Hs.343581 
H22566 H3.63931 
AA355925 Hs.36232 
H53073 Hs.93698 
AL031428 Hs.174174 
AW955632 Hs.66666 
H64938 Ha.337434 
AA649530 Hs.348148 
AI573283 Hs.38458 
H73110 HS360603 
N29218 Hs.40290 
AC005757 Hs^1809 
H88353 Hs.347265 
U72209 

AL137751 H5L263671 
AA334551 



M18217 
R 19085 
AI355562 
N30205 
N30621 



Hs.172129 
83^10708 
Ha.43880 
Hs.93740 
Hs.44203 



AF044209 Ha. 144904 

AF044209 Hs.1 44904 

N33390 Hs.44483 
N40180 

N45198 Hs.47248 

N48325 Ks.93956 

AA706282 Hs.93963 

AB018259 Hs.1 18140 
AW450348 Hs.93998 

AA317439 Hs.28707 

AA465131 Hs.64001 



endomudn-2 

N-acetylglucosamlnldase, alpha- (Sanfffi 

NAA1209 protein 

ESTs 

hypothetical protein MGC10702 
ESTs 

hypothetical protein FU10631 

ESTs. WeaWysimte/toT08599 probable 

DKFZp434J1813 protein 

ESTs, WeaWy similar to AF257182 1 G-pro 

ESTs 

ALEX3 proton 
ALEX3 protein 

hypothetkal protein MGC10947 

H-2K binding factor-2 

hypothetical protein 

hypotheflcal protein RJ22584 

guanine nucleotide binding protein (G pr 

PAM mRNA-btrafing protein 

Homo sapiens mRNA; cONA OKFZp586N1720 (f 

hypothetical protein FU 10339 

clone HQ0310PRO031Op1 

exportin, tRNA (nuctear export receptor 

general transcription factor IIIA 

lipase, endothelial 

DEK oncogene (DNA binding) 

Human done 23826 mRNA sequence 

putative nucleotide binding protein, est 

PIBF1 gene product 

Homo sapiens cDNA: FU21425 fis, done C 
26S proteasome-assodated pad1 hcmobg 
ESTs 

polymerase (RNA) ill (DNA directed) (155 
ESTs 

hypoftefical protein M6C12760 

gb:HUMGS02848 Human adult lung $ direct 

hypofteficai protein FU21857 

zinc linger protein 198 

myetoM/rymphold or mixed-Cneage leukem 

gb:yti16a03.s1 Scares infant brain 1NIB H 

tumor protein p53-binding protein, 1 

riboraidease P. 40kD subunit 

Homo sapiens cONA* FU22096 fis, clone H 

eryftropoie&i receptor 

kajyopherfn alpha 1 (importin alpha 5) 

ESTs 

K1AA0186 gene product 
EST 

KIAA0601 protein 

ESTs, WeaMy similar to S19560 proCne-r 
ESTs, Weakly similar to A46010 X-llnked 
gb:ns44fD5.s1 NCt_CGAP_Ahr1 Homo sapiens 
ESTs 

ESTs, Moderately similar to A47582 B-cel 
ESTs 

hypotheficaJ protein 

gb;yw21a02.s1 Morton Fetal Cochlea Homo 

YY1-assodated factor 2 

Homo sapiens mRNA; cONA DKF2p434l0812 (f 

sperm specific antigen 2 

Homo sapiens cONA: FU21409 fis, done C 

Homo sapiens cONA FU13182 fis, done NT 

ESTs. Weakly similar to A46010 Xtnted 

ESTs, Weakly similar to 138022 hypotheti 

ESTs 

nuclear receptor co-repressor 1 
nuclear receptor co-repressor 1 
EST 

gb:yy44d02.s1 SoaresjrtulffptejJderosfs. 
ESTs, Highly similar to similar to Cdc14 
EST 
ESTs 

WAA0716 gene product 
ESTs, Highly similar to SORUiUMAN SORT1 
signal sequence receptor, gamma (transk) 
Homo sapiens clone 2521 8 mRNA sequence 
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118103 N55326 
118111 N55493 
118129 N57493 
118278 N6295S 
118329 N 63520 
118336 N63604 
417098 N64166 

118363 N64168 

118364 N64191 
118475 N66845 
118491 N67135 
118500 N57295 
118584 N68963 
456647 N69331 
118661 N70777 
118684 N71364 
118689 N71545 
11B690 N71571 
118766 N74456 
118793 N75594 
118817 N79035 
118844 N80279 
118919 N91797 
129558 N92454 
407604 N94581. 
118996 N94746 
119021 N98238 
119039 R02384 
119063 R16833 
332622 R41828 
119111 R43203 
415115 R46395 
119146 R58863 
449224 R78248 
119239 T11483 
119281 T16898 
119298 T23820 
126502 T30222 
419983 W15275 
119558 W38194 
429641 W42414 
419445 W49632 
119650 W57613 
119654 W57759 
119683 W61118 
119694 W55344 
119718 W69216 
410365 W69379 
119938 W86728 
120128 Z38499 
120130 Z38630 
120148 Z39494 
120155 Z39623 
451979 Z40071 

120183 Z40174 

120184 Z40182 
120211 Z40904 
120245 AA166965 
120247 AA167500 
120254 AA169599 

120259 AA171724 

120260 AA171739 
120275 AA177105 
120284 AA182626 
417735 AA186324 
422137 AA192099 

120302 AA192173 

120303 AA192415 
120305 AA192553 
120319 AA194851 
408729 AA 195520 

120326 AA196300 
133145 AA19S549 

120327 AA196721 

120328 AA196979 
120340 AA206828 



AA401733 Hs.184134 

N55493 

N57493 

N62955 H3.316433 
N63520 

BE327311 Ks.47166 
AB017365 Hs.173859 
AJ183838 Hs.48938 
N46114 Hs^9169 
N66845 

AV647908 Hs.90424 
W32889 Hs.154329 
AW136928 

AI252640 Hs.110364 
AL137554 Hs.49927 
N71313 Hs.163986 
AW390601 Ks.164544 
N71571 H^269142 
N74456 Hs.50499 
N75594 Ha.285921 
AI668656 Hs.50797 
AL035364 Hs50S91 
AW452696 Hs.130760 
AW580922 Hs, 180446 
AW191962 Hs^88061 
N94746 Hs.274248 
N98238 Hs.55185 
AI160570 Hs.252097 
R16833 Hs.53106 
R10674 

T02865 Hs.328321 
AA214228 Hs.127751 
R58863 Hs.91815 
AW995911 Hs.299683 
T11483 

AI692322 Hs.65373 
NMJX)1241Hs.155478 
T10077 Hs.13453 
W55956. Ks.94030 
W38194 

AW081883 Hs.211578 
AA884471 Hs.90449 
R82342 Hs78856 
W57759 

W65379 Hs.57835 
AA041350 Hs.57847 
W69216 Hs.92848 
A1287516 

AW014862 Hs.58885 
8E379320 Hs.91448 
AA045767 Hs.5300 
F02806 Hs.65765 
Z39623 Hs.65783 
F06972 Hs.27372 
AW082866 Hs.65882 
Z40182 Hs.65885 
Z40904 Hs.66012 
AW959615 Hs.1 11045 
M167500 Hs.103939 
W90403 Hs.111054 
AW014786 Hs.192742 
AK000061 Hs.101590 
AA177105 Hs78457 
AA179656 

AA188175 Hs*2506 
AJ236885 

AA837098 Hs.269933 
AJ216292 H3.96184 
AW295096 Ks.101337 
T57776 Hs.1 81094 
AA195764 Hs72639 
AA198300 Hs.21145 
H94227 Hs.6592 
AKD00292 Hs.130732 
AA923278 Hs.290905 
AA206828 



ESTs 

gb.yv50o02.3l Scares fetal Over spleen 
gteyy54c08.s1 Soaresjnulflpla^scterosis^ 
Homo sapiens cONAFUl 1375 fis, done HE 
gb:yy62ffl1.s1 Scares jnutflple_sderosk_ 
HT021 

frizzled (DrosophBa) homotog 7 
hypothetica]pn:teinHJ21602 
hypotheticaJ protein RJ22623 
gb2a46c11.s1Soares fetal Bver spleen 
Homo sapiens cDNA: FU23285 6s, doneH 
ESTs 

gbiUW«l1-eo><W804Jb1 NCl_CGAP_Su 

peptWytpralyl bomerase C (cydophllin 

protein kinase NYD-SP15 

Homo sapiens cONA: FU22765 6s, done K 

Homo sapiens, done 1MAGE3355383. mRNA, 

ESTs 

EST 

ESTs, Moderately slmEar to T47135 hypot 
ESTs 

hypothetical protein 

myosin phosphatase, target subunlt 2 

karyophertn (importin) beta 1 

cofiagen, type VIII, alpha 2 

hypothetical protein FU20758 

ESTs 

pregnancy specific beta-1 -glycoprotein 6 
ESTs, Moderately sfmllar to ALU 1 .HUMAN A 
CSR1 protein 
EST 

hypothetical protein 
ESTs 

hypothetical protein FU23399 
gb:CHR90049 Chromosome 9 exon Homosapie 
ESTs, Weakly similar to T02345 hypotheti 
cyc0nT2 

hypothetical protein FU14753 
Homo sapiens mRNA; cONA DKFZp586E1624 (f 
Empirically selected from AFFX single pr 
Homo sapiens cDNA: FU23037 ffe, done I 
Human clone 23908 mRNA sequence 
ESTs, Weakly similar to S65657 ataha-1C- 
gbad20g1U1 Soares_fetaLneartNbHH19W 
ESTs 

ESTs, Moderately similar to ICE4_HU.\WN C 
ESTs 

Homo sapiens mRNA; cDNA OKFZp586D0923 (f 
ESTs 

MKP-1 Gke protein tyrosine phosphatase 
bladder cancer associated protein 
ESTs 
ESTs 

BMX non-receptor tyrosine kinase 

ESTs 

EST 

EST 

ESTs 

EST 

ESTs 

hypothetical protein FU12785 

hypothetical protein 

solute carrier family 25 (mitochondrial 

gb2p54e1 1 ,s1 Stratagene NT2 neuronal pr 

WAA1254 protein 

zinc finger protein 148 (pHZ-52) 

ESTs 

ESTs 

uncoupling protein 3 (mitochondrial, pro 

ESTs 

ESTs 

hypothetical protein RG083M052 

Homo sapiens, done IMAGE2961368, mRNA, 

hypofiietical protein FU20285 

ESTs, Weakly similar to protease (H.sapl 

gb2q80b08j1 Stratagene hNT neuron (937 
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417122 AA207123 
131522 M214539 
421787 AA226914 

120375 AA227260 

120376 AA227469 
120390 AA233122 
410804 AA233334 
434223 AA233347 
312771 AA233714 
120396 AA233796 
120409 AA235050 
120414 AA235704 
120420 AA236031 

120422 AA236352 
419326 AA236390 

120423 AA236453 
120435 AA243370 
120453 AA250947 

120455 AA251083 

120456 AA251113 
120473 AA251973 
128922 AA252023 
120477 AA252414 
120479 AA252650 
120488 AA255523 
120510 AA258128 

120527 AA262105 

120528 AA262107 

120529 AA262235 
120541 AA278298 
120544 AA278721 
120562 AA280036 
120569 AA260648 
120571 AA280738 
"120572 AA280794 
129434 AA280837 
130529 AA280886 
120575 AA280934 
409339 AA281535 
120591 AA281797 
120593 AA282047 
430275 AA283002 
440303 AA283709 
120609 AA283902 
409702 AA284108 
456870 AA284109 
132614 AA284371 
458750 AA284744 
135376 AA284784 
120621 AA284840 
452279 AA266844 
332484 AA287032 
120644 AA287036 

120660 AA287546 
135370 AA287553 

120661 AA287556 
429828 AA287564 
452291 AA291015 
120699 AA291716 
100690 AA291749 
120726 AA293656 
120737 AA302430 
120745 AA302809 
443574 AA302820 
120750 AA310499 
120761 AA321890 

120768 AA340589 

120769 AA340622 
135232 AA342457 
120793 AA342864 
120796 AA342973 
120809 AA346495 
332633 AA347573 
120825 AA347614 
120827 AA347717 
120839 AA348913 



At906291 Hs.81234 
A1380040 Hs239489 
AA227068 Hs.108301 
AF028706 Hs.111227 
AA227469 

AA837093 Hs.111460 
U64820 Ha.66521 
A1825842 Hs^776 
AA018515 Hjl264482 
AA134008 Ks.79306 
AA235050 

AW137156 Hs.181202 
AI128114 Hs.112885 
AL133097 Hs31717 
W94915 Hs.42419 
AA236453 Hs. 18978 
AA243370 Hs.98450 
AA250947 Hs.170263 
AA251720 Hs.104347 
AA488750 Hs.88414 
AA251973 Hs^69988 
A1244901 H&9589 
AA252414 Hs.43141 
AF006689 Hs. 110299 
AW952916 Hs.63510 
AI796395 Hs.111377 
AA262105 Hs.4094 
A1923511 Hs.104413 
A1434823 Hs.104415 
W07318 Hs.240 
BE548277 Hs.103104 
BE244580 Hs.342307 
AA807544 H&24970 
AB037744 Hs.34892 
H39599 Hs-294008 
AW967495 Ha. 186644 
AA178953 Hs.309648 
AW978022 Hs238911 
AB020686 Hs£4037 
AF078847 Hs.191356 
AA748355 Hs. 193522 
211773 Hs.237786 
AA306166 Hs.7145 
AW978721 Hs366076 
AI752244 

AJ241084 Hs.154353 
AA284371 Hs.118064 
AA115496 Hs336898 
BE617856 Hs.99756 
AW961294 Hs.143818 
AA286844 Hs.61260 
AW172431 Hs.13012 
AI869129 Hs.96616 
AA286785 H3.99677 
6E622187 Hs.99670 
AA287556 Hs263412 
AB019494 Hs.225767 
AF015592 Hs28853 
AI683243 Hs.97258 
AA383256 Hs.1657 
AA293655 Hs21t98 
AL049176 Hs.82223 
AA302809 

U83993 Hs.321709 
AJ191410 Hs.96693 
AA321890 

AA340589 Hi104560 
AI769467 Hs.9475 
AL038812 Hs.96800 
AA342864 Ks.96812 
AI247356 Hs.96820 
AA346495 

AL120071 Hs.48998 
AJ280215 Hs.96885 
AA382525 Hs. 132967 
AA348913 



immunoglobulin superfamlly, member 3 
TIA1 cytotoxic granutenassodated RNA-W 
nuclear receptor subfam2y 2, group C, m 
Zlc family mentor 3 (odc^paired Drosophl 
gb:zr18307^1 Stratagene KT2 neuronal pr 
caldiffrVcataodufin-o^pendent protein kin 
Machado-Joseph disease (splnocerebeflar 
zfncftigerprotebi216 

Homo sapiens mfiNA; cDNA OKFZp761A041 1 (f 
eukaryotic translation initiation factor 
gtns38e04.s1 Soares_NhHMPu_S1 Homo sapi 
hypothetical protein RJ 10038 
spinal cord-derived growth factor-B 
hypothetical protein DKF2p434N1928 
ESTs 

Homo sapiens cDNA: RJ22822 fis, clone K 
EST 

tumor protein p53*bfcidlng protein, 1 
ESTs, Weakly similar to ALUC_HUMAN 111! 
BTB end CNC homology 1, basic leucine zi 
ESTs 
ub iqullinl 

OKFZP727C091 pratetn 
mitogejvadivated protein kinase kinase 
KIAA0141 gerte product 
ESTs 

Homo sapiens cDNA FU14208 fis, done NT 

ESTs 

ESTs 

M-phase phosphopratein 1 
ESTs 

hypothetical protein FU 10330 

ESTs, Weakly similar to B34323 GTP-bindJ 

WAA1323 protein 

ESTs 

ESTs 

gb:zp39e03.s1 Stratagene muscle 937209 H 
hypothetical protein DKFZp762E151 1 ; WAA 
ectonudeotide pyraphosphatase/phosphodi 
general transcription factor IIH, polype 
ESTs 

zinc finger protein 187 
caJpain 7 

ESTs, Weakly similar to A46010 X-finked 

eukaryotic translation elongation factor 

nonselective sodium potassium/proton exc 

similar to rat nuclear ubiquitous casein 

Homo sapiens, Similar to RIKEN cDNA 1810 

mitochondrial ribosome recycling factor 

hypothetical protein FU23459 

hypothefcal protein FU13164 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to I38022 hypotheti 
ESTs, Weakly similar to ALUB JWMAN 1UI 
IDN3 protein 

COC7 {cell division cycle 7, S. cerevisi 
ESTs, Moderately similar to S29539 ribos 
estrogen receptor 1 
ESTs 

chonftttia 

gb£ST10426 Adipose tissue, white I Homo 
purtaerglc receptor P2X, Rgand-gated to 
ESTs, Moderately similar to 2109260A B c 
branched chain keto add dehydrogenase E 
EST 
ESTs 

ESTs, Moderately similar to ALU7_HUMAN A 

ESTs 

ESTs 

gb:EST52657 Fetal heart II Homo sapiens 
flbronectin leucine rich transmembrane p 
ESTs 

Human EST done 122887 mariner transposo 
gb:EST55442 Infant adrenal gland II Homo 
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120850 


AA349647 


AA349647 


Hs.96927 


Homo sapiens cDNA FU12573 fis, done NT 


120852 


AA349773 


AA349773 


Hs.191564 


ESTs 


128852 


AA350541 


R40622 


Hs. 106601 


ESTs 


135240 


AA357159 


AA357159 


HsJ6988 


EST 


120870 


AA357172 


AA357172 


Hs.292581 


ESTs, Moderately simlar to ALU1_HUMAN A 


120894 


AA370132 


AA370132 


HaJ7063 


ESTs 


435737 


AA370472 


AF22S839 


Hs.173202 


Wcappa-B-interacflng Ras-Eka protein 1 


120897 


AA37GBo7 


AA370867 


Hs.97079 


roT- » ' j . ,,i..,tii.. i. « «7/cnc 4 e 

ESTs, Moderately similar to Ar 174605 1 F 


120915 


AA377298 


All 35558 


Hs.97104 


ESTs 


120935 


AA383902 


AL0484Q9 


Hs.97177 


ESTs, Weawy stmBar to ALU1_HUMAN ALU S 


120936 


AA385934 


AA385934 


Hs.97184 


EST, Highly simitar to (defltne not aval 


120937 


AA386255 


AA3S6255 


Hi97188 


EST 


120938 


AA386260 


AA386260 


Hs. 104632 


EST 


417632 


AA386266 


R20855 


H&5422 


glycoprotein M6B 


120960 


AA398014 


AA398014 


Hs.104684 


EST 


120985 


AA398222 


AJ219896 


Hs.97592 


ESTs 


120988 


AA398235 


AA398235 


Hs.97631 


ESTs 


121008 


AA398348 


AA398348 


Hs.130546 


Human DNA sequence from done RP1 1-251 J8 


121029 


AA398482 


AA398482 


Hs.97841 


EST 


121032 


AA398504 


AA393037 


Hs. 161 798 


ESTs 


121033 


AA398505 


AA398505 


Hs.97360 


ESTs 


121034 


AA398507 


AL389951 


Hs271623 


nucieoporinSOkD 


121035 


AA398523 


AA398523 


Hs^10579 


ESTs 


121058 


AA398625 


AA398625 


Hs.97391 


ESTs 


121060 


AA398632 


AA398632 


Hs.97395 


ESTs 


121061 


AA398633 


AA393288 


Hs.97396 


ESTs 


121091 


AA398894 


AA398894 


Hs.97657 


ESTs, Moderately similar to ALU8_HUMAN A 


121092 


AA398895 


AA398895 


Hs.97658 


EST 


121094 


AA398900 


AM02505 




gb:2tfi2h10j1 Scares JesusJJHT Homo sap 


121096 


AA398904 


AA398904 


Hs.332690 


ESTs 


121115 


AA399122 


AA398187 


Hs.104682 


ESTs, Weakly slmter to iritochondrfal d 


121121 


AA399371 


AA399371 


Hs.189095 


similar to SALL1 (sa) (Drosophila)-Dke 


121122 


AA399373 


AI126713 


Hs. 192233 


ESTs, Highly simflar to T00337 hypothec 


121125 


AA399441 


AL042981 


Ha251276 


KIAA1201 protein 


121151 


AA399636 


AA339636 


Hs.143629 


ESTs 


121153 


AA399640 


AA399640 


Hs.97694 


ESTs 


121163 


AA399680 


A1676062 


Hs.111902 


ESTs 


121176 


AA400080 


AL121523 


Hs.97774 


ESTs 


121192 


AA400262 


AA400262 


Hs. 190093 


ESTs 


121223 


AA400725 


AI002110 


Hs.97169 


ESTs, Weakry simflar to dJ667H12.2.1 (U 


121227 


AA4QQ748 


AA400748 


Hs.97823 


Homo sapiens mRNA; cDNA DKFZp434D024 (fr 


121231 


AA40078O 


AA814948 


Hs.96343 


ESTs. Weakry simDar to ALUC.HUMAN Ilii 


121278 


AA401631 


AA037121 


Hs.98518 


Homo sapiens cONA FU1 1490 fis, done HE 


121279 


AA401688 


AA292873 


Hs.177996 


ESTs 


121282 


AA401695 


AA401695 


Hs.97334 


ESTs 


121299 


AA402227 


AA402227 


HS22826 


tropomodulin 3 (ubiquitous) 


121301 


AA402329 


NM_006202Hs.89901 


phosphodiesterase 4A, cAMP -specific (dun 


121302 


AA402398 


AA402587 


Hs.325520 


LAT1-3TM protein 


121304 


A k AM A AO 

AMQZ449 


AA293863 


Hs.97316 


EST 


121305 


AA402468 


AA402468 


Hs.291557 


ESTs 


134721 


AA403268 


AK000112 


Hs.89306 


hypothetical protein FU20105 


121323 


AA403314 


AA291411 


Hs.97247 


ESTs 


121324 


AA404229 


AA404229 


Hs.97842 


EST 


444422 


AA404260 


A1768623 


Hs.108264 


ESTs 


131074 


AA404271 


U16125 


Hs.181581 


giuiamatB receptor, lonotropic, kalnate 


121344 


AA405026 


AA405026 


Hs.193754 


ESTs 






AA405182 


Hs.97973 


tots 


121350 


AA405237 


AA405237 




gb2t06e10^lNa_CGAP_GCB1 Homo sapiens 


121400 


AA406061 


AA406061 


Hs.98001 


EST 


121402 


AA406063 


AA406063 


Hs.98003 


ESTs 


121403 


AA406070 


AA406070 


Hs.98004 


EST 


121408 


AA406137 


AA406137 


Hs.98019 


EST 


121431 


AA406335 


AA035279 


Hs.176731 


ESTs 


121471 


AM 11804 


AA411804 


Hs261575 


ESTs 


121474 


AA411833 


AA402335 


Hs.188760 


ESTs, Highly simDarto Trad fH-sapiens] 


121526 


AA412219 


AW665325 


Hs.98120 


ESTs 


121530 


AA412259 


AA778658 


Hs.88122 


ESTs 


121558 


AM 12497 


AA412497 




gbzt95g1Zs1 Soares_testi3_NHT Homo sap 


121559 


AA412498 


A1192044 


Hs.1 04778 


ESTs 


121584 


AM16586 


AI024471 


Hs.98232 


ESTs 


121609 


AA416867 


AA416867 


Hs.98185 


EST 


121612 


AA4 16874 


AM16874 


Hs.98168 


ESTs 


121737 


AM21133 


AM21133 


Hs.104671 


erythrocyte transmembrane protein 


121740 


AM21138 


AM21138 


Hs.143835 


EST 


436032 


AM22079 


AA 150797 


Hs.109276 


iatexln protein 


121784 


AM23837 


T90789 


Hs.94308 


RAB35, member RAS oncogene family 
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121802 AA424328 A1251870 Hs. 188898 ESTs 

121803 AA424339 AI338371 Hs.157173 ESTs 
135286 AA424469 - AW023482 Hs.97849 ESTs 
332778 AA424469 AW023482 Hs.97849 ESTs 

5 121808 AA424502 AA424313 HsJ3402 ESTs 

129517 AA425004 AW972853 Hs. 11 2237 ESTs 

121845 AA425734 A1732692 Hs.165066 ESTs, Moderately similar to AUfiLHUMAN A 

121853 AA425887 AA425887 Hs.93502 hypothetical protein FU14303 

121891 AA426456 AA426456 H&98469 ESTs 

10 121895 AA427396 AA427398 gb:zw33a02,s1 Soares ovary tumor NbHOT H 

121899 AA427555 R55341 H&50421 WAA0203 gene product 

121917 AA428216 AA406397 HsJ39425 ESTs 

121918 AA428242 BE274689 Hs, 184175 dvomosomo 2 open reading fame 3 

121919 AA428281 AA428281 Hs.98560 EST 
15 121941 AA428865 AA428865 Hs,98563 ESTs 

121942 AA428994 AW452701 Hs^93237 ESTs 

121970 AA429666 AA429668 Hs.98617 EST 

121993 AA430181 AW297880 Hs.98661 ESTs 

418706 AA430184 U73524 HsJ7465 ATP/GTP-blnding protein 

20 122022 AA431293 AA431293 Hs.88716 ESTs, Moderately similar to T42650 hypot 

122050 AA431478 AM53076 ELAV (embryonic lethal, abnormal visbn, 

122051 AA431492 AA431492 Hs.98742 EST 
122055 AA431732 AA431732 H&98747 EST 
122105 AA432278 AW241685 H&98699 ESTs 

25 '122125 AA434411 AKQ00492 Ks.98806 hypothetical proteh 

135235 AA435512 AW298244 Hs^66195 ESTs 

122162 AA435698 AA628233 Hs.79946 cytochrome P45Q, subfamily XIX (aromatiz 

422072 AA435711 AB0 18255 Hs.111138 WAA0712 gene product 

415106 AA435815 U40763 Hs.77965 peptkJyt-prolyl Isomerase G {cydophllm 

30 122186 AA435842 AA398811 Hs.104673 ESTs 

122235 AA436475 AA436475 Hs.112227 membrane-associated nucleic add binding 

412970 AA436489 AB026436 Hs. 177534 dual specificity phosphatase 10 

419288 AA442060 AA256106 Hs.87507 ESTs 

122310 AA442079 AW192803 Hs.98974 ESTs. Weakly slmDar to S65824 reverse t 

35 122334 AA443151 BE465894 Hs.98365 ESTs, Weakly similar to LB40_HUMAN NADP- 

122382 AA446133 AA446440 Hs.98643 ESTs 

122425 AA447145 A8007859 Hs.100955 K1AA0399 protein 

122431 AA447398 AA447398 Hs.99104 ESTs 

122450 AA447643 AA447643 Hs, 112095 hypothefcal protein DKFZp434f 1819 

40 426284 AA447742 AJ404468 Hsuf84259 dynein, axonemal, heavy polypeptide 9 

122477 AA448226 AA448226 Hs.324123 ESTs 

122500 AA448825 AA448825 Hs.99190 ESTs 

122522 AA449444 AA299607 Hs.98969 ESTs 

122536 AA450087 AF060877 Hs.99236 reguIatorofG-proteln signalling 20 

45 122538 AA450211 AA450211 Hs.99239 ESTs 

122540 AA450244 AA476741 Hs.98279 ESTs, WeaWy similar to A43932 mucin 2 p 

122560 AA452123 AW392342 Hs.283077 centrosomaJ P4.1 associated protein; unc 

421919 AA452155 AJ224901 Hs.109526 zinc finger protein 198 

122562 AA452156 AA452156 gb3x29c03.s1 Scares JotaLfetusJ4b2HF8_ 

50 122585 AA453036 AI681654 Hs. 170737 hypothetical protein FU23251 

122608 AA453526 AA453525 Hs.143077 ESTs 

122635 AA454085 AA454085 gbzx33a08.s1 SoaresjDtaUetusJ4b2HF8_ 

122636 AA454103 AW651706 Hs.99519 hypothetical protein FU 14007 
122653 AA454642 AW009166 Hs.99376 ESTs 

55 122660 AA454935 A1B16827 Hs.180059 nudear respiratory factor 1 

122703 AA456323 AA456323 Hs.269369 ESTs 

122724 AA457395 AA457395 Hs.99457 ESTs 

122749 AA458850 AA458850 Hs.293372 ESTs, Weakly simitar to B34087 hypothetl 

122772 AA459662 AW117452 Hs.99489 ESTs 

60 430242 AA459668 U66669 Hs^36642 34iycVnxyisobutyiyl-Coenzyme A hydrolase 

429838 AA459879 AW904907 Hs.30732 hypothetical proteki FU 13409; K1AA1711 

122777 AA459702 AK001022 Hs.214397 hypothetical protein FU 10160 simitar to 

135362 AA460017 AA978128 Hs.99513 ESTs, Weakly similar to T1 7454 diaphano J 

122798 AA460324 AW366286 Hs, 145696 sp&dng factor (CC1.3) 

65 122837 AA461509 AA461509 Hs293565 ESTs, Weakly similar to putative p150[H 

122860 AA464414 AA464414 gbzx78g01 ^1 Soares ovary tumor NbHOT H 

122861 AA464428 AA335721 Hs.213628 ESTs 
122910 AA470084 AA470084 Hs.98358 ESTs 

132899 AA476606 AA476606 Hs.59666 SMAO b (ho anOsense orientation 

70 122967 AA478521 AA806187 Hs.289101 glucose regulated protein, 58kD 

422845 AA478523 AA317841 Hs.7845 hypothetical protein MGC2752 

123009 AA479949 AA535244 Ks.78305 RAB2, member RAS oncogene family 

128917 AA481252 AI365215 HS.2G6097 oncogene TC21 

123081 AA485351 A1815466. HS.2439Q1 Homo sapiens cONA FU20738 fis, done HE 

75 123133 AA487264 AA487264 Hs.154974 Homo sapiens mRMA; cONA DKFZp667N064 (fr 

123184 AA489072 BE247767 Hs.18166 KlAA0870p 
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332467 AA489630 

123233 AA490225 

123234 AA490227 
123238 AA490255 

5 123255 AM30890 

430015 AM90916 

448892 AA490925 

123259 AA490955 

123284 AA495812 

10 123286 AA495824 

123315 AA498369 

457397 AA504125 

433049 AA521473 

123421 AA596440 

15 123449 AA598899 

426981 AA599244 

409986 AA593694 

123497 AA600037 

123604 AA609135 

20 123712 AA609684 

123731 AA609839 

123800 AA620423 

123841 AA620747 

123929 AA621364 

25 123978 C20653 

133184 D20085 

132835 D20749 

43S147 D51285 

128695 D59972 

30 124029 F04112 

124057 F13604 

449316 H01662 

130973 H05135 

124106 H12245 

35 124136 H22842 

124165 H30894 

429627 H43442 

124178 H45998 

129948 H69281 

40 452114 H69485 
124f0825254 

129056 H7Q627 

427580 H73260 

426793 H77531 

45 124274 H80552 

129078 H80737 

457656 H93412 

124315 H94892 

437712 H95643 

50 124324 H96552 

452933 H97146 

132231 H99131 

421877 H99462 

443123 H99837 

55 132963 N22140 

420473 N22197 

417381 N23756 

130365 N24134 

456610 N24195 

60 439311 N26739 

124383 N27098 

124387 N27637 

129341 N33090 

419793 N35967 

65 124433 N39069 

124441 N46441 

132338 N48270 

436575 N46355 

124466 N51316 

70 408048 N51499 

124483 N53976 

124484 N54157 

124485 N54300 
124494 N54631 

75 129200 N59849 

124527 N62132 



NNL014700HS.1 19004 
AW974175 H8.151875 
NNL001938HS.16697 
AW968504 Hs.123073 
AA830335 Ks. 105273 
AW768399 Hs.106357 
AF084535 Hs.22464 
AI744152 Hs.283374 
AA488988 Hs.293796 
AA495824 his. 188822 
AA496369 

AW969025 Hs.109154 
AU076668 Hs.334884 
AA596440 Hs.291154 
AUM9325 Hs, 112493 
AU044675 Ha.173081 
NM_014777Hs^7730 
AA765256 Hs.135191 
AA609135 Ha.293076 



AA609839 Hs.334437 

AA620423 Hs. 112862 

AA620747 Hs.1 12896 

AA621364 Hs.1 12981 

T89832 Hs.170278 

AA001021 Hs.6685 

Z63844 Hs.5790 

AL133731 Hs.4774 
NM_O03478Hs.101299 

F04112 Hs.312553 

AA902384 Hs.73853 

A1609045 Hs.321775 

AI638418 Hs.1440 
H12245 

H22842 Hs.101770 

H30039 Hs.107674 
NM_015340Hs.2450 

BE463721 Hs.97101 

AI537162 Hs.263988 
Hs.8236 



AJ76995B Hs.108336 
AK001507 Hs.44143 
X89887 Hs.172350 
H80552 Hs.102249 
AI351010 Hs.102267 
AW952124 Hs.13094 
NIVL005402HS.288757 
X04588 H3.85844 
H96552 Hs.159472 
AW391423 HS.2B8555 
AA662910 Hs.42635 
AW250380 Hs.109059 
AA094538 Hs.272808 
AA099693 Hs.34851 
AL118782 Hs.300208 
AF164142 Hs.82042 
W56119 Hs.155103 
AF172066 Hs.106346 
BE270668 Hs.151945 
N27098 Hs.102463 
N27637 Hs.109019 
AI193519 Hs.226396 
AI364933 Hs.168913 
AA280319 Hs.288840 
AW450481 Hs.161333 
AA353868 Hs.1 82982 
AI473114 

R10084 Hs.113319 
NM_007203H342322 
AI821780 Hs.179864 
H66118 Hs.285520 
AB040933 Hs.15420 
N54831 Hs.271381 
N59849 Hs.13565 
N79264 Hs.269104 



K1AA0665 gene product 

ESTs, Weakly similar to MAPB_HUMAN MICRO 

dowiHegubtorof transcription 1, TBP-b 

CDC2+eiated protein kinase 7 

ESTs 

ESTs 

epilepsy, progressive myoclonus type 2, 
ESTs, Weakly Simla; to CA15_HUMAN COLLA 
ESTs 

ESTs, Weakly slmSar to A46010 X-Onked 
gb:zv37d10s1 Scares ovary tumor NbHOT H 
ESTs 

SEC10{S.cerevtsiaeHIka1 

EST, Weakly similar to 138022 hypotheCc 

Homo sapiens mRNA; cDNA DKF2p564D036 (fr 

K1AA0530 protein 

K1AA0133 gene product 

ESTs, Weakly similar to unnamed protein 

ESTs 

Homo sapiens cONA; FU21543 Gs, done C 

gbae62»1 J1 Stratagena lung carcinoma 

EST 

ESTs 

ESTs 

ESTs 

thyroid hormone receptor Interactor 8 

hypothetical protein dJ37E16.5 

Homo sapiens mRNA; cONA DKFZp761C1712 (f 

cuEn5 

gb:HSC2JH062 normalized Infant brain cDN 
bone morphogenetlc protein 2 
hypothetical protein DKFZp43401428 
DEAO/H (Asp-GiuVUa^\sp/His) box polypep 
gb:ym17a12.r1 Scares Infant brain 1MB H 
EST 
ESTs 

leucyMRNA synthetase, mitochondrial 
putative 6 protein-coupled receptor 
ESTs 
ESTs 

gfcyu70c12*1 Wetonann Olfactory Epithel 
ESTs, WeaJdy simflar to AIUElHUMAN !l!l 
Homo sapiens clone FLB5914 PR01821 mRNA, 
H1R (histone cell cycle regulation defec 
EST 

lysosomal 

presenilis associated rhombold-Gke pro 
v-ra) simian leukemia viral oncogene horn 
neurotrophic tyrosine kinase, receptor, 
Homo sapiens cDNA: FU22224 Ms, clone H 
Homo sapiens cDNA: FU22425 fls, clone H 
hypothetical protein DKFZp434K2435 
mitochondrial ribosoma) protein L12 
putafJve transcription regulation nudea 
epsIton-tubuHn 

Sec23-interacflng protein p125 
solute carrier family 23 (nucteobase tra 
eukaryotic translation initiation factor 
retinolc acid represstbte protein 
mitochondrial ribosomal protein L43 
EST 
ESTs 

hypothetical protein RJ11126 

serine/threonine kinase 24 (Ste20, yeast 

PR01575 protein 

ESTs 

gokjin-67 

ESTs 

ktnesin heavy chain member 2 
A kinase (PRKA) anchor protein 2 
ESTs 

ESTs, Weakly similar to 2109260A B cell 
KIAA1500 protein 

ESTs, Weakly similar to 138022 hypotheU 
Sam68-Gke phosphotyroslne protein, T-ST 
ESTs 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



124532 N62375 
133213 NS3138 
124539 N63172 
129196 N537S7 

124575 N58168 

124576 N88201 

124577 N68300 

124578 N68321 
124593 N69575 
128501 N75007 
332434 N75542 
128473 K90066 
128639 N91246 
124652 N92751 
133137 N93214 
124671 N99148 
133054 R07876 
425266 R10865 
124720 R11056 
124722 R11488 
128944 R23930 
132965 R26589 
426504 R37588 
438828 R37613 
124757 R38398 
124762 R39179 
124773 R40923 
135266 R41179 
427961 R41294 
414303 R42307 
128540 R43189 
124785 R43306 

124792 R44357 

124793 R44519 
124799 R45088 
124812 R47948 
124821 R51524 
424123 R54950 
124835 R55241 
124845 R59585 
124847 R60044 
440630 R60872 
124861 R66690 
332503 R67266 
124879 R73588 
124892 R79403 
124906 RS7647 
124922 R93622 
•124940 R99599 
124941 R99612 
124943 T02888 
124947 T03170 
124954 T10465 
456662 T15418 
410653 T15597 
418133 T15652 
440014 T16898 
131082 T26644 
124980 T40841 
124984 T47566 
124991 T50116 
457222 T50145 
125000 T58615 
132932 T59940 
444484 TB3595 

125008 T64891 

125009 T64924 
445384 T64933 

125017 T68875 

125018 T69027 
125020 T69924 
437871 T70353 
134204 T79780 
125050 T79951 
125052 T80174 
125054 T80622 



N62375 Hs.102731 
AA903424 Ks.6786 
D54120 Hs.146409 
BE296313 Hs.265592 
N68168 
N68201 

N68300 Hs.138485 
N68321 Hs.231500 
N69575 Hs.102788 
AL133572 Hs.199009 
A1680737 Hs269068 
T78277 Hs.100293 
AW582962 Hs.102897 
W19407 r&3862 
AB002316 Hs.65746 
AK001357 Hs.102951 
AA464636 Hs291079 
J00077 Hs.155421 
R052S3 

T97733 Hs.185685 
AL137586 Hs.52763 
AI248173 Hs.191460 
AW162919 Hs.170160 
AL134275 Hs.6434 
H 11368 Hs.141055 
AA553722 H3.92096 
R45154 Hs.338439 
R41179 Hs.97393 
AW293165 Hs.143134 
NM_004427Hs.165263 
AW297929 Hs.328317 
W38537 Hs.280740 



R44357 
R44519 
R45088 
R47948 
H87832 



Hs.48712 



Hs.188732 
Hs.7383 
AW96615S Hi 58582 
R55241 Hs.101214 
R59585 Hs.101255. 
W07701 Hs.304177 
BE561430 Hs.239388 
R67567 Ha.107110 
NM_004455Hs.15O956 
R73588 Hs.101533 
AI970003 Hs.23756 
H75964 Hs.107815 
R93622 Hs.12163 
AF068846 Hs.103804 
AI766661 Hs.27774 
AW963279 Hs. 123373 
T03170 Hs.100165 
AW964237 Hs.6728 
U55184 Hs.154145 
BE383768 Hs.65238 ■ 
R43504 Hs.6181 
AW960782 Hs.6858 
A/091121 Hs.246218 
T40841 Hs.98681 
BE313210 Hs.334798 
T50116 

NMj0O4477Hs.2O3772 
T58615 Hs.235887 
AW1 18826 Hs.6093 
AK002126 Hs.11260 
T91251 

T64924 Hs.303046 
T79136 Hs. 127243 
T68875 

T69027 Hs.269481 
T69981 

AI084813 Hs. 114088 
AI873257 Hs.7994 
AW970209 Hs.111805 
T85104 Hs.222779 
T80622 Hs-268601 



EST 
ESTs 

cell division cycle 42 (GTP-bfodlng prot 
ESTs, Weakly similar to D8022 hypotheti 
gbza11c01.s1 Soares fetal Over spleen 
ESTs Weakly similar to 138022 hypothet] 
gb2a12g07s1 Scares fetal fiver spleen 
EST 
ESTs 

protein conlakibg CXXC domain 2 
Homo sapiens cONA RJ1 1916 fis; clone HE 
O-Onked N-acetylgluoosamine (GtcNAc) tr 
C6M7 protein 

regulator of nonsense transcrfots 2; DKF 
K1AA03 18 protein 

Homo sapiens cONA FU10495 fis, done NT 
ESTs, Weakly similar to T27173 hypotheti 
alpha-fetoprotein 

gb:ye91c08.3l Scares fetal fiver spleen 
ESTs 

anaphase-prorroting cxxnpiex subunft 7 

hypothetical protein MGC12936 

RAB2, member RAS oncogene family-Bee 

hypothetical protein DKFZp761F2014 

Homo sapiens done 23758 mRNA sequence 

ESTs, Moderately similar to A46010 X-Cn 

ESTs 

WAAQ328 protein 
ESTs 

early development regulator 2 (homolog o 
EST 

hypofliefica! protein MGC3040 
hypolhetJcal protein FU20736 
gb:yg24h04s1 Scares Infant brain 1NIB H 
g b:yg38g04.s1 Scares infant brain 1NIB H 
ESTs 

ketch (Drosoptifla>-Cke 3 

Homo sapiens cDMA FIJ12789 fis, done NT 

EST 

ESTs 

Homo sapiens done FLB8503 PR02286 mRNA, 
Human DNA sequence from done RP1-304B14 
ESTs 

exostoses (mufiipJeHflce 1 
ESTs 

hypothetical protein similar to swine ac 
ESTs 

eukaryotic translation initiation factor 
heterogeneous nudear rioonudeoprotein 
ESTs, Highly similar to AF161349 1 HSPCO 
ESTs, Weakly similar to ALU1_HUMAN ALU S 
ESTs 

WAA1548 protein 

hypothetical protein FU 11 585 

95 kDa retinoblastoma protein binding pr 

ESTs 

ash2 (absent, small, or homeotic, Drosop 
Homo sapiens cDNfc RJ21781 fis, done H 
ESTs 

eukaryotic translation elongation factor 
gb^b77c10s1 Stratagene ovary (937217) 
FSHD region gene 1 
ESTs 

Homo sapiens cDNA: FU22783 fis, done K 
hypothetical protein HJ1 1264 
gteyd60a10s1 Scares fetal Over spleen 
ESTs 

Homo sapiens mRNA for KIM 1724 protein, 
gtKyc30f05.s1 Stratagene fiver (937224) 
sex comb on mid teg homolog 1 
gteyc19d03j1 Stratagene lung (937210) H 
ESTs 

hypothetical protein FU20551 
ESTs 

ESTs, Moderately similar to similar to N 
ESTs. Weakly similar to envelope [H.sapi 
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125083 T85352 


T85352 






T85373 




125068 T86284 


T86284 




416507 T89579 


AL045384 


Hs79353 


125080 TQO360 


T90360 


Hs.268620 




AW576389 Hs335774 


125104 TQS590 


T955SD 




115107 TQ7957 


T97257 


Hs.94560 


bYKXT) TQ7«iQ0 


AA845462 Hs.1 24024 


12511ft T07fi9n 
l/OMO 13/0/U 


R10606 


Hs. 269890 


49fi19rt T0777H 
IZOI/U J3///0 


T97775 


Hs.100717 


104 I0U 13010/ 


T98152 


Hs.79432 


1 £9 IOO WO 11/ 3 


AW962364 Hs. 129051 


195144 WT7QOQ 

i ii4 wo/yyy 


AB037742 


Hs.24336 


J/9I3U VY00/4V 


W38240 




450142 W40150 


AW207469 Hs.24485 


111987 W45435 


AW453069 


H3.3657 


125178 W58209 


W93127 


HsJ1845 


125180 W58344 


W58469 


Hs.103120 


195182 W<%885n 


AA451755 


Hs.263560 


448888 WR87T8 


AL030996 


Ks.16411 


125147 WM108 

1 t^J 19/ »VU3 1 WW 


AF086270 


Hs.278554 


111407 WRQ111 
100*13/ VVD3 1 1 1 


BE617303 


Hs.74266 


derail woyoyy 


Z97630 


Hs.226117 


i 909W UUftQ JKQ 

1/3/0/ woiwoy 


R98881 


Hs.109655 


A991Rfi U/70yf)/1 
4//1D0 W/ £1/1 


W72424 


Hs.1 12405 


1959TW W79794 


W72724 


Hs.103174 


19H9<9 14/70644 


AA746225 


Hs.103173 


490001 W/0399 


BE383436 


Hs.108847 


1/9 //o W/1/U1 


AJ916269 


Hs.109057 


40C90C U/7CC4A 


W74169 


Hs.16492 


19599ft \A/7t31Q7 

v/o//o w/yja/ 


AA033982 


Hs.110059 


10/030 WoOOao 


AL135094 


Hs.47334 


1/9/00 WoOUOo 


N99713 


Hs.109514 


125247 WHfiflfll 
1/0/4/ W00001 


AA694191 


Hs.163914 


1/3/30 WO/ 0U1 


AI051967 


Hs.110122 


195981 WRA.Q49 
1/0/00 W 007*1/ 


AA098878 




i9^0ftft u/onn99 

1/3/00 W9UUZZ 


W90022 


Hs.186809 


49U00/ W3/// / 


U91543 


Hs.25601 


10/ Wl W9//04 


NM_007115Hs^9352 


A9S943 wairtdn 
4/0/10 wyowu 


H05317 


Hs283549 


1/9// / W90/// 


W93227 


Hs.1 03245 


1/9// 0 ¥¥309/0 


AI218439 


Hs.129998 


i/o /ou wyjooy 


AI123705 


Hs.105932 


no/uo wvynjuo 


W93949 


Hs.33245 


131844 W94401 


AJ419294 


Hs.324342 


125284 W94688 


NM.002666HS.103253 


417111 W94787 


AW016321 Hs.82306 


445424 Z38294 


AB028945 


Hs.12696 


125289 Z38311 ' 


T34530 


Hs.4210 


446313 Z38465 


H06245 


Hs.106801 


431342 Z38525 


AW971018 Hs.21659 


433227 Z38538 


AB040923 Hs. 106808 


428306 Z385S1 


AB037715 HS.1B3639 


424624 Z38783 


AB032947 Hs.151301 


125295 Z39113 


AB022317 Hs.25887 


125298 Z39255 


AW972542 Hs289008 


125300 Z39591 


Z39591 


Hs.101376 


448378 Z39783 


BE622770 Hs*64915 


444582 Z39920 


R55344 


Hs22142 


130882 240168 


AA497044 HS.20SB7 


128888 Z40388 


AI760853 


H3^41558 


125310 Z40646 


R59161 


Hs.124953 


125315 Z41697 


R38110 


Hs.106296 


125317 Z99349 


Z99348 


Hs.112461 


135096 Z99394 


AA081258 





gbydfl2d01s1 Soares fetal liver spleen 
gb:yd82f07 j1 Soares fetal Over spleen 
gb:yd77b07s1 Soares fetal fiver spleen 
transcription factor Dp-1 
ESTs, Highly sMbr to AUU6JWMAN ALU S 
EST, Moderately similar to S65657 alpha- 
gbys40a03^1 Soares fatal Over spleen 
ESTs, Moderately sJmDar to I38022 hypot 
deltex (DrosopMa) hornobg 1 
8byf35f11 j1 Soares fetal Over spleen 
EST 

fibrillin 2 (co ng enital contracture! ara 
ESTs 

KIAA1321 protein 

Empirically selected from AFFX single pr 

cbondrolfln sdfate proteoglycan 6 (bama 

activ1ty-4ep8ndent neuroprotective prate 

ESTs 

ESTs 

ESTs 

hypothefical protein LOC57187 

hete rochmmafin-Cke protein 1 

hypothetical protein MGC4251 

H1 hlstone famJy, member 0 

sex comb on mUteg (Drosophila}-Dke 1 

S100 raldum-bindlng protein A9 (calgran 

ESTs, Weakly similar to TSP2JWMAN THROM 

ESTs 

hypothetical protein MGC2749 

ESTs. Weakly similar to ALU5_HUMAN ALU S 

DKFZP56MG2022protBlri 

ESTs, Weakly similar to 138022 hypothefJ 

hypothetical protein FU14495 

ESTs 

ESTs 

ESTs 

gb:zn45g10.r1 Stratagerte HeLa cell s3 93 
ESTs, Highjy similar to LCT2_HUMAN LEUKO 
chrornodornaln heOcase DMA binding prote) 
tumor necrosis factor, alpha-Induced pro 
ESTs 
EST 

enhancer of potycombl 

ESTs 

ESTs 

ESTs 

periEph 

destrin (actin depolymenzing factor) 
cortactin SH3 domain-binding protein 
Homo sapiens cOMA RJ 13069 fss, done NT 
ESTs, Weakly similar to PC4259 ferritin 
ESTs 

ketch (DrosophiiaHikfl 1 
hypothetical protein HJ10210 
Ca2+dependent activator protein for seer 
soma domain, Immunoglobulin domain ((g), 
Homo sapiens cONA: FU21814 hs, clone H 
EST 

Homo sapiens cONA RJ12908 (is, done NT 

cytochrome b5 reductase 05R.2 

hypothetical protein FU 10392 

ariadne prosophOa) homolog 2 

ESTs 

ESTs 

ESTs, Weakly similar to 138022 hypothec 
zinc finger protein 36 (KOX 18) 
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TABLE 3A 

Table 3A shows the accession numbers far those pkeys lacking unigenelD's for Table 3. The pkeys In Table 7 tacking unigenelD's are represented within 
Tabtes 1-6A. Fa each probeset we have listed foe gene ctosierrujntef from wWch &te oflgonudeo&des vrere designed. Gene dusters were compBed 
using sequences derived from GenbankESTs and mRNAs. These sequences were dustBred based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist Oakland California). The Genbank accession numbers for sequences comprising each duster are Bated m the 'Accession" 
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Ftoy: Unique Eos probeset Identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



15 

Pkey CAT Number Accession 

108469 116761J AA079487 AA128547 AA128291 AA079587 AAG79600 
124106 125446.1 H12245AA094769R14576 
20 108501 13684.-12 AA083256 

108562 36375.1 AA100796 AF020589 AA074629 M075946 AA100849 AA085347 AA126309 AA07931 1 AA079323AA085274 

101300 4669 J BE535511 M62098 AA306787 AW891766 AA348998 AA338869 AA344013 AW956561 AW389343 AW403607 L40391 

AW408435 AA121738 AI568978 H13317 R20373 AW948724 AW948744 AA335023 AA436722 M448690 C21404 
„ c AW884390 AA345454 AA303292 AA174174 BE032290 T90614 AA035104 R76028 AA126924 AA741086 AW022056 

25 AW1 16940 AA121666 A1832409 AA683475 AI140901 AI623576 AW519064 AW474125 AI953923 AI735349 AW150109 

AJ436154 AW118130 AW270782 AI804073 N27434 AA876543 AA937815 AI051 166 M505378 AI041975 AI335355 
AI089540 AA662243 A1127912 AI925604 A1250880 AI366874 A1564386 A1815196 AI683526 AI435885 Al 160934 H79030 
AI801493AA448691AI673767AKJ76M2AIM327^ 
, n AA90780SAA911483AIS88411AI37iaS8Am76236AI078866AW5im» 

3U AA296025AA523155 AA853168 AI696593 A1658482 AI566601 AW072797 AA12S047 AA035502 AW243274 AA992517 

R43760 

132091 94851J AW954243 AA829930 AA412478 AA82B434 AA814538 AJ927418 AI192435 W52897 AA443666 AA031913 A1683306 

AA918481 A1183314 D83907 A1206832 AA876122 D83836 D83838 D82533 A1761290 AI191125 AI143749 AW771909 
- c AI241436 A1767267 W56507 AA847787 AA568692 T10502 AI247870 AA715017 AA643304 AA890233 AA811387 

AA897470 AA907729 A1708679 A107BG10 AA452830 AW419160 A1783713 N80205 W56778 AA676899 A188871B N69930 
AI338935 A1217580 AA639508AA5758368E046852 A1312S51 AI038406AA628649 AA643838 AM93761 AA032024 
W38849 AA340178 AA447052 AA452969 W19369 AA296364 H44229 W53767 C05751 C05835 AT741989 N98532 
AW102617 AA412583 A1922246 W38495 AA355375 AA928571 C06275 AA352500 N93132 
117034 20113_2 U72209 NM.005748 AI655607 AI052758 AA385199 AW956794 H88679 AL135153 A1765644 AA384399 AW956458 

40 AA568443 AA804610 AI873513 H88639 225371 R63456 W44919 

100752 33207_21 T81309 8E019033 R94181 BE019188 NM.000612 J03242 AW411299 BE300064 BE297544 R94182 AW630108 T53723 
D58853 H78073 H60594 BE299560 T48899 H70196 M17426 N77077 S77035 H58384 H61664 H7B54Q TB4527 C17198 
H6Q255H71980 R92644Vm050X00910M29645 R91G^Mtf^^ BE464561 XD6260 

, - R94741 T54216 C18594 BE262015 X06161 AW409889 AA378400 BE263228 BE313278 R881 16 BE313457 H43500 

4 *> T48617 BE313761 H77309 A1207601 X06159 H40413 X03425 T87663 R10627 X03562 M14118 W03982 R97520 H81229 

T83157 H83168 H48762 AA669898 BE263054 H47289 AA022807 R1 1555 H74260 R76968 R28338 H72534 H72464 
H62031 N72478 N45355 AW411300 R89113 R69135 H58454 T83281 R93476 H69645 H68015 T82229 H71089 T85121 
H59939 W65299 N78176 H53909 N72373 R21788 H04660 H59639 H61874 BE262219 T53614 M73335 N50464 W00943 
<ft N77189 R89257 AA570502 R89432 R06366 AA553480 AA776271 AA551359 AA551050 H51670 AA601052 BE299081 

-> u H68198 H52278 BE207832 N91 192H70332 XD7868 XD7868 H69464 H53782 H73710 R80435 AA553384 AW884176 

N53475 T71662 AW954036 AW954033 AA552931 H93206 AA430218 AA553476 AI918470 T54124 BE207982 BE30O177 
N73994 AW882625 N33549 N53838 AA722389 H71878 H58909 H37849 H78435 T47933 R77174 R83814 AA41 1890 
H94199 AA663208 BE205778 AA490137 H70492 R98232 H37800 AA679294 H40341 H74238 H47290 H73231 T48618 
AA025428AI039521 H92969 N59389 H80538 H72933T90630AA4 11691 N55000 H74225 AA340290 AW957061 T54316 
•>•> AA340437 H57125 H58908 H79027 K63450 N74623 R93425 H68714 H68758 N68396 H48763 N69256 H57320 H53831 

H53589 N68833 N52453 H56048 H69870 H78074 R69253 R83375 T53615 H94330 H58455 H90864 T47934 H74261 
R89258 R97997 R91056 R28339 R86760 H78235 R97521 H67692 H40358 AA022688 H52513 H59601 T88690 H65256 
H63397 W65397 AA553588 R19280 N52645 W73930 R06367 R21743 H72372 N73921 AW883539 AW882639 T40616 
£f\ H47084 R95723 AA634316 AA862781 H77310 R91389 H93111 R92767 T54512 R89341 H70333 H57817 H82941 H62032 

ou N52638 H58385 T91796 H51066 AA340292 T49918 H81230 R36121 N5041 1 T87664 N62436 N39340 AA665637 

AA340446 H93377 H92973 BE296290 BE269788 H61665 AA340444 N54605 AA454101 R10628 R94200 A120D549 
AA342640 BE298855 BE250229 T49916 H82008 N28278 AW880662 H71268 N76791 H47685 H65255 W05198 
AW889144 N76677 H71702 H6B036 H71915 R91612 R87807 H68059 AI133328 AI247866 AA621443 AW881050 
„ AA700847 AA340413 AW878608 AW881 181 AW678249 H71916 N54596 BE161581 AW878082 W04212 AW881040 

0D AW885492 AW880519 AA334887 AW878715 W06882 AW630222 AW885381 H70869 AW381778 H47601 AW889982 

H63868 AW884986 AW878713 AW878685 R36391 AW878694 AA368070 C03393 AW878695 AW878705 AW878665 
AW878742 AW878620 AW878823 AW878688 R29048 AWB78690 AW878686 AW878810 AW878827 AW878733 
AW878659 AW878749 AW878681 AW883353 AW883277 AW883300 AW883565 AW883298 AW883143 AW883045 
AW883482 AW883352 AW883417 AW883357 AW883231 AW883474 AW883355 AW882620 AW882533 AW883754 
AW883139 AW882827 AW883641 AW883567 AW883481 AW882983 AW882982 AW882465 AW883419 AW882466 
AW883639 AW883230 AW882981 AW882534 AW882874 AW882619 AW883480 AW882826 AW882831 AW882835 
AW882830 AW883563 AW882456 AW627642 
116417 5418J 1 AW499664 AW500888 AL042095 AW576556 AW265424 AI521500 AA761333 AA761319 AW291137 AA649040 AA769094 

- c AA489664 AA63531 1 AW070509 AA425658 AJ381489 AA609309 AA134476 W74704 A1923640 AW084888 H45700 

75 AJ985564 AW629495 AW614573 A1859571 AI693486 AA913892 A1806164 AA909524 AW263513 A1356361 Z4O708 
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123712 374423J 
117156 145392J 



125003 1802095.1 

125020 116017 J 

125066 1814993.1 

116661 1532859J 

125104 413347 1 

124575 1666649.1 

125263 1547J 

131659 3672J 



125565 1704098.1 
132983 11922.1 



118584 532052.1 
133607 1227.6 



AI332765 AI392620 AA181060 AW1 18719 AW968804 AW263502 AW505314 AA036967 W74741 R51 139 H19364 H45751 

Z44962 AW370823 H25650 T54007 AA453000 AL045739 

AA609684AA758732 

W73853 AA928112 W77887 AW889237 AA148524 AI749182 AI754442 AI338392 AI253102 AJ079403 AI370541 AI697341 

H97538 AW188021 AI927669 W72716 A1051402 AI1B8071 AQ35900 N21488 AW770478 W92522 AB91028 AI913512 

AI144448 W73819 AA604358 N28900 W95221 AB68132 H98465 AA148793 

T91251 T64891 T85665 

T69981T69924AA078476 

T86284T81933 

R615O4F04247 

T95590AA703278 H62764 

N6B168N69188 N90450 

AA098878 W88942 

AW960564 AA092457 T55890 D56120 T92525 A1815987 BE 182608 BE182595 AW080238 M90657 AA347236 AW961686 
AW176446 AA304671 AW583735 T61714 AA316968 AI446615 AA343532 AA083489 AA488005 W52095 W39480 N57402 
D82638 W25540 W52847 D82729 D58990 BE619182 AA315188 AA308638 AA112474 W76162 AA088544 H52265 
AA301631 H80982 AA 113786 BE620997 AW651691 AA343799 BE613669 BE547180 BE546656 F11933 AA376800 
AW239185 AA376086 BE544387 BE619041 AA452515 AA001606 AA190873 AA180483 AA159546 F00242 AI940609 
AI940602 AI189753 T97663 T66110 AW062896 AW062910 AW062902 AI051822 AI828930 AA102452 AB85095 AI81939Q 
AA557597 AA383220 AI804422 AI633575 AW338147 AW603423 AW606800 AW750567 AW510672 AI250777 AA083510 
AW629109 AW513200 AA921353 AJ677934 AJ 148698 AI955858 AA173825 AA453027 AI027865 AW375542 AA454099 
AA733014 AI591384 R79300 R80023 AA843108 AA62605B AA844898 AW375550 AA889018 AM74275 AW205937 
AI052270 AW388117 AW388111 AA699452 AC42230 N47476 H38178 AA366621 AA113196 AA130023 H39740 T61629 
A1885973 AW083671 AA179730 AA305757 AI285455 N83956 AA216013 AA336155 AW999959 T97525 AA345349 
T91762 AA771981 AI285092 AI591386 BE392486 BE385852 AA682601 A1682884 AA345840 T85477 AA292949 
AA932Q79 AA098791 082607 T48574 AW752038 C06300 
R20840R20839 

M30269 NM.002508 X82245 AI078760 AW957003 D78945 M27445 AA650439 AL048816 AV660256 AV660347 
AA333052 BE295257 T60999 AA383049 AW369677 Z26985 AW175704 AA343326 AW747957 A1818389 W17308 
W17302 H15591 AA371284 AA370412 W94966 BE384365 T28498 R80714 R16959 H21723 AW835154 056097 D56381 
W21232 AA190565 AW379755 AW067895 

AW136928 AI685655 BE218584 BE465078 N68963 AA975338 BE147199 N76377 

BE273749 BE397561 BE387189 AL037858 AL037878 AJ963Q94 BE259216 AA01 1363 AU036189 BE562325 AA251 169 
BE617431 N98537 AA158093 AL047800 M34539 NMJ000801 AA312140 D16971 AA158904 AA3071 14 AA312803 
T09203 AW629686 AL048504 BE388578 AA220957AA1 58364 BE267385 AA294971 C18055 BE241757 AA1 15056 
AI936769 BE378435 BE206971 AW674924 BE622060 AA604674 AA1 15273 AW402159 AA338608 BE568819 M80199 
X55741 AA3751 1 1 AA376016 BE612671 AA805742 AW405588 N25850 N44580 H06031 AW403549 BE536552 AA056726 
BE543239 AA082517 A1201645 AI201642 A1192622 N40104 AA370921 BE547569 AI969602 AA302038 AI197890 
AW268354 AI0 14938 W45448 AI541395 AA037272 BE538826 AL039613 BE536130 AA299355 AW805147 AW974624 
H53220 A1471471 AA399303 AA007386 W35106 BE613277 R12739 R12738 AA304342 AA687802 BE409581 AI498844 
AV662092 AW904105 AA011375 BE315214 H99302 BE537893 N32299 AW855829 AI291320 BE078322 AI301395 
AA303362 N32719 AA358328 AA357877 AI952540 H56279 H02758 H02048 AWB05233 R82224 AA410772 AA291352 
BE171 109 N69935 BE169248 AA361173 H44978 BE617887 D52560 AA084043 W03595 R67219 N38477 N42924 R67104 
H44901 H79695 W21105 AA393988 W30899 AA316096 BE622896 W46872 AA442678 BE544893 BE540112 BE621873 
AA338067 N55052 BE398154 BE621210 AA740760 C03739 C03206 BE396692 AA482370 AA031614 AA301575 
AA304710 AA132153 AA029796 AA994960 H19567 AA442969 H49781 H46871 AA035395 AA056185 AA149378 
AA643080 AL135479 AA292329 AA654337 AA041228 AA454888 AA025039 W58331 AA625981 T94941 AA302448 
H19900 AA218956 AA513790 AA563962 AA398076 W44441 AA293276 W47373 AA625879 W30688 AA043029 T64284 
R79151 AA304340 AA485186 AA604939 R82470 AA421425 AW771456 AI339329 AA304424 AA605236 AA936934 
AA587673 AI209162 AI697301 AM79995 AJ679814 A1381950 AW189125 AJ955888 AI986019 BE301019 AI084792 
AI31021 1 AW189307 AI022070 AW977204 AI146825 AW190163 AW303281 AIB28345 BE046043 AW029257 AA482268 
A1246507 AI420729 AW084932 AW439514 AI890487 AW439692 A1523896 A1186612 A1659953 AI889773 AA687527 
AWD72694 AW262153 AW467371 AI613269 A1679238 D54404 AA158103 AW105527 AW1 49739 AW150361 AW268387 
AW117708 AI951682 AI687440 AW674285 AA678365 AI587082 AA732095 AA019899 W45661 AA627300 BE613304 
AA765891 AA612935 AI814658 AW316916 R66594 AA514640 AA025040 AA031472 AW732076 AA029797 AI244560 
AI128734 AW381720 AI092360 AI263283 AW613175 AI890675 AI720156 AW631348 AI635106 AI278045 AA303979 
AA703505 W45449 AW078661 AC92052 AW381707 AI147854 AW381743 AA158905 AA303258 AA888144 AW195967 
AA428706 AA989559 AA617731 H19882 BE543418 AA830386 AM21302 W58652 T94995 AI869743 A1679145 
AW085971 N98425 AA765136 A1347027 AI356955 AA928038 AI679717 AA458459 AA679281 AB67973 A127Q041 
AA7S5135 AA732793 AI798447 AA668646 AA251008 A1984538 AI401737 AA0561 86 BE0433O8 AW662375 A13021 10 
N5G724 W96332 BE537047 N26983 A1567172 AA765296 AW673237 N29784 AA534275 AA084044 AW067973 
AW300766 T63398 W46823 R39790 AI364185 AW298582 AA454814 AW069878 N67751 H05982 N23140 AI362647 
A1302086 AI767772 N25755 H531 14 AA706133 T93511 AA429291 AA935294 AA987647 W02803 R66595 AS80795 
W23673 AW440794 AA722872 H49538 AW131042 AA531603 AA908665 AA040791 AA23531 2 W52205 N93444 R82180 
H02759 H79696 AW088894 H56079 AA961143 AW067776 AW973745 AA016311 AW071227 AA01751 1 A1753994 
W47374 T64155 AA296092 AJ698626 AA558158 AA296088 AW794259 H0 1 963 AA 149267 AA485076 AA975856 H44938 
AA035396 AJ955555 H46289 AA486161 A1631222 AA359047 AW794253 AI806962 AW243930 AA526145 AW878734 
AA018464 AA132031 R67220 R79152 AA296093 H54300 A1005160 BE242548 AW992803 AW878644 AW878666 727742 
R82471 AW517604 AW472738 AI282904 R39791 AA486098 AW467891 AW960520 AA551736 AA056621 AW945197 
R66373 AA554236 BE242202 A1904376 A1832590 H19484 R00890 AI627677 AA302287 AI869451 AI734855 AT708073 
AI832902 AA585184 AW204299 AA055565 D12417 01 1975 T63543 AW664099 R54423 BE612712 T96340 T63985 
AA598917 T40735 T64053 AA149284 AW272548 AA363445 AA042893 AW300697 BE261973 T53501 T53500 AW878729 
AW878657 AW794391 AA069193 R01553 H44875 AA385406 AA533968 M93060 AL135600 W96331 AA017651 
AA018849 AA017692 H85337 BE278690 AA731598 AA018512 AI076813 AI022644 R02585 X52220 AW296894 AA825671 
AJ699321 A1393601 AW592611 AI146747 AA608921 AA158365 AW590007 AA354519 D20081 R02704 AW798339 
M92422AA094903AA007676 
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133681 13893J 



134403 17037.1 



135096 33756.2 
103767 34817.1 



103855 84277.1 



126872 
113026 



120284 
112540 
111904 
121094 
128510 
114106 
121335 
120761 
122050 
130018 
100104 
121822 



108280 
108309 
107832 
123523 
123533 
132225 
125017 
125063 
125064 



84431J 



158963.1 

1605263.1 

1719336.1 

275729.1 

19829.1 

1182096 1 

279548.1 

224903 1 

273507.2 

18986.1 

19974.4 

244391.1 



110682.1 
111495.1 

genbanKJW)21473 
gertoank_AA608588 
genbankJ\A608751 
genbank_AA1289BO 
genbanU58875 T68875 
geobanK_T85352 T85352 
genbanK_T85373TB5373 



AI352558 Z82248 X78138 NM.003405 AU077248 AA223125 S80794 D78577 AI124697 AW403970 BE614039 BE296713 
BE621334 L20422 X80536 054224 D54950 X57345 N29226 AA127798 AA340253 F08031 AA 192540 K67636 AA321827 
AW950283 AA084159 BE538808 AW401377 AA256774 C03366 W46595 W47608 AA305009 H69431 H69456 AL120082 
H11706 AA303717 AA361357 H22042 H78020 AW999584 AA134368 AA322911 AA322961 H60980 NB5248 N31547 
H79624 T1 1718 W85828 AWB94663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151067 BE002724 AA608688 H82S92 BE155392 AW383636 BE155394 AA487004 AW383504 AI342365 R82553 
W16498 BE155344 AI143938 R69901 AA322873 AW340648 R25384 AA367935 A1559408 M033522 AA374252 
AW835019 AI922133 AB97089 N99662 AW189078 A1 199076 AW151598 W59944 AA662875 W94022 AA299055 
A1039008 A1829449 AA583503 AI635674 AW131665 AI473820 AW273118 AW900930 AA908944 AI688035 A W1 70272 
d082545AW468176A!608761A!0a2748Al911682Al2^^ 

AI809817 AA905196 AI191245 AM70204 AJ188296 AW21367 A1125315 AI087141 AA629032 AA740589 A1554181 
AA150830 AI248541 AI077943 AA775958 AA864930 AC61476 AJ123121 A1310394 AA862331 AA872478 BE537084 
AI205606 AA720684 AI872093 AW150042 AL 120538 AA219627 AA988608 C21397 AI359337 H25337 AJ089749 
AA605146 AI359620 M150478 AI359738 AW383642 AW995424 AI766457 R56892 AI089839 W61343 N69107 W46459 
AA565955 N20527 AI279782 W46596 AA776573 H23204 AI866231 AI083995 N21530 AA126874 D82630 W65437 
AI086917 AW382095 AI086877 H69844 AW340217 W85827 UJ8439 AA262704 AA505380 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA096126 T28C31 AA491574 R84813 AA774536 AW383522 AA155615 AW383529 
AA491520AWQ28427AA171496AI469689 AW664539AI811102A1811116BE464590BE35O791 H78Q21 T15405H21979 
AA219489 H13301 AA505883 A1864305 AI423963 AW084401 RM963 R69858 H67097 A1917740 AI655561 H69864 
AA033631 AW383484 AI886261 H25293 AA513281 AW271187 H11617N79982AI174338A1904207 AI904208 BE614558 
W94127 W65436 A1272249 AA700018 AI57S932 AJ085941 AW152629 

AA334551 BE008229 AA307537 AW961156 AW995894 AW995826 NMJJ06751 M61199 AA045603 AL036372 AV645608 
A1686095 AW351901 AA101337 AA101345 N73342 BE0 18030 BE569044 AW841975 AA373388 BE090412 H95440 
N53845 R67867 AA093441 AA363427 H93708 AW023134 AW994986 AW994989 BE090429 R23614 AI567932 H03726 
H01 101 H01867 AA548743 AI671806 AW872949 AW872941 AA742447 AI199788 AA045604 AI637465 AT741796 
AW24221 7 AW131463 AI765302 AI683923 AA889762 AI804889 AI986437 C06049 BE502340 A1695651 AI491970 
AA496804 AA281008 AA665699 AI473814 BE301445 AA707837 AA551925 AI017348 AI20B185 AA775203 AA156296 
AA557463 H95441 AA768547 AW769358 AA991 197 AA181954 AI091389 AJ147289 AW771 837 AI638582 AA84441 1 
AJ37475G T29320 AW951272 AW085923 H02834 AA843259 AA814696 AW 183290 AA158453 N68125 N69039 AA100423 
AA101346 A1918720 HO1 102 R67868 H01 868 N66438 R4658G AI858433 AA599560 AA187577 AA157481 AA361520 
AUM7827 AA158452 R21688 AW964874 AA325161 R40871 AW752395 AW375924 R 13355 AA281 174 AA428908 
AA081258 AA160311 W17034 H83596 Z99393 AI831206 AW771 108 AW769214 N89775 AW161495 AW161522 
AW160880 Z99394 AI814820 

BE244667 BE241813 BE242271 AA381943 NM.016040 AF151858 AW967497 AW966873 AJ824386 AW470133 
AW015765 BE018650 AW503659 AI129838 A1632346 AA013O99 AW77051 1 BE219482 AI824135 AI867379 AA019348 
AA285143 AW087624 AI9901C0 AA251084 AJ633962 AA287714 AA400773 A12921 12 AW469095 AA743312 AW117423 
AA694551 AA885657 AA1 12675 BE327333 AA082161 H03613 AA094735 AW500235 N28878 AA287713 AW3O0233 
AA826249 K46921 BE348728 AW505056 AW966879 AI521202 AA393405 AC64668 AA910851 AA251721 AW70834 
H03503 AA089688 R58562 BE004728 AA668793 H27167 R54717 

W02363 N80298 AA304486 AW954799 AW805136 AW970817 AW373398 AW875459 AA136805 AA683501 N73299 
AW341082 A1632954 AA493369 AI478433 AI03791 1 AW272169 AW043832 AA010683 AW629090 AW183622 N64510 
AW079953 AJ554533 AA563670 AA010682 AW237610 AW419057 AI470926 AI627B33 AA195080 AA195179 AJ471443 
AW590266 AI1 68477 AW771 214 AI767341 AW340O86 AW748455 A1280079 AI244821 AI381283 AW300130 AW183374 
AW195397 AA136708 AI824598 AW573004 Z98448 AA905255 AI497883 

AW450979 AA136653 AA136656 AW4 19381 AA984358 AA492073 BE168945 M809054 AW238038 BE01 1212 BE011359 
BE011367 BE011368 BE011362 BE01 1215 BE01 1365 BE011363 

AA376654 W76367 AA318232 AI694545 AT742403 AI887383 AW204731 AW874431 BE220997 AA1 14979 AA303838 

A1002267 AW952031 W74801 AA011287 AA115112 AI306385 R37677 AW571707 R59986 W94102 AW197042 H 10206 

AW139819 AJ686172 A1674165 R51633 Ai367086 T23948 H10833 H23002 H1 1743 R37085 239208 H22794 H1 1820 

R13817 Z43122 H10257 R88398 R18795 AA010848 R67191 H10875 R67170 

AA179656 AA182626 AA162603 

R69751 R70467 K69771 H80879 H80878 

Z41572R39330 

AA402505AA398900 

X94703 NMJXM249 R52316 T87420 N46403 Z36855 BE076834 
AW602528BE073859 Z38412 
AA404418A1217248 
AA321 890 R1 8000 

AJ453076 AI376075 AI014836 AA628633 AA961066 AI150282 AI028574 A1217182 AA732910 AA431478 AL041229 

AA353093 AW957317 AW872498 AI560785 AI289110 AW135512 X97261 T68873 

AFO08937 

AI743860 N49543 AW027759 BE349467 AI656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 A161 1424 
AL079362 AI969290 AI928016 BE394912 BE504220 BE467505 AJ61 161 1 AI611407 AI611452 W56437 At284566 
A1583349 AW183058 AI308085 AI074952 AA437315 AA628161 AW301723 AI150224 AA400137 AA437279 A1223355 
AA639462 AI261373 AM32414 AI984994 A1539335 AA401550 AA358757 AI609976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055 A1580502 A1832944 A1038530 AA425107 AJ014986 A1148349 AW237721 AW779756 
AW137877 AI125293 AA400404 R28554 
AA065069AA085108 

AA069818 AA069971 AA069923 AA069908 
AA021473 
AA608588 
AA608751 
AA128980 
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125091 genbanJLT91518 T91518 

100964 entnsJ00212 J0O212 

102269 entre*_U30245 U30245 

125150 NOT_fOUND_entraOV38240 W38240 

123964 genban)LC13961 C13981 

118111 genbanlLN55493 N55493 

118129 genbankJ457493 N57493 

102491 efrtre^U51010 U51010 

1 18329 genb3nU*63520 N63520 

118475 gonbanX_N66845 N66845 

1 18581 genbanLK68905 N68905 

111514 gentenk_R07998 R07998 

104534 R223D3 jat R22303 

120340 g8nban»LAA206B28 AA206828 

120376 genbanlUVA227469 AA227469 

104787 gent>anMA027317 AA027317 

120409 genban)LM235O50 AA235050 

120745 genbank_AA302809 AA302809 

120809 genbanlLAA346495 AA345495 

120839 genbanLAA343913 AA346913 

113702 genbanK_T97307 T97307 

106596 304084J A1583948 AA578212AW303715 AA653450 AA456981 A1400385 W88533 AI224133 AW272145AA088686 R94698 

113947 genbanK_W84768W84768 

122562 genbanKJW452156 AA452156 

122635 genbanMA454085 AA454085 

108277 genbank_AA064859 AA064859 

108403 genbank>A075374 AA075374 

122860 genbanUW464414 AA464414 

108427 genbanUW076382 AA076382 

108439 genbank_AA078386 AA078986 

131353 231290.1 AW411259 H23555 AW015049 A1684275 AW015886 AW068953 AW014085 A1027260 R52686 AA918278 AI129462 

AA969360 N34869 A1948416 AA534205 AA702483 AA705292 
108533 genbanU\A084415 AA084415 
124254 genbank_H69899H69899 
101447 entraUd21305 M21305 
101458 entrezLM22092 M22092 

101667 13349J NM.005381 M60858 AW373732 AW373724 AW373689 AW373629 AW373609 AW373776 AA 187806 AW386946 

AW374207 T05235 AA216203 AW385556 AA306940 AA306526 AA315461 AL036757 AW373711 AW403124 AW403640 
AW377084 T27360 H62638 F06957 AW377051 AA554779 AA378568 AA096007 AW352407 AW302637 F07929 H17433 
AW382712 H05665 F07292 N39875 AA089729 H62556 N42842 R12952 AW373735 AW364155 AA056183 W39185 
AW382708 N32488AF 114098 AW375993 A1133569 W52561 AA603040 AA133710 A1928795 AW176370 AA827519 
AW338437 AA521142 T29341 AI800461 AW317002 AA703914 AA860830 AI859203 AM45772 AA714334 AI817066 
A1832027 AW510442 AI635B02 AW0883O6 AW068672 AW408555 AW467542 AA552657 AA152367 W32081 AA582124 
AA074040 AA931657 AW51154 AW410203 AI921644 H17434 AI832330 AW404836 AI925038 AA088423 AA954166 
AA580453 AW021292 AC67215 AW08QQ82 AW383778 AI933053 AJ919097 W31557 N90245 AA931591 AA553995 
F36352 AA056184 AA476294 AA641327 AA533550 AI749630 W58323 AA5691 19 AA508573 A180905O AI378998 
AA41 1362 AW407505 AA938104 AA074041 AA632876 AW193748 AA507873 AI270128 AI472365 AA41 1363 AI523216 
AI719965 AI816302 AA182681 AJ707990 AA133588 AI758537 W60253 AW60303 AA135423 A1083904 F04188 N89693 
AW408776 AJ678595 A1270568 AA722059 W58234 F33650 AA090547 AA285108 AA425981 N85079 D20218 AI273980 
AA159028 F03226 AW247914 N26918 AW272741 N90109 H05566 N23327 AW247953 R44748 AA962015 F03558 
AJ752394 AW409913 AW2483S8 AI816463 AI752393 AA325370 AA263089 AI570130 A1971951 A1160658 AI357360 
AW168686 AL121075 AW050536 N21872 W67748 AA514242 AI127386 H14607 AI185752 W79364 AA088520 AA152476 
AW351940 AW373683 AS40524 AW374953 T56500 N24329 A1940720 AW374933 AW374947 AW391913 AL138337 
AW376241 AW062943 F26666 AW410202 AW082958 F34529 AW381807 AW393315 W17147 AW176359 AA664576 
AW380424 AA306040 Af745674 AW300951 AI188579 AW38973 AI305271 AA433818 AA612807 AI831809 A194O409 
AA1 58663 AI572988 

124576 genbanK_N68201 N68201 

108931 genbanlLAA147186 AA147186 

108941 genbankJ\A1 46650 AA148650 

124720 144582J R05283R11056 

124793 genbanUM4519 R44519 

124799 genbankJM5088 R45088 

103136 entre?_X55965 X65965 

117683 genbanUM0180N4O18O 

124991 genhanJLT501 16 T501 16 

103432 entreOC97748 X97748 

119174 genbanU?71234R71234 

119239 95573_2 T11483T11472 

133678 • 1 1235J AW247252 AA346143 NM_000270 AA381085 N91995 X00737 AA381079 AA296473 AA296110 AA315735 AA31 1617 

AA326750 AA376804 AW403290 T95231 M13953 T47963 H82039 AA279899 AA627997 N76320 N99527 H37842 
W20095 AA457308 AW469547 AA724143 H83220 AA319496 W66334 W30892 R89169 R99427 N41854 H47286 
AA348094 AA045089 R63016 A1922219 AI024906 A1096488 AW85005 AA194872 N90489 A1452544 H72411 AA282427 
AA430735 R68963 R22453 H70385 AW129369 AW467320 AW519082 AA345018 AA582183 AI961789 R65918 N30611 
AI979189 AI280889 AW273191 R66531 AE85845 AI675927 AI421990 AW190879 H37794 AA699667 H68427 AA954388 
AI188757 AI140048 AA430382 AC04151 AW247864 AA559099 AI431420 AA548276 AI1 49466 AA772669 AA694388 
AA724168 AA301651 AA281952 AA779925 AA234760 W86290 AA91 3603 AW51 1745 AI500697 AA814922 AA835040 
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T47964 H53998 AA975804 RS8710 AI077604 N70252 R98084 AW250171 H69268 AJ597614 AA970746 AA972548 
AI3771 16 R62962 H16737 R89070 AA731329 R66532 N54354 AB 18832 H81944 M71567 T95122 W86463 AA437095 
A1431999 AS15724 N83851 AI674743 AA457307 AA211475 N64444 AI799146 H72853 R99335 H60413 AA770367 
AA156105 AJ269937 H64029 H89728 R65B19 AW470496 A1873318 A1735713 H82987 C02447 AJ478668 T27651 
5 A1599770 AW025156 H69719 AJS84717 hB9225 AI459356 AA953S77 AI424691 H13843R22404 AJ87379SA1336002 

N70898 AI420854 AA541792 AA346142 AI000614 AI828348 AA045090 T51257 N90434 H 13890 N731B4 AI708083 
AA781606 AA329050 AA339985 R68984 H6479S W04186 H16845 

119416 genbanKJ97186 T97188 

119558 NOT_F0UND ente^W38194 W38194 
10 119559 Narj=OUND_entra?_W38197 W38197 

119654 genbanlLW57759W57759 

121350 genbanfcjAA405237 AA405237 

121558 genbanUA412497 AA412497 

105985 genbanlLAA406610 AA406610 
15 114648 genbanU^101056 AA101056 

121895 genbanKJVA427396 AA427396 

100327 ento*J>55640 D55540 

123315 714071.1 AA496369 AA496646 

123473 genbanKJ*A599143 AA599143 
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TABLE 4: 

Pkey: Unique Eos probeset identifier number 

Accession: Accession number used for previous patent (Dings 
ExAcor Exemplar Accession number, Genbank accession number 
UnlgenelD: Unkjene number 
Unfcene Title.' Unigene gene tffle 



Pkey Accession ExAccn UniGene UnigeneTitie 



100405 
100420 
100481 
100484 
100718 
100991 
101097 
101168 
101194 
101261 
101345 
101447 
101485 
101543 
101550 
101560 
101674 
101714 
101741 
101838 
101857 
102012 
102024 
102164 
102241 
102283 
102303 
102564 
102663 
102759 
102778 
102804 
102887 
102898 
102915 
103036 
103037 
103095 
103158 
103166 
103185 
103280 
103554 
103850 
104465 
104592 
104764 
104786 
104850 
104865 
104894 
104952 
104974 
105178 
105263 
105330 
105376 
105729 
105826 
105977 
106008 
106031 
106124 



D86425 
D86983 

HG1098-HT1098 

HG1103-HT1103 

HG3342-HT3519 

J03764 

L06797 

L15388 

120971 

L35545 

176380 

M21305 

M24736 

M31166 

M31551 

M32334 

M61916 

M68874 

M74719 

M92934 

M94856 

U03057 

U03877 

U18300 

U27109 

U31384 

U33053 

U59423 

U70322 

U81607 

U83463 

U89942 

X04729 

X06256 

X07B20 

X54925 

X54936 

X60957 

X67235 

X67951 

X69910 

X79981 

Z18951 

AA187101 

N24990 

R81003 

AA025351 

AA027168 

AA040465 

AA045136 

AA054087 

AA071089 

AA085918 

AA187490 

AA227926 

AA234743 

AA238559 

AA292694 

AA398243 

AA406363 

AA411465 

AA412284 

AA423987 



AW291587 Hs.62733 
D86983 Hs.118893 
X70377 Hs.121489 
NM_005402Hs288757 
BE295928 Hs.75424 
J03836 Hs.82085 
BE245301 K&B9414 
NM_005308Hfc211569 
L20971 Hs.188 
D30857 Hs.82353 
NM.005795HS.152175 
M21305 

AA296520 Hs.89546 
M31166 HsJ2O50 
Y00630 Hs.75716 
AW958272 Hs34732B 
NM_002291Hs.82124 
M68874 Hs.211587 
NML003199HS.326198 
BE243845 Hs.75511 
BE550723 Hs.1 53179 
BE259035 Hs.118400 
AA301867 Hs.76224 
NM-000107H3.77602 
NMJW7351HS.268107 
AW161552 H&83381 
U33053 Hs.2499 
U59423 Hs.79067 
NM.002270HS.168075 
NMJW51Q0HS.788 
AF000652 Hs.8180 
NM_002318Hs.83354 
J03836 Hs-82085 
NM_002205Hs.149609 
X07820 Hs2258 
M13509 Ks.83169 
BE018302 Hs.2894 
NM.005424HS.78824 
BE242587 Hs.118651 
AA159248 Hs.180909 
NM.006825HS.74368 
U84722 Hs.76206 
AI878826 Hs.74034 
AA187101 Hs^13194 
Z44203 Hs.26418 
AW630488 Hs.25338 
AJ039243 Hs.278585 
AA027167 Hs.10031 
AL133035 Hs.8728 
T79340 Hs.22575 
AF065214 Hs.16858 
AW076098 Hs.345588 
Y12059 Hs.278675 
AA313825 Hs.21941 
AW388633 Hs.6682 
AW338625 Hs.22120 
AW994032 Hs.8768 
H46612 Hs.293815 
AA478756 Hs.194477 
AK001972 Hs.30822 
AB033888 Hs.8619 
X64116 Hs.171844 
Hs.7567 



rridogen 2 

Melanoma associated gene 
cystatin D 

v-ral simian teutemJa viral oncogene horn 
inhibitor of DMA binding 1 , dominant neg 
serine (or cysteine) proteinase mhfbltD 
chemoktne (C-X-C motif), receptor 4 {(us 
G protein-coupled receptor kinase 5 
phosphodiesterase 4B, cAMP-spedffc (dun 
protein C receptor, endothelial (EPCR) 
calcitonin receptor-like 
gbftuman alpha satellite and satellite 3 
se lectin E (endothelial adhesion mo tear! 
pentaxin-related gene, rapidly induced b 
serine (or cysteine) proteinase (nhibito 



laminin, beta 1 

phosphoDpase A2, group IVA (cytosoDc, 
transcription factor 4 
connective tissue growth factor 
fatty add binding protein 5 (psoriasis- 
singed (Drosophlla)-tike (sea urchin fas 
EGF-containlng fibuIin-Eke extraceEula 
damage-specific DMA binding protein 2 (4 
mtdSmerin 

guanine nucleotide binding protein 11 

protein kinase C-Bke 1 

MAD (mothers against decapentaptegic, Or 

karyopherin (importin) beta 2 

A kinase (PRKA) anchor protein (gravin) 

syndecan binding protein (synlenln) 

iysyf oxldase-like 2 

serine (or cysteine) proteinase inhbito 
Integrtn, alpha 5 (fibronecfin receptor, 
matrix metailoproteinase 10 (stromelysin 
matrix metailoproteinase 1 (interstitial 
placental growth factor, vascular en doth 
tyrosine kinase with immunoglobutin and 
hematopoietically expressed homeobox 
peraxiredoxin 1 

transmembrane protein (63kD), endopbsmi 
cadherin 5, type 2, VE-cadherin (vascula 
caveoHn 1, caveolae protein, 22kO 
hypothetical protein MGC10895 
ESTs 

protease, serine, 23 
ESTs 

WAA0955 protein 

hypothetical protein OKFZp434G171 
B-cell CLL/tymphorna 6, member B (zinc fi 
phosphoiipase A2, group IVC (cytosoBc, 
desmopbldn (DPI, DPU) 
brornodorrato-contalning 4 
AD036 protein 

solute carrier family 7, (cafJonic amino 
ESTs 

hypothetical protein FU10849 

Homo sapiens HSPC285 mRNA, partial cds 

E3ublquitinPgaseSMURF2 

hypothetical protein FU1 1 1 10 

SRY (sex determining region Y>box 18 

Homo sapiens cDNA: FU22296 fts, clone H 

Homo sapiens cDNAj FU21962 fis, clone H 



150 



WO 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



106155 
106302 
106423 
106793 
107174 
107216 
107295 
107385 
108756 
108846 
108888 
109001 
109166 
109456 
109768 
110107 
110906 
110984 
111006 
111018 
111133 
111760 
113073 
113195 
113923 
114521 
115061 
115096 
115145 
115819 
115947 
116314 
116339 
116430 
116589 
116733 
117023 
117186 
117563 
117997 
118475 
118581 
119073 
119155 
119174 
119221 
119416 
119866 
121335 
121381 
123160 
123473 
123523 
123533 
123964 
124006 
124315 
124659 



AA425309 

AA435896 

AA448238 

AA478778 

AA621714 

O510&9 

T34527 

U97519 

AA127221 

AA132983 

AA135606 

AA156125 

AA179845 

AA232645 

F10399 

H16772 

N39584 

N52006 

N53375 

N54067 

N&4436 

R26692 

T33637 

T57112 

W80763 

AA046808 

AA253217 

AA255991 

AA258138 

AM26573 

AA443793 

AA490588 

AA498257 

AA609717 

D59570 

F13787 

H88157 



124847 
124875 
125091 
. 125103 
125355 
125565 
125590 
423765 
126511 
100286 
126563 
126649 
449602 
126872 
456000 
414221 
127402 



N34287 

N52090 

N66845 

N68905 

R32894 

R61715 

R71234 

R98105 

T97186 

W80814 

AA404418 

AA405747 

AA488687 

AA599143 

AA608588 

AA608751 

C139B1 

D60302 

H94892 

N93521 

N95477 

R60044 

R70506 

T91518 

T95333 

R45630 

R20839 ' 

R23858 

R23858 

AI024874 

W26247 

W26247 

AA856990 

AA856990 

AA136653 

AA136653 

AA136653 

AA358869 



AA425414 Hs.33287 
AA398859 Hs.18397 
AB020722 Hs.16714 
H94997 Hs.16450 
BE 122762 Hs.25338 
051069 Rs.211579 
AA186629 Hs.80120 
NM_005397H3.16426 
AA127221 Hs.117037 
AL117452 Hs.44155 
AA135608 Hs. 189384 
A1056548 Hs.72116 
AA219691 Hs.73625 
AW956580 Hs.42699 
F06838 Hs.14763 
AW151660 Hs.31444 
AA035211 Hs.17404 
AW613287 Hs.80120 
8E387014 Hs.166146 
A1287912 Hs.3628 
AW560939 Hs.97199 
BE551929 Hs.268754 
N39342 Hs.103042 
H 83265 Hs.8881 
AW953484 Hs.3849 
AW139036 Hs.108957 
AI751438 Hs.41271 
AI683069 Hs.175319 
AA740907 Hs.88297 
AA486620 Hs.41135 
R47479 Hs.94761 
AI799104 Ha.178705 
AK000290 Hs.44033 
AK001531 Hs.66048 
AI557212 H3.17132 
AL157424 Hs.61289 
AW070211 Hs.102415 
H96988 Hs.42612 
AF055634 Hs.44553 
N52090 Hs.47420 
N66845 
N66905 

BE245360 Hs.279477 
R61715 Hs.310598 
R71234 

C14322 Hs.250700 
T97186 

AA496205 Hs.193700 
AA404418 

AW088642 Hs.97984 

AA488687 Hs.284235 

AA599143 

AA608588 

AA608751 

C13961 

AI147155 Hs.270016 
NMJW5402HS.288757 
AI680737 Hs.289068 



Hs.102943 
Hs.304177 
Hs.285814 



AI571594 
W07701 
AS887664 
T91518 

AA570056 Hs.122730 
R60547 Hs.170098 
R20840 

R23858 Hs.143375 
R23858 Hs.143375 
T92143 Hs.57958 
BE247550 Hs.86859 
AA516391 Hs.181368 
AA001860 Hs.279531 
AA001860 Hs.279531 
AW450979 
BE180876 Hs.11614 
AW450979 

AA358869 Hs.227949 



nuclear factor l/B 

hypothetical protein RJ23221 

Rho guanine exchange {actor (GEF) 15 

ESTs 

ESTs 

melanoma cell adhesion molecule 

UDP-N^tytel^ia^alactosamiiiD:polyp 

podocalyxln-like 

ESTs 

DKFZP586G1517protBin 

gb:zH0aO5.s1 Scares jjregnanLutems.NbH 

hypothetical protein FU20992 similar to 

RAB6 interacting, (dne^n-Uke (rabkines 

ESTs 

ESTs 

ESTs 

ESTs 

UDP-^^eoety^Ipra^alado3amIne:polyp 
Homer, neuronal Immediate early gene, 3 
mitogen .activated protein kinase kinase 
complement component C1q receptor 
Homo sapiens cONA RJ1 1949 Qs, clone HE 
mJcro tubule-associated protein IB 
ESTs, Weakly simSar to S41044 cruomosom 
hypothetical protein FU22041 similar to 
40S ribosomal protein S27 feofbrm 
Homo sapiens mRNA full length Insert cON 
ESTs 
ESTs 

endomucln-2 
KIAA1691 protein 

Homo sapiens cONA FU1 1333 fis, done PL 

dtoapfidyt peptidase 8 

hypolheticaj protein FU10669 

ESTs, Moderately similar to 154374 gene 

synaptojanln 2 

Homo sapiens mRNA; cONA DKFZp586N0121 (f 
ESTs, Weakly similar to ALU1JIUMAN ALU S 
unc5 (C.elegans hornolog) c 
EST 

gb:za46c1 1 .si Scares total Over spleen 
gb:za69b09.s1 Scares Jetat_lung_NbHL1 9W 
ESTs 

ESTs, Moderately similar to ALU1_HUMAN A 
gb:yl54c08.s1 Scares placenta Nb2HP Homo 
tryptase beta 1 

gb:ye50h09.s1 Scares total Over spleen 
Homo sapiens mRNA; cONA DKFZp586l0324 (f 
gb;zw37eQ2-s1 Soares_tDtal_fetu5_Nb2HF8_ 
hypothetical protein FU 22252 similar to 
ESTs, Weakly similar to 138022 hypotheb* 
gb:ae52d04.s1 Stratagene iung carcinoma 
gb;ae54e06.s1 Stratagene lung carcinoma 
gb:ae5Sn07.s1 Stratagene Eung carcinoma 
gb:C13961 Clontech human aorta polyA+ mR 
ESTs 

v-ral simian leukemia viral oncogene horn 
Homo sapiens cDNAFUl 1918 lb, clone HE < 
hypothetical protein MGC12916 
Homo sapiens clone FLB8503 PR02286 mRNA, 
sprouty (DrosophiSa) hornolog 4 
gb.ye20fD5.s1 Stratagene lung (937210) H 
ESTs, Moderately similar to KIAA1215 pro 
WAA0372 gene product 
gb:yg05cO8rl Scares Infant brain 1N16 H 
Homo sapiens, clone IMAGE:3840937, mRNA, 
Homo sapiens, clone IMAG&3840937, mRNA, 
EGF-TM7-latrophi5n-f elated protein 
growth factor receptor-bound protein 7 
U5 snRNP^specflic protein (220 kD), orih 
ESTs 
ESTs 

gblIKH-Bi3^la-a-12-0-Ut^1 NCI_CGAP_Su 
HSPC065 protein 

gb:Ul-H*l3-ala*12-04JU1 NCI_CGAP_Su 
SEC13(S.cerevisiae)-Gke1 



151 



WO 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 
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127651 
424608 
128062 
128992 
129046 
129188 
129314 
129371 
129468 
129765 
129805 
129884 
130495 
130639 
130657 
130828 
130972 
131080 
131137 
131182 
131486 
131573 
131647 
131756 
131859 
131881 
132050 
132083 
132164 
132358 
132413 
132456 
132490 
132676 
132687 
132718 
132736 
132760 
132933 
132968 
132994 
133061 
133147 
133161 
133200 
133260 
133363 
133491 
133517 
133550 
133607 
133614 
133627 



AI123976 

AI123976 

AA379500 

R49693 

AA195678 

M30257 

AA028131 

M10321 

J03040 

M86933 

AA012933 

AA286710 

AA243278 

059711 

T94452 

AA053400 

AA370302 

J0500S 

U85193 

AA256153 

X83107 

AA046593 

AA410480 

D45304 

M90657 

AA010163 

AA136353 

Y07867 

U84573 

X60486 

AA132969 

AA114250 

F13782 

AA283035 

AB002301 

AA056731 

U68019 



133696 
133913 
133975 
133985 
134039 
134088 
134161 
134299 
134416 
116470 
134656 
134989 
135051 
135073 
135349 
100114 
100130 
100143 
100168 
100208 
100224 
100405 



AA598702 

N77151 

AA505133 

AB000584 

D12763 

AA253193 

AA432248 

AA083572 

AA479713 

L40395 

X52947 

W80846 

M34539 

D67029 

U09587 

M85289 

D10522 

W84712 

D29992 

134657 

S78569 

D43636 

U97188 

AA487558 

M28882 

X70683 

X14787 

AA236324 

C15324 

AA452000 

083174 

D00596 

011428 

D13640 

014874 

026129 

028476 

086425 



AA382523 Hs. 105689 
AA3S2523 Hs. 105689 
AA379821 Hs,1Q5547 
H04150 Hs.107708 
AB029290 Hs.108258 
NM.001078HS.109225 
8E622763 Hs.290356 
X06828 Hs.110802 
AW410538 Hs. 111779 
M86933 Hs.1238 
AA012848 H3.12570 
AF055581 Ha.13131 
AW250380 HS.109059 
AI557212 Hs.17132 
AW337575 Hs.201591 
AW631469 Hs.203213 
D81866 Hs.21739 
NMJM1955HS.2271 
W27392 Hs.33287 
A1824144 Hs.23912 
F06972 Hs.27372 
AA040311 Ks.28959 
AA359615 Hs.30089 
AA443966 Hs.31595 
AW960564 
AW361018 H3.3383 
A1267615 Ks.38022 
BE386490 Hs.279663 
AI752235 Hs.41270 
NM_003542H3.46423 
AW361383 Hs.260116 
A6011084 Hs.48924 
NM_001290Hs.4980 
N92589 Hs.261038 
AB002301 Hs.54985 
NM_004600Hs.554 
AW081883 Hs.211578 
AA125985 H3.56145 
BE263252 Ha.6101 
AF234532 Hs.61638 
AA1 12748 Ks.279905 
AI186431 Hs.296638 
AA026533 Hs.66 
AW021103 Hs.6531 
AB037715 Hs.183639 
AA403045 Hs.6906 
A1866286 Hs.71962 
BE619053 Hs.170001 
NW.000165HS.74471 
A1129903 HS74669 
BE273749 

NMJJ03003H3.75232 
NM_0O2047Hs.7528O 
M85289 Hs.211573 
A1878921 H3.75607 
AU076964 Hs.7753 
C18356 Hs.295944 
L34657 Hs.78148 
NMJW2290HsJ8672 
AI379954 Hs.79025 
AA634543 Hs.79440 
AW580939 Hs.97199 
X68264 Hs.211579 
A1272141 H3.83484 
AI750878 Hs.87409 
AW968058 Hs.92381 
A1272141 H3.834S4 
W55956 Hs.94030 
AA114212 Hs.9930 
X02308 Ks.82962 
NM_000304H3.103724 
AU076465 Hs.278441 
H73444 Hs394 
NMJW2933HS.78224 
AL121516 Hs.138617 
AW291587 Hs.82733 



MSTP031 f 
MSTP031f 
neural profiferatfon, differentiation an 
ESTs 

acun Dtnaing protein, macropnin (mtuui 
vascular call adhesion molecule 1 
mesoderm development candidate 1 
von WDIebrand factor 
secreted protein, acidic, cysteine-ricn 
arnelogenii (Y chromosome) 
tubuHn-spedfte chaperone d 



mitochondrial dbosomaJ protein L12 
ESTs, Moderately similar to 154374 gene 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586H518 (f 
endotneCn 1 
nuclear factor t/B 
ESTs 

BMXriorwecept3f tyrosine kinase 

ESTs 

ESTs 

ESTs 

transmembrane 4 superfamfly member 1 
upstream regulatory element binding prot 
ESTs 
Pain 

procollagen-lysine, 2-oxoglutarate 5-dto 
H4 hfetone family, member G 
metalloprotease 1 (plmTysln family) 
WAA0512 gene product ALEX2 
UM domain binding 2 
ESTs. Weakly similar to I36022 hypothetl 
KIAA0303 protein 

Sjogren syndrome antigen A2 {60kO, ribon 
Homo sapiens cDNA: FU23Q37 fis, clone L 
thymosin, beta, Identified In neuroblast 
hypothetical protein MGC3178 
myosin X 

clone HQD310PRO0310p1 
prostate differentiation factor 
Interieukin 1 receptor-like 1 
hypothetical protein FL120373 
hypothetical protein FU10210 
Homo sapiens cDNA FIJ231 97 (is, clone R 
ESTs, Weakly similar to B36298 proflne-r 
eukaryoUc translation initiation factor 
gap junction protein, alpha 1, 43 kD (con 
vesicle-associated membrane protein 5 (m 
FK50&-bindbg protein 1A (12kD) 
SEC14 (S. cerevlsiaeKiKe 1 
glycyt-tRNA synthetase 
heparan sulfate proteoglycan 2 (periecan 
myiistoyiated alanine-rich prutein Unas 
catumenln 

tissue factor pathway inhWtor 2 ' 
piatetet/endothelial cell adhesion motec 
laminln, atpha 4 
KIAA0096 protBln 
IGF-II mRNA-Wndlng protein 3 
complement component C1q receptor 
melanoma ceD adhesion motecuJe 
SRY (sex det ermining region Y>box 4 
thrombospondln 1 

nudlx (nucleoside diphosphate linked mol 
SRY (sex determining region Y>oox 4 
Homo sapiens mRNA; cONA DKFZp586E 1 624 (f 
serine (or cysteine) proteinase Inhlblto 
thymJdylate synthetase 
peripheral myelin protein 22 ° 
WAA0015gene product 
adFenomeduflln 

ribonudease, RNase A family, 1 (pancrea 
thyroid hormone receptor inleractor 12 
nldogen2 
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100420 
100455 
100529 
100618 
100619 
100658 
100676 
100718 
100752 
100828 
100850 
100991 
101097 
101110 
101142 
101156 
101168 
101184 
101192 
101317 
101336 
101345 
101400 
101475 
101485 
101496 
101505 
101543 
101557 
101560 
101587 
101592 
101633 
101634 
101667 
101682 
101714 
101720 
101741 
101744 
101793 
101837 
101838 
101840 
101857 
101864 
101931 
101966 
102012 
102013 
102024 
102059 
102121 
102283 
102300 
102378 
102395 
102460 
102491 
102499 
102523 
102560 
102564 
102589 
102600 
102645 



D87953 

HG1862-HT1897 

HG2614-HT2710 

HG2639-HT2735 

HG2855-HT2995 

HG3044-HT3742 

HG3342-HT3519 

HG3543-HT3739 

HG4069-HT4339 

HG417-HT417 

J03764 

L06797 

L08246 

L12711 

L13977 

115388 

119871 

L20859 

L42176 

L49169 

L76380 

M15990 

M23254 

M24736 

M26576 

M27396 

M31166 

M31994 

M32334 

M35878 

M36429 

M57730 

M57731 

M60858 

M62994 

M68874 

M69043 

M74719 

M75126 

M84349 



102709 
102759 
102804 
102882 
102907 
102915 
102927 
102960 



M92934 
M93056 
M94856 
M95767 
S76965 
S81914 
U03057 
U03100 
U03877 
U08021 
U14391 
U31384 
U32944 
U40369 
U41767 
U48959 
U51010 
U51478 
U53445 
U59269 
U59423 
U62015 
U63825 
U67983 
U73379 
U73824 
U77604 
U81607 



X04412 
X06985 
X07820 
X12876 
X15729 



Hs.118693 
AW8BB941 HsJ57B9 
BE313693 Hs.334330 
AI752163 Hs.1 14599 
N24433 Hs^41567 
U56725 Hs.180414 
X02761 Hs.287820 
BE295928 Hs.75424 
TB1309 

AL046753 Hs.303649 
AA836472 Hs.297939 
J03336 Hs.82085 
BE245301 Hs-89414 
AJ439011 Hs.86386 
L12711 Hs.89643 
AA340987 Hs.75693 
NM_005308Hs711569 
NM.001674Hs.460 
BE247295 Hs.78452 
L42176 Hs.8302 
NM.006732H3.75678 
NM_0Q5795Hs.152175 
M15990 Hs.194148 
BE410405 Hs.76288 
AA296520 H*89546 
X12784 Hs.1 19129 
AA307680 Hs.75692 
M31166 Hs.2050 
BE293116 Hs.76392 
AW958272 Hs.347326 
AJ752416 Hs.77326 
AF064853 Hs.91299 
NM_004428Hs.1624 
AV650262 Hs.75765 
NM_005381 
AF043045 Hs^1008 
M68874 Hs.211587 
M69043 HsJ1328 
NMJ»3199H*326198 
A1679352 Hs.1 18625 
W01076 Hs.278573 
M92643 Hs.343586 
BE243845 Hs.75511 
AA236291 Hs.183583 
BE550723 Hs.153179 
BE392588 Hs.75777 
NM_006823Hs.75209 
X96438 Hs.76095 
BE259035 Hs.118400 
BE616287 Hs.178452 
AA301867 Hs76224 
A1752666 Hs.76669 
NM-004998HS.82251 
AW161552 Hs.83381 
AJ929721 Hs.5120 
AU076887 H&28491 
AU077005 Hs.92208 
U48959 Hs.211582 
U51010 

BE243877 Hs.76941 
U53445 Hs.15432 
R97457 Hs.63984 
U59423 Hs.79067 
AU076728 Hs.8867 
AJ984144 Hs.66713 
AL119566 Hs.6721 
NM.007019H3.93002 
AA532780 Hs.183684 
AA122237 Hs.81874 
NW.005100Hs.788 
NM_002318Hs.83354 
AI767736 Hs.290070 
BE409861 H3.202833 
X07820 Hs.2258 
BE512730 Hs.65114 
A1904738 Hs.76053 



Melanoma associated gene 
N-myc downstream regulated 
calmodulin 2 (phosprtorytase kinase, dett 
collagen, type VIII, alpha 1 
RNA binding motif, single stranded Inter 
heat shock 70kD protein 2 
fibranecfin 1 

lnr^kyofDNAbWing1,(tomhantneg 
tosulln-tlke growth factor 2 (somatomedl 
smaD inducible cytokine A2 (monocyte ch 
cathepsinB 

serine (or cysteine) proteinase InhMo 
chemoldns (C-X-C mofif), receptor 4 (fus 
myeloid ceO leukemia sequence 1 (BCL2-r 
transketotase (Wernicke-Korsakoff syndro 
pror/tcanxwypepfidase (anglotensinase C 
G protein-coupled receptor kinase 5 
activating transcription factor 3 
solute carrier family 20 (phosphate tran 
four and a half UM domains 2 
FBJ murine osteosarcoma viral oncogene h 
calcitonin receptor-Gke 
v-yes-1 Yamaguchi sarcoma viral oncogene 
calpaln 2, (m/ll) large subunit 
selectfn E (endothelial adhesion motecul 
collagen, type IV, alpha 1 
asparagine synthetase 
pentaxiwelated gene, rapidly Induced b 
aldehyde dehydrogenase 1 family, member 
tnterceflutar adhesion molecule 2 
insulin-like growth factor binding prate 
guanine nucleotide binding protein (G pr 
ephrin-A1 
GR02 oncogene 



fifcamin B, beta (actbvblndlng proteh-2 
prttsphoflpase A2, group I VA (cytosolic, 
nuclear factor of kappa Dghtpolypeptid 
transcription factor 4 
hexoklnasel 

C059 antigen p18-20 (antigen identified 
zinc finger protein homologous to Zfp-36 
connective tissue growth factor 
serine (or cysteine) proteinase Inhrbfto 
fatty acid binding protein 5 (psoriasis- 
transgelin 

protein kinase (cAMP-dependent, cataiyti 
immediate early response 3 
singed (DrosophilaHke (sea urchin fas 
catenin (cadherin-assoclated protein), a 
EGF-conteinlng fibufin-Gke extraceHula 
nicotinamide N-metfiyi trans (erase 
myosin IE 

guanine nucleotide binding protein 1 1 
dyneln, cytoplasmic, tight polypeptide 
spermidine/spermine N1-acetyUrareferase 
a disintegrin and metaJloproteinase doma 
myosin, light polypeptide kinase 
gfcHuman nboflnamide N-methyttransferas 
ATPase, Na+/K+ transporting, beta 3 poly 
downregulated in ovarian cancer 1 
cadherin 13, H-cadhertn (heart) 
MAD (mothers against decapen taptegte, Or 
cysteine-rtch, angiogenic Inducer, 61 
hepatitis delta antigen-mteracting prof 
lysosomal 

ubiquBin carrier protein E2-C 
eukaryofic translation initiation factor 
microsomal glutathione S-bansferase 2 
A kinase (PRKA) anchor protein (g ravin) 
lysyt oxWase-Eke 2 
gelsoCn (arnyWdosis, Finnish type) 
heme oxygenase (decyding) 1 
matrix metaDoprotelnase 10 (stromelysin 
keratin 18 

DEADAi (Asp-Glu-Ala-Asp/Hls) box pofypep 
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103011 
103020 
103029 
103036 
103056 
103080 
103095 
103138 
103176 
103195 
103347 
103371 
103432 
103471 
103967 
104447 
104764 
104783 
104798 
104865 
104877 
104894 
104952 
105113 
105178 
105196 
105215 
105263 
105271 
105330 
105461 
105492 
105493 
105594 
105727 
105732 
105767 
105882 
105936 
106031 
106124 
106222 
106241 
106263 
106264 
106366 
106454 
106634 
106724 
106793 
106799 
106842 
106868 
106890 
106961 
106974 
107030 
107061 
107086 
107216 
107385 
107444 
107985 
108507 



109001 
109195 
109390 
109456 
109737 
110411 
110660 
110906 
111018 
111091 



X52541 

X53416 

X54489 

X54925 

X57206 

X59798 

X60957 

X65965 

X69111 

X70940 

X87838 

X91247 

X97748 

Y00815 

AA303711 

L44538 

AA025351 

AA027050 

AA029462 

AA045136 

AA047437 

AA054087 

AA071089 

M156450 

AA187490 

AA195031 

AA205724 

AA227926 

AA227986 

AA234743 

AA253216 

AA256210 

AA256266 

AA279397 

AA292379 

AA292717 

AA346551 

AA400292 

AA404338 

AA412284 

AA423987 

AA428594 

AA430108 

AA431462 

M431470 

AA443756 

AA449479 

AA459916 

AA465226 

AA478778 

AA479037 

AA482597 

AA487561 

AA489245 

AA504110 

AA5209B9 

AA599434 

AA608649 

AA609519 

D51069 

U97519 

W28391 

AA035638 

AA083514 

AA121315 

AA147186 

AA156125 

AA188932 

AA219653 

AA232645 

F10078 

H48032 

HB2117 

N39584 

N54067 



AJ243425 Hs.326035 
X53416 Hs.195464 
AW800726 Hs.789 
M13509 Hs£3169 
Y18024 Hs.78877 
AU077231 Hs^2932 
NMJ)05424Hs,78824 



AU021154 Hs.76884 
AA351647 Hs^642 
AU077309 Hs.171271 
X91247 H5.13046 
X97748 

Y00815 Hs.75216 
AL120051 Hs.144700 
AW204145 Hs. 156044 
AI039243 Hs.278585 
AA533513 Hs.93659 
AW952619 Hs.17235 
T79340 Hs^2575 
AI138635 Hs.22968 
AF065214 Hs.18858 
AWD76098 HS.34558B 
AB037816 Hs.8982 
AA313825 H3.21941 
W84893 Hs.9305 
AA205759 Hs.10119 
AW388633 Hs.6682 
AA807881 Hs.25329 
AW338625 Hs.22120 
BE539071 Hs.69388 
AI805717 Hs.289112 
AL047586 H3.10283 
AB024334 Hs.25001 
AL135159 Hs.20340 
AW504170 Hs.274344 
AW370946 Ks.23457 
W46802 Hs.81988 
AI676765 Hs21812 
X64116 Hs.171844 
H93366 Hs.7567 
AA356392 Hs.21321 
BE019681 Hs.6019 
W21493 Hs.28329 
AL046859 Hs.3407 
AA186715 Hs.336429 
NM_014038HsJ216 
W25491 Hs.288909 
N48670 Hs.28631 
H94997 Hs.16450 
BE313412 Hs.7961 
AF124251 Hs^6054 
BE185536 Hs.301183 
AA489245 Hs.88500 
AW243614 Hs.18063 
A1817130 Hs.9195 
AL117424 H3.25035 
BE147611 K3.6354 
NM.012331HS.26458 
D51069 Hs.211579 
NMJ505397HS.16426 
W28391 Hs.343258 
T40064 Hs.71968 
A 1554545 Ks.68301 
AB029000 Hs.70823 
AA147186 

AW56548 Hs.72116 
AF047033 Hs.132904 
AW007485 Hs.87125 
AW956580 Hs.42699 
AA055415 Hs.13233 
AW001579 Hs.9645 
AA782114 Hs.28043 
AA035211 Hs.17404 
AI287912 Hs.3628 
AA300067 Hs.33032 



earty growth response 1 

Sarrtin A, alpha (acfei-blndirg protaln- 

GROI oncogene (melanoma growth stimidati 

matrix metaltoproteinass 1 (Intsrsfitial 

inositol 1 ,4,5-tfsphosphate 3-khase B 

cycOn 01 {PRAD1: parathyroid adenomatos 

tyrosine kinase with frnntmoglobulin and 

ghHsaptens SOD-2 gene for manganese su 

Inhibitor of ONA binding 3, dominant neg 

eukaryoUc translation elongation factor 

catenin (cadherin-associated protein), b 

thtoredoxfri reductase 1 

gb:H.sapfens PTX3 gene promotor region. 

protein tyrosine phosphatase, receptor t 

ephrtn-Bt 

ESTs 

ESTs 

protein disulfide bomerase related prot 

Homo sapiens done TCCCIA00176 mRNA sequ 

B-cefl Cll/lymphoma 6, member B (zinc fi 

Homo sapiens done (MAGE:451939, mRNA se 

phosphofipase A2, group (VC (cytosoflc, 

desmoptakin (DPI, DPI!) 

Homo sapiens, done IMAG£:350S202, mRNA, 

AD036 protein 

angiotensin receptor-Eke 1 

hypothetical protein FU14957 

solute earner family 7, (caDonic amino 

ESTs 

ESTs 

hypothetical protein RJ20505 

CGM3 protein 

RNA binding motif protein 8B 

tyrosine 3wnonooxygenase/lryptophan 5 -mo 

K1AA1 002 protein 

hypofhetical protein MGC12942 

ESTs 

disabled (Drosophita) homotog 2 (milogen 
ESTs 

Homo sapiens cONA: FU22296 fls. done H 

Homo sapiens cDNA: FU21962 fis. done H 

Homo sapiens clone FLB9213 PR02474 mRNA, 

Homo sapiens cONA: FU21288 fis, done C 

hypothetical protein FU 14005 

protein kinase (cAMP-dependertt, catalyti 

R1KEN cDNA 913Q422N19 gene 

HSPC028 protein 

hypothetical protein FU22471 

Homo sapiens cDNA: FU22141 fis, clone H 

ESTs 

Homo sapiens done 25012 mRNA sequence 
novel SH2-contalning protein 3 
motecute possessing ankyrfn repeats Indu 
mitc^en-activated protein kinase 8 inter 
Homo sapiens cDNA FU 10768 fis, done NT 
Homo sapiens cONA FU13698 fa, done PL 
chloride intracellular channel 4 
stromal cell derived factor receptor 1 
methionine sulfoxide reductase A 
melanoma cell adhesion motecute 
podrxalyxto-flke 

prorrferalton-assoclated 2G4, 38kD 

Homo sapiens mRNA; cONA DKFZp564F053 (fr 

ESTs 

KIAA1077 protein 

gb2o38d01 al Stratagene endothelial eel 
hypothetical protein FU20992 slmOar to 
solute canter family 4, sodium bkarbon 
EH^tomain cofUalning 3 
ESTs 

ESTs, Moderately similar to A47582 B-cel 
Homo sapiens mRNA for WAA1741 protein, 
ESTs 
ESTs 

mltogen^ctfvated protein kinase kinase 
hypothetical protein DKFZp434N185 
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111356 N90933 
111378 N93764 
111741 R26124 
111769 R27957 
112318 R55470 
112951 T16550 
113057 T26674 
113195 T57112 
113490 T88700 
113542 T90527 
113803 W42789 
113847 W60002 
113910 W78175 
113947 W84768 
114047 W94427 
115061 AA253217 
115819 AA426573 
115870 AA432374 
115964 AA446622 
116228 AA478771 
116264 AA482594 
116314 AM90588 
116589 059570 
117023 H88157 
117112 H94648 
117156 H97538 
117176 H98670 
117280 N22107 
119559 W38197 
119866 W80814 
120655 AA287347 
121314 AA402799 
121335 AA404418 
121822 AA425107 
121835 AA425435 
122331 AA442872 
122577 AA452860 
123160 AA488687 
123486 AA599674 
124059 F13673 
124339 H99093 
124358 N22495 
124364 N23031 • 
124726 R15740 
124763 R39810 
125167 W45560 
125304 Z39833 
125307 Z40583 
125329 AA825437 
107985 R66613 
125598 R66613 
125609 AA868063 
116024 AA 128075 
418000 AA128075 
126399 AA128075 
127435 N66570 
127566 A1051390 
127619 AA627122 
434190 AA627122 
128453 X02761 
128495 AF010193 
128515 AA149044 
128580 U82108 
128623 078676 
128642 L35240 
128669 AA598737 
128903 R69417 
128914 AA232837 
129087 N72695 
129188 M30257 
129226 M96843 
129265 X68277 
129345 AA292440 
129468 J03040 
129488 AA228107 
101838 AA449789 



BE301871 Hs.4867 
AW160993 Ks.326292 
AB020653 Hs.24024 
AW629414 Hs.24230 
AW083384 Hs.11067 
AA307634 Hs.6650 
AW194301 Hs.339283 
H83265 Hs.8881 
BE178110 Hs.173374 
H43374 Hs7890 
AW880709 Hs^83683 
NM.005032Ha.4114 
AA113262 Hs.17901 
W84768 

AL035858 Hs.3807 
AI751438 Hs.41271 
AA486620 Hs.41135 
NM_005985Hs.4fl029 
AA987568 Hs.74313 
AI767947 Hs.50841 
051174 Hs272239 
AJ799104 Hs.178705 
A1557212 Hs.17132 
AW070211 Hs.102415 
AW969999 Hs.293658 
W73853 

H45100 Hs.49753 
M18217 Hs.172129 
W38197 

AA496205 Hs.193700 
AA305599 Hs.238205 
W07343 Hs.182538 
AA404418 
AI743860 

AB033030 HsJ00670 
AL133437 Ha.1 10771 
AA829725 H&334437 
AA488687 Hs.284235 
BE019072 Hs.334802 
BE387335 Hs2B3713 
H99093 H&343411 
AW070211 Hs.102415 
AF265555 r&250646 
NM_003654Hs.104576 
B£4 10405 Hs.76288 
AL 137540 Hs.102541 
AL359573 Hs.124940 
AW580945 Hs.330466 
AA825437 Hs.58875 
T40O64 H371968 
T40064 Hs.71968 
AA868063 Hs.104576 
AA088767 Hs.83883 
AA932794 Hs.83147 
AA088767 Hs.83883 
X69086 Hs.286161 
A1051390 Hs.116731 
AA627122 Hs.163787 
AA627122 Hs.163787 
X02761 Hs.287820 
NM_OO5904Hs. 100602 
BE395085 Hs.10086 
U82108 Hs.101813 
BE076608 Hs.105509 
Z28913 Hs.102948 
W28493 Hs.180414 
AW150717 Hs345728 
AW867491 Hs.107125 
A 1348027 Hs.108557 
NM_OG1078Hs.109225 
BE222494 Hs.180919 
AA530892 Hs. 171 695 
R22497 Hs.1 10571 
AW410538 Hs.111779 
AW966728 Hs.54642 
BE243845 Hs.75511 



mannosyl (alpha-l^lycoprotein beia- 
hypotheifcai gene DKFZp434A1 1 14 
K1AA0846 protein 
ESTs 

ESTs, HlgWy 8toi8artoT46395hypothea 
vacuolar protein sorting 45B (yeast homo 
Human DNA sequence from done RP1-187J11 
ESTs, WeaWy similar to S41044 chromosom 
Homo sapiens cDNA FU10500 fls, done NT 
Homo sapiens mRNA for WAA 1671 p 
chromosome 8 open reading trams 4 
plasfln 3 (T Isoform) 
Homo sapiens, done IMAGE:393701S. mRNA. 
gb2h53d03a1 Soares_fetaUver_spben_ 
FXYD domah-contalnlng bn transport reg 
Homo sapiens mRNA full length Insert cDN 
endomudn-2 

sna9 1 (drosopnfla homofog), zinc flnge 

KIAA1265 protein 

ESTs 



Homo sapiens cDNA RJ11333 lis, done PL 
ESTs, Moderately similar to 154374 gene 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs 
ESTs 

uveal autoantigen with coiled coil domai 
Homo sapiens cDNA: FU21409 fis, clone C 
Empirically selected from AFFX single pr 
Homo sapiens mRNA; cDNA DKF2p58SI0324 (i 
hypothetical protein PRO2013 
phospholipid scramblase 4 
gb:zw37e02.8l Soaies_totaUetusJlb2HF8_ 
melallothionein 1 E (funcfional) 
K1AA1204 protein 

Homo sapiens cDNA: FU21904 fls, done H 
hypothetical protein MGC4248 
ESTs, Weakly similar to 138022 hypothetl 
Homo sapiens cDNA FU14680 fis, done NT 
ESTs, Weakly slmiar to S64054 hypothetl 
DEAD/H (Asp^lu-AI&AspMs) box polypep 
Homo sapiens mRNA; cDNA OKF2p5B6N0121 (f 
bacutovtral tAP repeat-containing 6 
carbohydrate (keratan sulfate Gal-6) sul 
caipatn 2, (m/ll) large subunft 
netrin4 

GTP-blndlng protein 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA OKF2p564F053 {fr 
Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
carbohydrate (keratan sulfate GaW) sul 
transmembrane, prostate androgen induced 
guanine nucleotide binding protein-tike 
transmembrane, prostate androgen Induced 
Homo sapiens cONA FU13613 fls, done PL 
ESTs 
ESTs 
ESTs 

fibronedin 1 

MAD (mothers against decapentaplegte. Or 
type i transmembrane protein Fn14 
solute carrier family 9 (sodium/hydrogen 
CTUgene 

enigma (UM domain protein) 
heat shock 70kD protein 8 
STAT Induced STAT Inhibitor 3 
plasmalemma vesicle associated protein 
hypothetical protekiPP1057 
vascular ceil adhesion molecule 1 
Inhibitor of DNA binding 2, dominant nag 
dual specificity phosphatase 1 
growth arrest and DrW^amage^ndudUe, 
secreted protein, acidic, cysteine^ 
methionine adenosyltransferase II, beta 
connective tissue growth factor 
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413731 
129557 
129619 
129627 
129762 
129884 
130018 
130147 
130178 
130262 
130431 
130495 
130553 
130638 
130639 
130657 
130686 
130776 
130818 
130840 
130899 
131002 
131080 
131084 
131091 
131107 
131182 
131207 
131319 
131328 
131509 
131555 
131564 
131573 
131692 
131756 
131859 
131909 
131915 
132046 
132050 
132151 
132164 
132187 
132303 
132314 
132358 
132398 
132421 
132490 
132520 
132546 
132610 
132716 
132840 
132883 
132968 
132989 



133071 
133076 
133099 
133147 
133149 
133161 
133200 
133220 
133260 
133295 
133349 
133391 
133398 
133436 
133454 
133478 
133491 



AA449789 

W01387 

AA610116 

AA258308 

AA460273 

AA286710 

T68873 

D63476 

M62403 

X55740 

L10284 

AA243278 

AA430032 

H16402 

D59711 

T94452 

M431571 

R79356 

AA280375 

Z49269 

Z41740 

AA121543 

J05008 

AA101678 

T35341 

N87590 

AA256153 

W74533 

U25997 

V01512 

X56681 

AA161292 

AA491465 

AA046593 

D50914 

045304 

M90657 

W69127 

AA316186 

AA384503 

AA136353 

AA044755 

U84573 

AA058911 

AA620962 

AA285290 

X60486 

R31641 

AA469190 

F13782 

AA257993 

M24283 

AA443114 

T352S9 

N23817 

AA047151 

N77151 

AA480074 

Y00787 

T99789 

W84341 

L09209 

D12763 

T16484 

AA253193 

AA432248 

X82200 

AA083572 

L0O352 

N75791 

X57579 

3(02612 

H44631 

AA090257 

X83703 

L40395 



BE243845 Hs.75511 
AUM5404 Hs.46366 
AA209534 Hs.284243 
T40064 Hs.71968 
AA453694 Hs.12372 
AF055581 Hs.13131 
AA353093 

D63476 Hs.172813 
U20982 Hs.1516 
BE245380 Hs. 153952 
AW505214 Hs.155560 
AW250380 Hs.109059 
AF062649 Hs252587 
AW021276 Hs.17121 
A1557212 Hs.17132 
AW337575 Hs.201591 
BE548267 Hs.337986 
AF167706 Ha.19280 
AW1S0920 Hs. 19928 
BE048821 Hs.20144 
AKJ77288 Hs.296323 
AL050295 Hs.22039 
NM.001955Hs.2271 
KM_017413Hs^03084 
AJ271216 Hs.22880 
BE620886 Hs.75354 
AI824144 Hs.23912 
AF104266 Hs.24212 
NM.003155H3.25590 
AW939251 Hs.25647 
X56681 Hs.2780 
T47364 Hs.278613 
T93500 Hs.28792 
AA040311 Hs.28959 
BE559681 Hs.30736 
AA443966 Hs.31595 
AW960564 

NMJ)16558Hs.274411 
AI161383 Hs.34549 
AI359214 Hs.179260 
AI267615 Hs.38022 
BE379499 Hs. 173705 
AI752235 Hs.41270 
AA235709 Hs.4193 
BE177330 Hs.325093 
AF1 12222 Hs.323806 
NM.003542HS.46423 
AA876616 Hs.16979 
AW163483 Hs.48320 
NH.001290Hs.4980 
AA257992 Hs.50651 
M24283 Hs.168383 
AA160511 Hs.5326 
BE379595 Hs.283738 
BE218319 Hs.5807 
AA373314 Hs.5897 
AF234532 Hs.61638 
AA480074 Hs.331328 
Y00787 Hs.624 
BE384932 Hs.64313 
AW946276 Hs.6441 
W16518 Hs.279518 
AA026533 Hs.66 
AA370045 Hs.6607 
AW021103 Hs.6631 
AB037715 Hs.183639 
NM.0O6074HS.3185O1 
AA403045 Hs.6906 
AI147861 Hs.213289 
AW631255 Hs.8110 
AW103364 Hs.727 
NM_00O499Hs.72912 
BE294063 Hs.737 
BE547647 Hs.177781 
X83703 Hs.31432 
BE619053 Hs.170001 



connecfr/e tissue growth factor 
WAA0948 protein 
tetraspan NET-6 prutsk) 

Homo sapiens mRNA; cDNA DKF2p564F053 (fr 
tripartite motif protein TRIM2 
lysosomal 
metaDothJoneln 1L 

PAK-interacfing exchange (actor beta 
insufijvCte growth factor-binding prote 
SnudeaWase{CD72) 
calnexin 

mttochondrtal ribosorna! protein L12 
pftuitary turru^-translomikig 1 
ESTs 

ESTs, Moderately similar to I54374 gene 
ESTs 

Homo sapiens cDNA FU 10934 fls, done OV 

cystetne-rlch motor neuron 1 

hypoftefca) protein SP329 

smaO Inducible cytokine subfamily A (Cy 

serum/glucocorticoM regulated kinase 

K1AA0753 protetn 

endotheDn 1 

apelln; peptide Ugand for APJ receptor 

(fipeptioVtpepfJdasein 

6CN1 {general control of amlno-acid synt 

ESTs 



stannlocaWn 1 

v-fos FBJ murine osteosarcoma viral onco 

Jun D proto-oncogene 

Werteron, alp ha -inducible protein 27 

Homo sapiens cDNA FU1 1 041 fis, done PL 

ESTs 

KIAA0124 protein 
ESTs 

transmembrane 4 supertamiiy member 1 

SCANdomakvconbiningl 

ESTs, Highly similar to S94541 1 done 4 

chromosome 14 open reading frame 4 

ESTs 

Homo sapiens cDNA: FU22050 fls. clone H 

pfocoaagen-lysine, 2-oxogIutarate 5-dto 

DKFZP58601624 protein 

Homo sapiens cDNA: FU21210 fis, clone C 

plnin, desmosome associated protein 

H4 histone family, member G 

ESTs, Weakly similar to A43932 mucin 2 p 

double ring-finger protein, Qorfin 

UM domain binding 2 

Janus kinase 1 (a protein tyrosine kinas 

intercelkilar edhesion motecute 1 (COM) . 

amho add system N transporter 2; porcu 

casein kinase 1, alpha 1 

GTPase Rab14 

Homo sapiens mRNA; cDNA DKFZp586P1622 (f 
myosin X 

r^potbetical protein FU13213 



ESTs, Weakly similar to AF257182 1 G-pro 
Homo sapiens mRNA; cDNA DKFZp586J021 (fr 
amyloid beta (A4) precursor-like protein 
InterleuWn 1 receptor-flke 1 
AXJN1 up-reguiatod 
hypothetical protein FU20373 
rr/pothetaj protein FU10210 
Homo sapiens mRNA full length insert cON 
Homo sapiens cDNA: FU23197 fis, done R 
tow density lipoprotein receptor {famBI 
L-3-byo^yacyi-Coenzyrne A dehydrogenase 
Inhfoln, beta A (acfivln A, actMn AB a 
cytochrome P450, subfamily I (aromatic c 
immediate earty pcoteh 
hypolheUcal protein MGC5618 
cardiac ankyrin repeat protein 
eukaryofic translation initiation factor 
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133584 
133590 
133617 
133651 
133671 
133676 
133681 
133722 
133730 
133750 
133802 
133825 
133838 
133859 
133889 
133960 
133975 
133977 
134039 
134075 
134081 
134164 
134203 
134238 
134299 
134332 
134339 
134343 
134381 
134403 
134416 
134493 
134558 
134817 
134983 
134989 
135052 
135062 
135069 
135071 
135073 
135170 
135196 
135348 



AA227913 

X52947 

M11313 

L14837 

M60721 



T67986 

AA148318 

U97105 

T25747 

K02574 

D78577 

X53331 



X95735 

L16862 

U44975 

M97796 

U86782 

AA099391 

M19267 

D29992 

L19314 

S78569 

U28811 

L77886 

C14407 

M60278 

R61509 

AA487558 



AA478971 

D50683 

U56637 

M61199 

M28882 

X15183 

S53911 

U20734 

D28235 

AA236324 

AA148923 

AA174183. 

AA456311 

L08069 

AA452000 

AA282140 

J02854 

AA442054 



AW880841 Hs.96903 
NM_000165Hs.74471 
AU077051 Hs.74561 
NM_003257HsJ4614 
M60721 Hs.74870 
D90209 Hs.181243 
T70956 Hs.75106 
BE244334 Hs.75249 
A1301740 Hs.173381 
AW503116 Hs.301819 
AW247252 
AI352558 

AW969976 Hs.279009 
BE242779 Hs.179526 
BE410769 Hs.75873 
AW239400 Hs.76297 
BE616902 Hs.285313 
BE222494 Hs. 1809 19 
U86782 Hs.178761 



U48959 
M19267 
C18356 
A1125639 



Hs.211582 
Hs.77699 
Hs.295944 
Hs.250666 



NM_002290Hs.78672 
NM_012201Hs.78979 
AL034349 Hs.79005 
AW245540 Hs.79516 
AA161219 Hs.799 
AA102179 Hs.160726 
AW580939 Hs.97199 
Hsi1B75 
Hsi1988 
Hs.82028 
Hs.184270 



R70429 
D50683 
AI557280 
AA334551 
X68264 
M30627 



HS211579 
Hs.269088 



NM_001773Hs.85289 
AU076592 Ks.198951 
D28235 Hs.196384 
AW966058 Hs.92381 
AL136653 Hs.93675 
AK000967 Hs.93872 
AA876372 Hs.93961 
W27190 Ha.94 



W55956 
T53169 
C03577 
U80983 



Hs.94030 
Hs.9587 
Hs.8615 
Hs.268177 



p53-induced protein 

gap Junction protein, alpha 1, 43kD {con 

aiptia-2-macrogbbuCn 

tight Junction protein 1 (zona ocdudens 

H2.0 (DrosophBaHike homeo box 1 

activating transcripQon factor 4 (tax-r 

dusterin (complement tysis kiMbKor, S 

ADP-ribosytaOon factor-lite 6 tnteracfi 

zinc finger protein 146 
nucteostde phcsrjhorytase 
tyrosine 3wnonooxygenase/lnrp1ophan 
matrix 61a protein 

unregulated by 1,25-aThydroxyvitarrdn D-3 
zyxfn 

G protein-ccupled receptor kinase 6 
core promoter element binding protein 
[nhlbaor of DNA binding 2, dominant neg 
26Sproteasome-assodated padlhomobg 
myosin, light polypeptide kinase 
tropomyosin 1 (alpha) 
tissue factor pa&way inhibitor 2 
hairy (Drosophlla}4>omoiog 
laminin, alpha 4 
Goigi apparatus protein 1 
protein tyrosine phosphatase, receptor t 
brain abundant membrane attached signal 
diphtheria toon receptor (heparin-Wndl 
Homo sapiens cDNA FU1 1680 fis, done HE 
complement cornponerrtClq receptor 
growth factor receptor-bound protein 10 
disabled (OrosophSa) homoiog 2 (mitogen 
transforming growth factor, beta recepto 
capping protein (acfJn flamenO muscle 
sperm specific antigen 2 
melanoma cell adhesion molecule 
heat shock 90kD protein 1, alpha 
CD34 antigen 
jun B proto-oncogene 
prostaglandin-endoperoxide synthase 2 (p 
nudlx (nucleoside diphosphate linked moi 
decidual protein induced by progesterone 
K1AA1682 protein 

Homo sapiens mRNA; cDNA OKFZp667D095 (fir 
OnaJ (Hsp40) homoiog, subfamily A, membe 
Homo sapiens mRNA; cONA OKFZp586E1624 (f 
Homo sapiens cDNA: FU22290 fis, done H 
myosin regulatory light chain 2, smooth 
phospholipase C, gamma 1 (formeriy subty 
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TABLE 4 A 



Table 4A shows the accession numbers for those pkeys lacking unlgenelO's for Table 4. The pkeys in Table 7 lacking unJgeneiD's are represented within 
Tables 1-6A. For each probeset wa have listed file gene cluster number tram which the oligcraideoildes were designed. Gene clusters were compfied 
using sequences derived from GenbankESTs and mRNAa. These sequences were clustered based on sequence stmBarfty using Clustering and 
Alignment Toots (DoubleTwlst Oakland California). The Genbank accession numbers for sequences cornprising each duster are listed In the ■Accession' 



1 0 Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



15 Pkey 

100752 

20 



CAT Number Accession 



33207.21 T81309 BE019033 R94181 BE019198 ML.OO0612 J03242 AW41 1299 BE300084 8E297544 R94182 AW630108 T53723 
D58853 H78073 H80594 BE299560 T48899 H70196 M17426 M77077 S77035 H58384 H61664 H76540 T84527 C17198 
H60255 H71980 R92644 W79050 X00910 M29645 R91055 M17863 M17862 T71815 BE299561 BE464561 XD6260 
R94741 T54216 C18594 BE262015 X06161 AW409889 AA378400 BE263228 BE313278 R88116 BE313457 H43500 
T48617 BE313761 H773Q9 AI207601 X06159 H40413 X03425 T87663 R10627 XQ3562 M14118 W03982 R97520 H81229 
T83157 H83168 H48762 AA669898 BE263054 H47289 AA022807 R1 1555 H74260 R76988 R28338 H72534 H72464 
H82031 N72478 N45355 AW411300 R89113 R69135 H58454 T83281 R93476 H69645 H68015 T82229 H71089 T85121 
H59939 W65299 K78176 H53909 N72373 R21788 H04660 H59639 H61874 BE262219 T53614 N73335 N50464 W00943 
N77189 R89257 AA570502 R89432 R06366 AA553480 AA776271 AA551359 AA551050 H51670 AA601 052 BE299081 
H68198 H52276 BE207832 N91192 H70332 XD7868 X07868 H69464 H53782 H73710 R80435 AA553384 AW884176 
N53475 T71662 AW954036 AW954033 AA552931 H93208 AA430218 AA553476 AI918470 T54124 BE207982 BE300177 
N73994 AW882625 N39549 N53838 AA722389 H71878 H58909 H37849 H78435 T47933 R77174 R83814 AM1 1890 
H94199 AA663208 BE205778 AA490137 H70492 R98232 H37800 AA679294 H40341 H74238 H47290 H73231 T48618 
AA025428 AI039521 H92969 N59389 HS0533 H72933 T90630 AA41 1891 N55000 H74225 AA34029Q AW957051 T54316 
AA340437 H57125 H58908 H79027 H63450 N74623 R93425 H68714 H6B758 N68396 H48763 N69256 H57320 H53831 
H53589 N68833 N52453 H56048 H69870 H78074 R69253 R83375 T53615 H94330 H58455 H90864 T47934 H74261 
R89258 R97997 R91Q56 R28339 R86760 H78235 R97521 H67692 H40358 AA022688 H52513 H59601 T88690 H55256 
H63397 W65397 AA553588 R19280 N52645 W73930 R06367 R21743 H72372 N73921 AW883539 AW882639 T40616 
H47084R95723AA634316AA862781 H77310R91389 H93111 R92767T54512R89341 H70333H57817H82941 H62032 
N52638 H58385 T91796 H51088 AA340292 T49918 H81230 R38121 N50411 T87664 N62436 N39340 AA665637 
AA340446 H93377 H92973 BE296290 BE269788 H61665 AA340444 N54605 AA454101 R10628 R94200 A1200549 
AA342640 BE288855 BE250229 T49916 H82008 N28278 AW880662 H71268 N76791 H47685 H65255 W05198 
AW889144 N76677 H71702 H68036 H71915 R91612 R87807 H58059 AI133328 AI247866 AA621443 AW881050 
AA700847 AA340413 AW878608 AW881 181 AW878249 H71916 N54596 BE161581 AW878082 W04212 AW881040 
AWB85492 AW880519 AA334887 AW878715 W06882 AW630222 AW885381 H70869 AW381778 H47601 AW889982 
H63868 AWB84986 AW87B71 3 AW87B685 R36391 AW878694 AA368070 C03393 AW878695 AW878705 AW878665 
AW878742 AW878620 AW878823 AW878688 R29048 AW878690 AW878686 AW878810 AW878827 AW878733 
AW878659 AW878749 AW878681 AW883353 AW883277 AW883300 AW8835S5 AW883298 AW883143 AW883045 
AW883482 AW883352 AW883417 AW883357 AW883231 AW883474 AWB83355 AW882620 AW882533 AW883754 
AW883139 AW882827 AW883641 AW883567 AW883481 AW882983 AW882982 AW882465 AW883419 AW882466 
AW883639 AW883230 AW882981 AW882534 AW882874 AW882619 AW883480 AW882826 AW882831 AW882835 
AW882830 AW883563 AW882456 AW627642 
1 17156 145392.1 W73853 AA9281 12 W77887 AWB89237 AA148524 AI749182 AT754442 AB38392 AC53102 AW79403 AI370541 AI697341 
H97538 AW188021 AI927669 W72716 A1051402 AI188071 AI33590Q N21488 AW770478 W92522 AI691028 AI913512 
AJ 144448 W73819 AA604358 N28900 W95221 AI868132 H98465 AA148793 
131859 3672.1 AW960564 AA092457 T55890 D56120 T92525 AJ815987 BE 182608 BE182595 AW080238 M90657 AA347236 AW961686 

AW176448 AA304671 AW583735 T61714 AA316968 AM46615 AA343532 AA083489 AA488005 W52095 W39480 N57402 
D82638 W25540 W52847 082729 058990 BE619182 AA315188 AA308636 AA1 12474 W76162 AA088544 H52265 
AA301631 H80982 AA113786 BE620997 AW651691 AA343799 BE613669 BE547180 BE546656 F11933 AA376800 
AW239185 AA376086 BE544387 BE619041 AA452515 AA001806 AA190873 AA180483 AA159546 F00242 AB40609 
A1940602 AI189753 T97663 T661 10 AW062896 AW062910 AW062902 A1051622 AI828930 AA102452 AI685095 AI819390 
AA557597 AA383220 AI804422 AJ633575 AW3381 47 AW603423 AW606800 AW750567 AW510672 A/250777 AA08351 0 
AW629109 AW513200 AA921353 AI677934 AI148698 A1955858 AA173825 AA453027 AI027865 AW375542 AA454099 
AA733014 AI591384 R79300 R80023 AA843108 AA626058 AA844898 AW375550 AA889018 AI474275 AW205937 
Ai052270AW388117 AW388111 AA699452 AI242230 N47476 H38178 AA366621 AA113196AA130023 H39740T61629 
A1885973 AW083671 AA179730 AA305757 A1285455 N83956 AA216013 AA336155 AW999959 T97525 AA345349 
T91762 AA771981 A1285092 AB91386 BE392486 BE385852 AA682601 AI682884 AA345840 T85477 AA292949 
AA932079 AA098791 082607 T48574 AW752038 C06300 
1704098J R20840R20839 

1227.6 BE273749 BE397561 BE387189 AUD37858 AL037878 A1963094 BE259216 AA01 1363 AL036189 BE562325 AA251 169 

BE617431 N98537 AA158093 ALO47800 M34539 KMJW08Q1 AA312140 D16971 AA158904 AA3071 14 AA312803 
T09203 AW629886 AL048504 BE388578 AA220957 AA1 58364 BE267385 AA294971 C18055 BE241757 AA115056 
AI936769 BE378435 8E206971 AW674924 BE622060 AA604674 AA1 15273 AW402159 AA338608 BE568819 M80199 
X55741 AA37511 1 AA376016 BE612671 AA805742 AW405588 N25850 N44580 H06031 AW403549 BE536552 AA056726 
BE543239 AA082517 A1201645 AJ201642 AI192622 N40104 AA370921 BE547569 AI969602 AA302038 AI197890 
AW268354 AJ014938 W45448 AI541395 AA037272 BE538826 AL039813 BE536130 AA299355 AW805147 AW974624 
H 53220 AI471471 AA399303 AA007386 W35106 BE613277 R12739 R12738 AA304342 AA687802 BE409581 AI498844 
AV662092 AW904105 AA011375 BE315214 H99302 BE537893 N32299 AW855829 AI291320 BE078322 AJ301395 
AA303362 N32719 AA358328 AA357877 AT952540 K56279 H02758 H02048 AW805233 R82224 AA410772 AA291352 
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BE171109 N69935 BE169248 AA361 173 H44978 BE617887 D52560 M084043 W03595 R67219 N36477 N42924 R67104 
H44901 H79695 W21105 AA393988 W30899 AA316098 BBB22896 W46872 AA442678 BE544893 BE5401 12 BE621873 
AA338067 N550S2 BE398154 BE621210 AA74076Q C03739 C03206 BE396692 AA482370 AA031614 AA301575 
AA304710 AA132153 AAD29796 AA934960 H18567 AA442969 H49781 H46871 AA035395 AA056185 AA149378 
AAB43080 AL135479 AA292329 AA654337 AA041228 AA454888 AA025039 W58331 AA625981 T94941 AA302448 
H19900 AA218956 AA513790 AA563962 AA338076 W44441 AA293276 W47373 AA625879 W3088B AA043029 T64284 
R79151 AA304340 AA485186 AA604939 R82470 AM21425 AW771456 AB39329 AA304424 AA605236 AA936934 
AA587673 AB09162AI697301 AJ479995AI679814AJ361950 AW189125AI955888AI986019 BE301O19 AW84792 
AI31021 1 AW189307 AJ022070 AW977204 AI146825 AW190163 AW303281 AI828345 BE046043 AW029257 AA482268 
AI246507 AJ420729 AW084932 AW4395 14 AI890487 AW439692 AIS23898 AI185612 A1659953 AI889773 AA687527 
AW072634 AW2621S3 AW467371 A1613269 AI679238 054404 AA158103 AW105527 AW149739 AW150361 AW268387 
AW1 17708 AJ951682 AI687440 AW674285 AA878365 A1587082 AA732095 AA019899 W45661 AAS27300 BE613304 
AA765891 AA612935 AJ814658 AW316916 R66594 AA514640 AA025040 AA031472 AW732076 AA029797 A1244560 
AM 28734 AW381 720 AI092360 AC63283 AW813175 AJ890675 AI720156 AW831348 AI635106 AI278045 AA303979 
AA703505 W45449 AW078661 AC92052 AW381707 AI147854 AW381743 AA158905 AA303258 AA888144 AW1 95967 
AA428708 AA989559AA6 17731 H19882 BE543418 AA830386 AA421302 W58652 T94995 AJ869743 AI679145 
AW085971 N38425 AA765136 A1347027 AI356955 AA928038 A1679717 AA458459 AA679281 AI367973 AI270041 
AA765135 AA732793 AJ798447 AA668648 AA251008 AI984538 A 1 401 737 AA056185 BE043308 AWB62375 AI3021 10 
N50724 W96332 BE537047 N26983 A1567172 AA765236 AW573237 N297B4 AA534275 AA084044 AW087973 
AW300766 T63398 W46823 R39790 AB64185 AW298582 AA454814 AW089878 N67751 H05982 N23140 AJ362647 
AI302O86 AI767772 N25755 H531 14 AA708133 T9351 1 AA429291 AA935294 AA987647 W02803 R66595 AJ880795 
W23673 AW440794 AA722872 H49538 AW131042 AA531603 AA908665 AA040791 AA23531 2 W52205 N93444 R82180 
H02759H79696AW(J88894H56079AA961143AW067776AVra73745AA016311AW 

W47374 T64155 AA298092 A1698626 AA558158 AA258088 AW794259 H01983 AA 149267 AA485076 AA975856 H44938 
AA035396 AI955555 H46289 AA488161 AI831222 AA359047 AW794253 A1806962 AW243930 AA526145 AW878734 
AA018464 AA132031 R67220 R79152 AA296093 H54300 A1005160 BE242548 AW992803 AW878644 AW878666 T27742 
R82471 AW517604 AW472738 A1282904 R39791 AA486098 AW467891 AW960520 AA551736 AA056621 AW945197 
R66373 AA554236 BE242202 AI904376 AI832590 H 19484 R00890 AI627677 AA302287 A1869451 AI734855 AI7O8073 
A1832902 AA5851B4 AW204299 AA055565 D12417 D1 1975 T63543 AW664099 R54423 BE612712 T96340 T63985 
AA598917 T40735 T64053 AA149284 AW272548 AA363445 AA042893 AW300697 BE261973 T53501 T53500 AW878729 
AW878657 AW794391 AA069193 R01553 H44875 AA385406 AA533968 M93060 AL135600 W96331 AA017651 
AA018849 AA017692 H85337 BE278690 AA731598 AA018512 AI076813 AI022644 R02585 X52220 AW296894 AA825671 
AJ699321 AI393601 AW592611 AI146747 AA608921 AA158365 AW590007 AA354519 020081 R02704 AW798339 
M92422 AA094903 AA007676 

133681 13893J AI352558 Z82248 X78138 NMJW3405 AU077248 AA223125 S80794 D78577 AJ124697 AW403970 BE614089 BE296713 

BE621334 L20422 X80536 D54224 054950X57345 N29226 AA127798 AA340253 F08031 AA192540 H67636 AA321827 
AW950283 AA0B41 59 BE538808 AW401377 AA256774 C03366 W46595 W47608 AA305009 H69431 H69456 AL120082 
H11706 AA303717 AA361357 H22042 H78020 AW999584 AA134368 AA32291 1 AA322981 H60980 N85248 N31547 
H79624T11718 W85826 AW834663AW894624 8H167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151087 BE002724 AA608688 H82692 BE155392 AW383636 BE 1 55394 AA487004 AW383504 A1342365 R82553 
W16498 BE155344 All 43338 R89901 AA322873 AW340648 R25364 AA367935 A1559406 AA033522 AA374252 
AW835019 A1922133 AI697089 N99662 AW169078 AI199076 AW151598 W59944 AA662875 W94022 AA299055 
A1Q39008 AIB29449 AA583503 AI635674 AW131655 AW7382Q AW273118 AW900930 AA908944 AI688035 AW1 70272 
AI082545 AW468176 AI608761 AW82748 AI91 1682 A1248943 AI831016 AA192465 A1218477 AA938406 AA385288 
AI809817 AA905196 AI191245 AW70204 AI188296 AI421367 AI125315 AN87141 AA629032 AA740589 AI554181 
AA150830 AI248541 AI077943 AA775958 AA864930 AI261476 AI123121 AI310394 AA862331 AA872478 BE537084 
AI205606 AA720684 A1872093 AW150042 AL1 20538 AA21 9627 AA988608 C21397 AI359337 H25337 A1089749 
AA605146 AI359620 AA150478 AJ359738 AW383642 AW995424 AJ766457 R56892 AI089839 W61343 N69107 W46459 
AA5B5955 N20527 A1279782 W46596 AA776573 H23204 AI866231 AJ083995 N21530 AA126874 D82630 W65437 
AJ086917 AW382095 AI086877 H69844 AW340217 W85827 L08439 AA262704 AA505380 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA096126 T28031 AA491574 R84813 AA774536 AW383522 AA 15561 5 AW383529 
M491520 AW028427 M171496 AI469689 AW664539 ^ ^ 

AA219489 H13301 AA505883 AI864305 AI423963 AW084401 F04963 R69858 H67097 A1917740 A1655561 H69864 
AA033631 AW383484 AI886261 H25293 AA513281 AW271187 H1 1617 N79982 AJ174338 AJ904207 AJ904203 BE614558 
W94127 W65436 AI272249 AA700018 AI579932 A1085941 AW152629 

134403 17037.1 AA334551 BE008229 AA307537 AW961156 AW995894 AW995826 NM.0067S1 M61 199 AA045603 AL036372 AV645606 

AI688095 AW351901 AA101337 AA101345 N73342 BE018030 BE569044 AW841975 AA373388 BE090412 H95440 
N53845 R67867 AA093441 AA363427 H93708 AW023134 AW994988 AW9949S9 BE090429 R23614 A1567932 H03726 
H01 101 H01867 AA548743 AI671B06 AW872949 AW872941 AA742447 AI199788 AA045604 AI637465 AI741796 
AW242217 AW131463 AI765302 AI683923 AA889762 AJ804689 AI986437 C06049 BE502340 AJ695651 AM91970 
AA496804 AA281008 AA665699 AW73814 BE301445 AA707837 AA551925 Al0 17348 AI208185 AA775203 AA156296 
AA557463 H95441 AA768547 AW769358 AA991197 AA181954 AI091389 AI147289 AW771837 A1638582 AA84441 1 
AJ374750 T29320 AW951272 AW085923 H02834 AA843259 AA814696 AW183290 AA158453 N68125 N69039 AA100423 
AA101346 AI918720H01102R67868 H01868 N66438 R48580 AI858433 AA599560 AA187577 AA157481 AA361520 
AL047827 AA158452 R21688 AW964874 AA325161 R40871 AW752395 AW375924 R13355 AA281174 AA428908 

126872 142696J AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE01 1212 BE01 1359 
8E01 1367 BE011368 BE01 1362 BE01 1215 BE01 1365 BE01 1363 

121335 279548J AA404418 A1217248 

130018 18986J AA353093 AW957317AW872498Ai560785AI289110AW135512X97261 T68873 

121822 244391J AI743860 N49543 AW027759 BE349467 AI656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 A1611424 
AL079362 AI969290 AI928016 BE394912 BE504220 BE467505 AJ611611 AB11407 AB11452 W56437 A1284566 
AJ583349 AW183058 At308085 A1074952 AA437315 AA628161 AW301728 AI150224 AA400137 AA437279 AI223355 
AA639462 AI261373 A1432414 AI984994 AI539335 AA401550 AA358757 A1609976 AA442357 AA359393 AA437046 
AA370301 AM29328 AW272055 AI580502 AI832944 AI038530 AA425107 AW14986 A1148349 AW237721 AW779756 . 
AW137877 AI125293 AA400404 R28554 

123523 genbanKjAA608588 AA608588 
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123533 genbanUA608751 AA608751 

125091 genban)LT91518 T81518 

123964 genbanLC13981 C13961 

102491 ento«JJ51010 U51010 

118475 genbanLN66845 N66845 

118581 genbankJJ58905N689Q5 

113947 genbaniLWS476fiW84768 

101447 entoLM21305 M21305 

101667 13349,1 NM.005381 M60858 AW373732 AW373724 AW373689 AW373629 AW373609 AW373776 AA1 87806 AW386946 

AW374207 T05235 AA216203 AW385556 AA306940 AA30S526 AA315461 AL036757 AW37371 1 AW403124 AW403640 
AW377084 T27360 H82638 F06957 AW377051 AA554779 AA378568 AA096007 AW352407 AW302637 F07929 H17433 
AW382712 H05665 F07292 N39875 AA089729 H62556 N42842 R 12952 AW373735 AW364155 AA056183 W39185 
AW382708 K32488 AF1 14096 AW375993 A1133569 W52561 AA603040 AA133710 A1928796 AW176370 AA8275 19 
AW338437 AA521142 T29341 AI800461 AW317002 AA703914 AA860830 A1859203 AI445772 AA714334 AJ817066 
AI832027 AW510442 At635802 AW088306 AW068672 AW408555 AW467542 AA552657 AA152367 W32081 AA582124 
AA074O40 AA931657 AI051154 AW410203 AB21644 H17434 A1832330 AW404836 AB25038 AA088423 AA954166 
AA5B0453 AW021292 AC67215 AW080082 AW383778 A1933053 AB19097 W31557 N90245 AA931591 AA563995 
F36352 AA056184 AA476294 AA64 1327 AA533550 AJ749630 W58323 AA5691 1 9 AA508573 AJ8O9050 AJ378996 
AA411362 AW407505 AA938104 AA074041 AA632876 AW1 93748 AA507873 AC70128 AM72365 AA41 1363 A1523216 
AI719965 A18 16302 AA182581 AI707990 AA133588 AI758537 W60253 AJ460308 AA135423 AI083904 F04188 N89693 
AW408776 AI678595 A1270568 AA722059 W58234 F33650 AA090547 AA285108 AA425981 N85079 020218 A1273980 
AA159028 F03226 AW247914 M26918 AW272741 N90109 H05666 N23327 AW247953 R44748 AA962015 F0355B 
AJ752394 AW409913 AW248396 AJ816463 AI752393 AA325370 AA2630B9 AI570130 AJ971951 AI160658 AJ3S7360 
AW168686 AL121075 AW050536 N21672 W67748 AA514242 AI127386 H14607 AJ185752 W79364 AA088520 AA152476 
AW351940 AW373683 AI940524 AW374953 T56500 N24329 AI940720 AW374933 AW374947 AW391913 AJL138337 
AW376241 AW062943 F26666 AW41O202 AW062958 F34529 AW381807 AW393315 W17147 AW176359 AA664576 
AW380424 AA306040 AI745674 AW300951 AI188579 AM38973 AI305271 AA433818 AA612807 AJ831809 AI940409 
AA158663A1572988 

108931 genbanLAAl47186 AA147186 

103138 entTKLX65965 X65965 

103432 entreO©7748 X97748 

119174 genbank_R71234 R71234 

133678 11 235 J AW247252 AA346143 NMJW0270 AA381085 N91995 X00737 AA381079 AA296473 AA296110 AA315735 AA311617 

AA326750 AA376804 AW403290 T95231 M13953 T47963 H32039 AA279899 AA627997 N76320 N99527 H37842 
W20095 AA457308 AW469547 AA724143 H83220 AA319496 W86334 W30892 R89169 R99427 N41854 H47286 
AA348094 AA045089 R63016 AI922219 AJ024906 AJ096488 A1885005 AA194872 N90489 A1452544 H72411 AA282427 
AA430735 R68963 R22453 H70385 AW129369 AW467320 AW519082 AA345018 AA582183 AI961789 R65918 N3061 1 
AI979189 AI280889 AW273191 R66531 AI285845 AJ675927 AM21990 AW190879 H37794 AA699667 H68427 AA954388 
AJ188757 Al 140048 AA430382 AI204151 AW247864 AA559099 A1431420 AA548276 A1149466 AA772669 AA694388 
AA724168 AA301651 AA281952 AA779925 AA234760 W86290 AA913603 AW51 1745 A1500697 AA814922 AA835040 
T47964 H53998 AA975804 R98710 AI077604 N70252 R98084 AW250171 H69268 A1597614 AA970746 AA972548 
■ AI377116 R62962 H16737 R89070AA731329 R66532 N54354 AJ818832 H81944 N71567T95122 W86463 AA437095 
AJ431999 AJ915724 N63851 AI674743 AA457307 AA21 1475 N64444 AI799146 H72853 R99335 H60413 AA770367 
AA156105 AJ269937 H64029 H89728 R65819 AW470496 AI873318 AI735713 H82987 C02447 AI478666 T27651 
AI699770 AW025156 H69719 AJ984717 N69225 AM59856 AA953577 A1424691 H13843 R22404 AJ873796 A1336002 
N70898 AI420854 AA541792 AA346142 A1000814 AI828348 AA045090 T51257 N90434 H13890 N73184 AI708083 
AA781606 AA329050 AA339985 R6B964 H64795 W04186 H16845 

119416 genbanlLT97186 T97186 

119559 NOT_FOUND_en1reA.W3B197 W38197 

123473 genbanLAA599143 AA599143 
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TABLE 5: 



Pkey Unique Eos prabeset identifier number 

Accession: Accession number used for previous patent filings 

ExAoar Exemplar Accession number, Genbank accession number 

UnigenetD: Unrjane number 

UnigeneTiae: Unfcene gene file 



Ptey Accession ExAccn UniGene UnigeneTWe 

115819 AM26573 AA4B6620 Hs.41135 AA486620 

132837 058024 AA370362 H357953 AA370362 

101545 M31210 BE246154 Hs.154210 BE246154 

102638 X06256 NM_002205Hs.1 49609 NM.002205 

101192 L20859 BE247295 Hs.78452 BE247295 

102915 X07820 X07620 Hs.2258 X07820 

105330 AA234743 AW338B25 Hs22120 AW338625 

107385 U97519 NM_005397Hs.16426 NM.005397 

102024 U03877 AA301867 Hs.76224 AA301867 

134416 M28882 X68264 Hs.211579 X68264 

103036 X54925 M13509 Hs.83169 M 13509 

104865 AA045136 T79340 Hs.22575 T79340 

106124 AA423987 H93366 Hs.7567 H93366 

105330 AA234743 AW338625 Hs.22120 AW338625 

109001 AA156125 A1056548 Hs.72116 AI056548 

104764 AA02S351 AI039243 Hs.278585 AI039243 

133200 AA432248 AB037715 Hs.183639 AB037715 

105263 AA227926 AW388633 Hsj6682 AW388633 

105178 AA187490 AA313825 Hs^1941 AA313825 

109456 AA232645 AW956580 Hs.42699 AW956580 
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TABLE 5A 

Table 5A shows the accession numbers for (hose pkays lacking unkjenelD's for Tabte 5. The pkeys In Table 7 lacking unigenelD's are represented wiWn 
Tables 1-6A. For each probeset we have listed the gene duster number from which the oBgonudeofidss were designed. Gene clusters were compiled 
5 using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 

Alignment Tools (DoubieTwlst, Oakland California). The Genbank accession numbers for sequences comprising each cluster are feted In the •Accession" 
column. 



10 Pkey: Unique Eos probeset Identifier number 

CAT number: Gene duster number 

Genbank accession numbers 



15 Pkey 
115819 



CAT Number Accession 



10241 J AA486620 AF205940 AA297524 AB034695 M081335 NM016242 AA 188323 AA297537 H88204 AW953081 W31695 
AW582203 AA248250 AW68121 1 AA426230 AA464807 AA426155 N44141 AA347390 AA770661 AI333225 N36138 
AW665724 AA431B94 AI374976 AI400254 AB38446 AA166695 H88205 W04527 AA487066 AI051414 AA918383 
20 AA426573 AA425620 AW438654 AA090513 BE 167284 BE167291 AI301 726 

102024 14505.1 AA301867AW957981 R27614 AA15580B AI920990 AI74071 1 AA301026AA301015AI220981 AI857670AI537140 

AW015210 AA030000 W46890 H44021 AI355967 AJ651735 AA058479 AA146932 T58265 R85890 AA047810 AA017387 
AW026093 AA971 133 AI827263 A1056416 AD55994 Al 127691 H46603 U03877 NM.004105 AA157357 H42844 
AA146824 AA187709 AA187269 AA304348 AA147292 AA381687 AA156041 AA330636 R32929 AA321130 AW9S0260 
25 AA082157 AA029129 AA303708 AA028155 031561 T84689 AA302493 BE153057 BE153181 W39408 AA187200 

BE153250 AW383337 AW382622 AW382647 AW750072 BE 1 53060 A W382630 AW371865 AW392464 AW382664 
AW382658 AW382650 H61647 AW385075 AW365049 AA373397 BE072779 BE072781 Z30254 W24361 BE153254 
AAO40442 BE072729 BE072731 N94740 AA146945 AW802737 AI826799 AI085395 R34034 H65140 AA0828C0 H88275 
AA147824 R63882 W80899 AA296413 AI765300 AI862426 AW022055 AW300003 AI743784 AI862635 AI985428 
3 0 AA147764 AW573245 AW190290 A1040898 D57613 N63457 AA148082 AI028456 AA1461 10 AW814489 N75105 

AW629443 AA704122 AW582220 AA181240 AA057495 AI418224 AC61 751 AW388595 AI472205 AW470672 AA102546 
AA789046 AA182416 AA062668 AW300732 AC88220 AA181982 AA146825 AA028130 AI985522 AA303344 AA081313 
N69082 AA182035 AI867128 AA100902 AA605087 N67178 AW020324 AW890446 AI472191 AI335691 AI597837 
AI081 143 AI335681 AA040443 AI128067 AI678244 AA018303 AA157260 W80792 AI934590 AIO96430 T54343 AI446350 
3 5 AA1651S6 AA780683 AA603631 AA047787 AA968580 AA912645 AW890504 AW02691 3 D56983 H52088 AA156121 

R30848AW023036 AI590960 N67345 AI753225 AI753283A1 183768 AA147818 H89101 AI362141 H89205AI147711 
AA321 129 AA668622 AA343479 AW069438 AI422376 AW629270 AA013413 AI221948 AA970605 N52335 K38366 
T91180 AA657841 AA017386 AA152227 AA187593 AI913340 AI719313 AI969943 AI701271 AJ004328 AI868348 N93659 
H65093 H25736 D57007 D56957 C00987 D61839 056661 A1472137 AI971002 D56971 BEO48830 057972 A1589286 
40 AJ361055 AI361071 AI292223 AA155898 D57139 D57981 057345 AI420034 057332 057959 AA875933 R33493 N67558 

D58353 AA188394 AA147966 AI160640 A1363165 K40638 AA578137 AW950265 AA300943 AI128999 H46584 AA917355 
N57820 AA320504 H51959 H25737 

101545 24607J BE246154 M31210 NM.001400 AA193392 NM.016537 AF233365 AF022137 H27787 AA370448 F05373 T27666 W21494 

AA036907 AI249956 N93476 F01623 AA304390 AA308808 

45 109456 180833J AW956580 AA886361 A1147670 AI0901 15 AI168683 AA232645 H99504 AA374707 AA380875 AW139567 AI735132 
BE439385 AW629780 N28322 AA232789 AA232790 N73285 
103036 17145J M13509 X54925 NM.002421 M16567 X05231 M15996 W39354 AA186634 AA852324 AA187507 AA081 149 AA186524 

AA187264 AA187361 AA386155 AA186973 AA374217 U76045 AA081230 AA188049 AA186393 W56827 AA852602 
AA157468 AA30B204 AA186754 AA186808 AA082516 AA304334 AW376428 BE439384 AW376420 AA156273 T18504 

50 AA186521 W49496 AW084608 AA083575 AA372360 AW963590 AA132297 W47445 M186376 AA157628 AW003999 

AI037890 AI85806O AI589010 AI743739 AI452673 AW304188 AW117854 BE439933 AA157416 AW778966 A1038497 
AA081006 AA100829 AA181048 C02231 T27821 W23960 AW954802 A1471432 AW801296 AW801289 AW801603 
AW801523 AW801292 AW801542 AW801601 AA181134 AI445147 AA191501 AA582862 N94407 AI147810 AA161680 
W49497 W52714 AA188249 AB32881 AI082493 AA503656 AA182682 AW801393 AA 182830 AA181882 AA182826 

55 AI613182 N94510 W47343 AI085755 AI076956 A1918426 AA081208 AI282835 AA147528 AI081490 AI654536 AA181675 

AA081282AA186389 C06085AA083542 AI800644M157642M101069AA1 57752 AA158121 AA143331 AA081283 
AAB52603 AA188296 A1932880 AW449628 AA 187348 C02091 AA514656 AA082736 AA308786 AA143201 M16567 
133200 28960.1 AB037715 AI351347 AI375796 AI884765 AL121 124 W01068 AI807275 T95240 R42807 AW515645 AI057314 AI033520 

AA057671 N70215 AA054215 AW204183 AA552149 T95130 AW796310 AI866520 AW275564 AW796308 AI637901 

60 AW197404 T78406 AA456232 AW206463 AA779800 AI052696 AA026744 AA454623 AW470729 R45490 AW770258 

AI038393 AI290170 AA722734 AL121125 R41608 AI862414 AA838611 R45582 AI278083 BE466849 BE219944 
AA418030 BE041555 AA578572 T16528 AW008344 Z39782 AI244848 AW137344 AA707400 AI032028 BE540464 
A1094265 AI184281 AA931890 AW382744 AW382729 AW020448 AW827237 AA431226 AI672059 AW772345 N70172 
AW022003 AI862704 H19344 R6151 1 A 1030204 H16566 AA432248 AI767980 T16688 AI984342 AI217478 AI767095 

65 Z38551 AI359566 A1361437 AI041000 R07033 H16608 H19054 R12874 R61567 N9836B BE221 199 242320 AA094554 

R07078 AW860886 AA418090 R41262 
132837 256666 J AA370362 AA3641 10 AW959554 AW371737 AW382068 AW604716 AW604713 AA487827 AW371674 AA429137 

BE503321 T93570 W72803 AI093076 AA487977 AI241562 BE439445 AW204065 R51635 A1802994 T10362 W68553 
A1866215 AW152154 AA700716 AI127443 R15824 AI537587 AA9531 10 D58024 AI520811 AA693670 AI453280 W76329 

70 AW023955AW022563 

102898 24023,1 NMJJ02205 X06256 M13918 BE070866 AW239485 AW996127 BE273894 BE272590 BE410252 R25975 T11786 T11787 

AA301142 AA301165 AW960508 BE272819 AA386086 T39391 AA285303 AA370580 D58585 T58668 AA156213 W24142 
AA343323 AW796067 AA151197 AA376121 R94782 AA302383 H90357 R82621 AA301677 H55997 AW796059 W92358 
AL046458 AA471198 AA301952 R46287 R82694 H03186 AA187706 R32562 R27094 R25947 R25320 AW949809 H 13505 

75 H79049 R32403 H1 1213 R39710 H49765 H21142 H21006 AA417664 W52075 N56771 AA264240 N98556 N30907 
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AA707335 AW603781 AW0367 AI814584 AA524182 AA370076 AA416785 AA704082 AB06851 H25513 T56388 
AM19627 H03986 H20983 T56245 AI459715 AW973768 AI334096 AJ693020 T63414 R82646 AW167251 H55938 
AI274916 AA778367 AJ755253 AI033687 AW083222 AA 181 979 R26865 AA661627 AA706329 AI798648 AA612799 
AH 60180 A1274973 A1039284 AA301880 AJ042429 AA307632 AIQ85688 AJ278366 AJ498890 AA3Q3865 A1954844 
5 AA502380 AA156334 AA723480 A1803584 AI581026 AA304564 N51038 R94702 R69814 AW1 50962 AI570049 AA5868Q7 

AA151 198 T53400 AI557709 Ai 185326 AA309205 AW338969 R53903 AA991891 AA301 643 AM93337 A1026049 H25514 
AI741075 R28632 AW166445 AB33068 H49978 H91267 AA558193 AW079863 AA627380 AA807401 AI199958 AA666118 
AJ718216 AW193228 AJ077745 A1500496 A12S6059 AW080383 R08468 R26757 R32404 AA716599 W92322 AW77734 
A1270181 R46198 A1217540 AA304O45 AA305421 AW074445 AI468258 AW089568 AW571605 BE162930 H41009 
1 0 AAS78313 AW874497 AA1812B4 AA6S1947 T29451 D20641 T58618 AA418731 AI282S00 AW081407 AA604550 

AA729855 AI262538 AI580225 

102915 2903.2 X07820 NMJ002425 BE271570 AI263526 AW296143 AJ829878 A1973162 AJ085155 AA857496 AA709305 CO2220 

134416 30694 1 X68264 NMJM6500 AF089868 BE257461 BE275425 AW997154 AB02799 A1902803 M78206 AA085591 AW392972 

AA325490 BE006161 AA349269 AA323568 AL042548 AA191 148 AA187703 AA322791 AJ297452 T11625 AW366487 

15 AA303513 AA186961 AA1 73480 N28330N28379 W40320AA1 871 18 H03695 AA4C2709 BE407476 H06354 BE276589 

AA351284 AA379921 AL1 38060 BE4 10587 AA1 13094 AA340481 BE277483 R21191 R79518 N86170 AA320505 
AA296065 AW951900 AA658897 AA650052 AA654304 AA191691 N26649 AW080963 AI265800 N72019 AI453458 
AA092563 AA4Q2310 A1439450 AI061054 AA3023S8 T71566 AA302047 AA303432 N21289 H27357 AA303504 AJ174583 
AW151762 AA181958 AW880618 AA630773 A1889539 AW901058 A1373405 AA341941 AA086217 A1675590 AI653936 

20 AA633570 AA987619 A1270656 N93847 N40689 AW517517 N20030 W95985 AA303955 H89170 AA309917 N21642 

AA373132 W38517 AI687806 W76182 AA101065 AA036916 N45635 AI744510 AJ669803 AI039157 AI126355 AA634607 
. AW131 120 AW196838 AA190601 AA911130 BE221320 N92355 AA036752 H03698 AA588873 AI458868 AI041818 
AA090477 AI093248 AA304755 AL 137942 AL044588 AI083709 AJ 150965 N33891 AA635675 AA594898 W94657 
AA182823 AW166205 F27886 R79246 F37329 AA565697 AI075739 AID88654 AJ094287 AI204256 AA095203 T93020 

25 AA688298 AA057324 N23442 AA075411 AA305O46 AI031688 AI191503 AA1 1 1887 AA1 12264 N27929 AA187509 

AJ375522 AM74006 H06297 AI326177 N48880 H28333 AA075490 R22809 W79542 AI055934 AA042901 AA173481 
AA30198S W74531 AI051 747 AA187715 A1888868 AA993017 A1057530 T92954 N80227 AW273595 AI351260 AW170643 
AW292979 AA302605 AA302330 BE349495 AA323602 AA3Q2361 AJ470984 AA155943 AA155914 
105178 7792_1 AA313825 AW960347 AF223468 NM.016613 AA186345 AA186508 AA081195 AA147972 AA348943 AW961667 

3 0 AA187222 AA187207 AW371052 AW449751 AW748803 AW391606 AW371047 AW371057 AW371085 AW362895 

AW371092 AW377556 BE01 0930 AI01 6882 AA247878 C04398 C05158 F1 1398 AA1 88315 H233B5 R55086 H15346 
AA029106 AA2281 14 H17005 F08498 Z43376 AA095582 AA055186 AA463361 R15218 AA299132 AW103578 W21538 
AA428131 AA1871 15 AA157197 AA157167 AW371371 AA363562 AW965995 N55663 Z17878 AA228023 AI140342 
AA 100927 AA496988 AA055917 AI089303 AW014967 AW090248 AW338371 AW131066 D62983 D79713 AI583950 

3 5 AI336781 A1500705 AI471485 AW090239 D79784 D61847 D62789 D61B42 AI086327 AI273381 D81815 D63043 AS13548 

AI280560 AI510828 AA029996 C16343 C16513 AI075741 AW516308 AI804764 AA948068 AJ356588 AW103452 
AW573063 Z39445 C16489 A1949870 F04712 AA147823 AW026284 AI151538 AA081303 AA613890 AI251865 AW086499 
AA992111 A1882091 AI373465BE502094AJ922270 AA884288 AA157079 N56963 AW189145 AA428080 R55056 
AA884068 AW771716 AA186662 C16364 H15723 AJ921 181 AA156888 H17006 AA187490 AW00994 AA346942 H28533 

40 AW129047 R41656 H14636 AA995041 D58370 Z21131 D58186 AJ383271 AA643977 D58044 AI934302 AW779425 

F09065 H14930 AA890693 H23274 

105263 178672 J AWm6&Am7844DAVmm3AW388mAmmZ$AmmUAmmnAWm6G7AmmS3AmM$&7 
AW388480 AW388591 AW388711 AW388511 AW388438 AW388570 AW388449 A1694383 AW237145 AI652991 
AI964041 AW36631 9 AW366321 AW961938 AW46921 1 A16341 55 AJ492186 AI624430 AI677965 M26502 AI963871 

45 . AW378431 AW378421 AI015391 AW352126 N59336 AI352317 AW1971 13 N67998 AW778935 AM76054 AI206626 

R371 16 R4021 1 AA227926 AA639698 R38073 AI001745 T32854 AI619S49 AI423703 F10774 AW388615 T16595 H05894 
105330 182497J AW338625 R43226 R51640 AI307645 AI308100 AI085787 AI420357 AI692610 AA877160 AS53366 AA234743 
104764 90967J AI039243 R68234 AA025351 AA971063 A1537757 AA025362 R81636 T86650 

104865 102037J T79340A1742317AW182676AW451460AM20964R43284M088179AW59^^ 

50 AI827455 AA045136 AW271709 AKXM344 AA639631 AA744417 AA744218 AA045357 AA045351 

106124 54542.1 H93366 AI653547 AA336265 AW966175 BE566451 R71 178 AI630656 AA234331 N55039 AA305632 AW960431 R34044 

R32254 AW020970 AW451281 AW275041 AI636933 A1655640 AA423986 AA642466 AI684063 AIS33876 AI624897 
AA814795 AW590328 AI889166 AW243541 AI439691 AW473445 AI475516 AA741228 AI127534 AA165143 AW74714 
A1654076 AA400674 AI560249 N507O9 AW438621 AI80681 0 A1434579 AJ308184 AA423987 A) 141272 AI565586 

55 AI338440 AA219628 A1246643 At385809 AA724260 AA633988 A064172 AI798439 AI650601 R33503 AW35891 

AA903649 T96161 AA665538 AA219620 AJ309962 AA400707 BE247066 R32178 AI275962 AA661602 AW003197 
BE466649 AA831 198 AI620052 AJ825387 AI634037 AI670978 AI670979 AI655092 R32304 AA828858 AI382428 
AW023660 AA262892 T26891 AW089917 T26926 R32227 
107385 6976J NMJJ05397 U97519 AWB99329 AJ902387 AA077792 AA078525 AW376607 AA077946 AA070415 BE208721 AW1 67958 

60 BE293050 BE208240 A164869B AA101314 BE393348 BE305122 AA077591 BE274036 AA313687 BE392220 BE378954 

AA1 71461 AA46482I AW938242 AW938224 AW938243 AW938232 AA147953 N64294 AA205218 AW305O65 AW517478 
AA307983 AA377023 BE563629 R99976 N80294 T87719 T87928 AA496849 AA486344 AA204938 AW370448 AA318242 
AW964384 H92423 W95317 BE378774 BE391156 AA349138 AA173095 AW513198 AA037672 AA148029 AA169726 
W04791 AA075508 BE382937 BH395034 Af 139793 AA961734 N48612 K64714 AW151251 AI56511 3 AJ566881 

65 AW087370 AA631168 AA622014 AW513098 AI857810 AW152287 AJ052596 AI983246 AA024856 AI912456 A1677938 

AW026403 AA972537 AI088497 AW999869 W94582 AI140166 AJ 160659 A1566868 AA101263 AW190390 AW166466 
AI401207 AJ418156 AJ625265 AJ 146298 AW008592 BE223020 N58926 AI308797 AA037673 AJ935992 AJ304706 
AA024939 AI216589 AJ610423 Ai354621 AI500677 AI679389 AI799310 N64508 AJ128756 AI679897 AW589S3S 
AA989333 AI500527 AA565479 AA913529 AB23295 F21691 AA989376 AI699064 AA902447 A1690910 AA772659 

70 AA204983 AJ337895 R99975 H65205 AA340766 AI339441 A1913855 AA450293 AW192010 AA070416 N72401 AJ371481 

AI247108AI371261 AI364987A1280171 AI269104 AI868756 AA909836 AA983640 AI973271 AA913092AI868205 
AJ 1441 12 A1190975 N58065 A1566638 N93405 AW150504 AW298846 AI687036 AA902984 A1824480 AI625047 AA65314B 
A161 1228 AW131922 AA862687 AA902519 C01732 AW796045 AL044660 
101 192 15367 J BE247295 AW068092 AL041313 AA159244 NM.005415 L20859 AL135570 W47073 AW516906 BE388271 8E408629 

75 W46972 BE293646 BE256547 AI075010 AL041095 AA285300 AL039560 AA368740 W26602 AA399344 AA039235 

W27631 AW834898 AW834914 R93390 AA378039 AV649660 T53674 N 93824 AA399974 AW843378 AA368267 R08256 
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AV653575 R27900 N48215AW366371 N45500 AV652967 AI889251 AJ08O457 N39021 AT738542 AW242849 AJ857471 
AJ359775 AI582830 R75850 N66564 AW341636 AM99006 A1887217 AWQ26694 AW182840 AA039313 AA831346 
AJ393465 AW089210 A/743830 AA744243 AA401310 AW439758 AW086152 RS3391 AA291379 AA225220 AW009358 
AJ192879 AA291202 AI565089 AA225089 AA607688 AI052O58 AB41641 AI066625 AA333864 AA1S9147 AB23912 
5 R75851 AJ761143 AW768588 AA39419S AC88450 AW512564 AJ452775 AJ056520 AA468602 AAS72S66 AJ434739 

AA291838 AI948623 AW768614 AI374753 AW068174 AA884908 Al 199346 AJ 199347 W94946 AI159995 AA877642 
AB80646 A1307610 AA403310 R08205AW1 82123 AI000999 R27808 AW026571 020816 A1560350 T27667 AW960271 
AM 74628 AI432042 AJ424528 AA909562 T17342 AI783868 
1 09001 146370.3 AI056548 AW409843 AW263540 AA723669 AA909334 AA156120 AA157141 AA156125 AW409866 W19499 AA157229 
10 AW887435 
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TABLE 6: 



10 



Pkey: Unique Eos probeset Identifier number 

ExAocn: Exemplar Accession number, Genbank accession number 

UrogenelD: Unigene number 

Unigene TlflK Unigene gene title 

AUC1 : 70* percentile of average Intensity (Al) for probeset at each of 2,6,15,24,43, and 96 hour timepotrds minus 700 peroenffle Al at 0 hrs, 
summed over 5 experiments. 

AUC2: AUC1flO» percentile of Al for aorta, aorfic valve, vein, and artery. 



Pkey EjlAcoi UnlgenelD UntgeneTItle 



AUC1 AUC2 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



314941 
327414 
321911 
331578 



313513 
320635 
326230 
313556 
313665 
324852 
314372 
311877 
322262 
312173 
319795 
313350 
326759 
300318 
313978 
306840 
310272 
315044 
321325 
303251 
302378 
315060 
332048 
337214 
311598 
304782 
312802 
302680 
317452 
318558 
312149 
319267 
321510 
326198 
315730 
310442 
331237 
300469 
338316 
330968 
331019 
331261 
301822 
325544 
328700 
322882 
336034 
316580 
309931 
330692 
319962 
338033 
314943 
332640 
338158 
327036 



AA515902 Hs.130650 ESTs 

predicted axon 
AF026944 Hs.293797 ESTs 
AE46482 Hs.249989 ESTs 
AB018259 Hs.1 18140 KIAA0716 gene product 
AW298600 Hs.141840 ESTs, Weakly similar to S59501 Interfere 
N50617 Hs .80506 small nuctear ribomjdeoprotein poJypepl 
predicted exon 



1038 
3032 
4292 
677.4 
395.2 
324 
394.8 
3572 
433.6 
-83 
3482 
-492 
-1309 
-247.8 
-1025.8 
203.6 
183.8 
1654.4 
^346 
576.6 
56.4 
-127.6 
-102.6 
1080.6 
1270.8 
915.8 



AA628517 Hs.1 18502 
AW751201 Hs.120932 ESTs 
AI380792 Hs.1 35104 ESTs 

AU040178 Hs.142003 ESTs, Weakly similar to The WAA0149 gen 
AA084248 Ks.85339 G protein-coupled receptor 39 
AA632012 Hs.188746 ESTs 

AI821409 Hs304471 ESTs, Highly similar to AF1 168651 hedge 
AB037821 Hs.146858 protocadherh 10 
AW591949 Hs57958 ETL protein 
predicted exon 

AW444502 Hs256982 ESTs, Highly similar to AF11 6865 1 hedge 
AI870175 Hs.1 3957 ESTs 
AW77477 Hs.307912 EST 
AF216389 Hs.148932 semaphorin Rs, short form 
BE547674 Hs204169 ESTs 

AB033100 Hs.300646 K1AA protein (similar to mouse paladin) 
AF240635 Hs.1 15897 protocadherh 12 
AL109712 Hs.296506 Homo sapiens mRNA full length Insert cON 

AA551104 Hs.189048 ESTs, Moderately similar to ALUCJiUMAN I 1236.8 

AW337575 Hs201591 ESTs 522.6 

predicted exon 269 

ESTs 796.4 
gb:nn32h08.s1 NCI_CGAP_Gas1 Homo sapiens 

ESTs 349.6 

ESTs 638.6 

ESTs 360.8 

RNA binding motrf protein. X chromosome 7002 

ESTs 2742 

ESTs 2382 

ESTs 231.8 

predicted exon 581.6 

ESTs 281.6 

ESTs -213 

hypothelicaj protein RJ21065 285 

hypothetkaJ protein FU20401 263 

predicted exon 14942 

ESTs 975.8 

lipase, endothelial 2012 
Homo sapiens mRNA; cDNA DKFZp434B0425 (f 

bitegrin.alpha2(CD49B,alpha2subuni 3562 

predicted exon 1014.6 

predicted exon 627.4 

Homo sapiens cONA FU14035 fis, done HE 843 

predicted exon 782.6 

hypothetical protein RJ12972 746.4 
gb:hd13dOU1 Soares_NFU_T_GBC_S1 Homos 

ESTs 137 
Human DNA sequence from clone RP5-650E9 

predicted exon 540.6 

ceDdMstoncycle2,G1toSandG2to -494.8 

protein regulator of cytokinesis 1 -600 

predicted exon 3112 

predicted exon 351.8 



AW023595 Hs.232048 
AA582081 

Hs.193042 
Hs.38218 
Hs.1 35568 
Hs.146381 
Hs.269651 
Hs.6818 
Hs255748 



AW192334 

AA972965 

AW402677 

T90309 

F11802 

H75391 



Hs201591 
AW072215 Hs.208470 
W87874 Hs.25277 
BE301708 Hs.233955 

R44557 Hs.23748 
NM_006033Hs.65370 
BE539976 Hs.103305 
X17033 Hs271986 



AW248508 Hs.279727 

AA938198 Hs.146123 
AW341683 
R39288 Hs.6702 
H06350 Hs.135056 

Y00272 Hs.184572 
BE56B452 Hs.5101 



303 

423 

103 

393 

32.4 

395 

35.7 

12 

05 

343 

05 

02 

1 

1 

52 
18.4 
12 
1 

23 
0.4 
0 
0 

43 
53 
153 
4.9 
4.7 
26.9 
202 

316.4105 

7.6 

633 

36.1 

6.6 

7.5 

233 

232 

62 

9.7 

03 

05 

0.3 

34.7 

1.8 

03 

478.613 

1.7 

9.4 

62.7 

5.7 

783 

133 

134.8135 
13.7 

143 05 
14 
1 
1 

31.1 
352 
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302655 AJ227892 Hs.1 46274 ESTs 1802 18 

327568 predicted exon 229 22.9 

324801 AW770553 Hs.14553 sterol Oacyllransferase (acy^Coenzyme 1612 16.1 

317850 AI681545 Hs.152982 hypolhetfcal protein RJ13117 -690 1 

5 322818 AW043782 Hs.293616 ESTs 126/4 4.5 

324626 A1685464 Hs.292638 ESTs 1702 17 

317224 X73608 Ks, 33029 sparc/osteonedin, cwcv and kazaWka d 40 0 

310955 AI476732 Hs.263912 ESTs 466.8 46.7 

_ 315240 R38772 Hs.172619 WAA1106 protein 277 27J 

10 338388 predicted axon 257.6 263 

338442 predicted exon 256 25.6 

318617 AW247252 Hs.75514 nucleoside phosphorylase 1247-6 242 

338645 predicted exon 206 20.6 

313135 N58907 Hs.162430 ESTs 2043 20.5 

15 324716 BE169746 Hs.12504 hypofoetica) protein DKFZp761D081 203.6 20.4 

330305 predicted exon 199.8 20 

308248 AI560919 gb1q41g10jd NCLCGAP Ut1 Homo sapiens 199.419.9 

308886 AI833240 gb:at76d10jc1 Baratead colon HPLRB7 Homo 1982 193 

315622 AI796144 Hs.258188 Homo sapiens cDNA FU11674 fe, done HE 1912 19.1 

20 323675 R43240 Hs272168 tumor differentially expressed 1 1892 185 

312164 T91980 Hs221Q74 ESTs 187.6 183 

300378 Z45270 Hs.235873 hypothetical protein FU22672 2713 18.7 

317476 AI343569 Hs. 107000 Homo sapiens mRNA for WDC146, complete c 187 18.7 

317559 AW452344 Hs.1 29977 ESTs 1842 18.4 

25 317207 AI873346 Hs.214505 ESTs 182.8 183 

334834 predicted exon 178.8 17J9 

320925 D62892 gbiHUM337C07B Clontech human aorta pofyA 177217.7 

303289 AL121460 Hs.272673 hypothetical protein HJ20508 316.4 17J6 

328548 predicted exon 174.6 173 

30 317108 AA884000 Hs3173 hypothetical protein FU10803 1714 172 

318013 A1188183 Hs.144078 ESTs 326 172 

314299 AW382682 Hs.154840 ESTs 1703 17.1 

317702 AW173339 Hs.135665 ESTs 1693 17 

316094 AW975920 Hs.283361 ESTs 169.4 163 

35 323706 AA377578 Hs.65234 hypothetical protein FU2059S 1692 163 

325843 predicted exon 321.4 163 

316012 AA764950 Hs.1 19896 ESTs 10472 163 

309687 AW236154 Hs.77385 rrryosin,lightpdypeptide6,anallsrrK»thinu 168.2 16.8 

323329 AL134744 Hs.10852 ESTs 168 163 

40 312853 W05086 Hs.1 14256 ESTs 167.4 167 

313070 A1422023 Hs.1 61 338 ESTs 298.6 163 

314096 AW977642 Hs.291742 ESTs 165.6 163 

338728 predicted exon 165.4 163 

316609 AW292520 Hs.122082 ESTs 165 16.5 

45 305989 AA888220 gb^15hOU1 NCLCGAP_Wd5 Homo sapiens 1643 16.5 

312642 AW052128 gb:wx26c02.x1 NCI_CGAP_KW11 Homo sapten 164 16.4 

339236 predicted exon 163.6 16.4 

317058 AI217713 Hs.147586 ESTs 1613 162 . 

311137 AW207582 Hs.196042 ESTs 5822 162 

50 310178 A1936450 Hs.147482 ESTs 1612 16,1 

320745 K51696 Hs.89278 hypothetical protein FU1 1 186 161 16.1 

317336 AW014637 Hs.130212 ESTs 160 16 

309871 AW300366 gb:xs63b05.x1 NCLCGAPJOdll Homo sapten 159316 

302038 AC004Q76 Hs.1 29709 Homo sapiens chromosome 19, cosmkJ R3021 159 153 

55 332237 N52883 Hs.102676 EST 159 153 

312362 AW015994 gb:UW^I0p*blvfl-094-UU1 NCLCGAP_S 158.6 153 

331558 N62401 Hs.48531 EST 1583 15.9 

316215 AI684535 Hs200811 ESTs 158.4 153 

336059 predicted exon 157.4 15.7 

60 302790 AJ245245 gb:Homo sapiens mRNA far Immunoglobulin 1553 153 

328418 predicted exon 1533 15.4 

304229 AK000149 Hs.29493 hypotheflcal protein RJ20142 153.6 15.4 

331606 AW273285 Hs.50802 ESTs 153 153 

338962 predicted exon 664.4 153 

65 317959 AI204202 Hs.130264 ESTs 1523 153 

336228 predicted exon 152.4 152 

313534 AW072916 Hs.78743 zinc finger protein 131 (done pHZ-10) 1522 152 

317404 AI806867 Hs.126594 ESTs 152.2 152 

311943 A1469911 Hs26498 hypothetical protein FU21 657 152 152 

70 314680 A1247425 Hs.152182 ESTs 151.4 15.1 

331484 N29696 Hs.44076 EST 1512 15.1 

338116 predicted exon 1512 15.1 

329863 predicted exon 150.6 15.1 

315555 AW452886 Hs.239107 ESTs 149.6 15 

75 317039 AA868583 Hs.126153 ESTs 149.6 15 

331138 R63816 Hs.28445 ESTs 149.6 15 
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316561 AJ917222 Hs.121655 ESTs 149.4 144 

328695 predicted axon 149.2 143 

302282 BE396283 Hs.173987 eukaryotic translation hlfiato factor 148.4 144 

318781 F11802 Hs.6818 ESTs 1482 144 

5 323709 AW297246 Hs288546 Homo sapiens cDNAFU14190fe, clone NT 148 143 

310790 AW192063 Hs.248865 ESTs 1474 144 

316833 AW292614 Hs.124367 ESTs 147.8 144 

323176 NMJX>7350Hs42101 pleckstrtn homology^ domain, famBy 229 144 

324188 AW274439 Hs^52709 ESTs 1474 144 

10 317441 AA922798 Hs. 198583 ESTs 147.4 14.7 

317584 A1825890 Hs220513 ESTs 1464 14.7 

321798 AD08206 Hs.181959 ESTs 1464 14.7 

304363 AA206045 gb:zq77f05.s1 Stratagene hNT neuron (937 1464 147 

313952 F20956 glxHSPD05390 HM3 Homo sapiens cONA done 146414.7 

15 301909 A1702609 Hs.15713 ESTs 2634 147 

309196 A1904895 Hs.9614 nucteophosmln (nucleolar phosphoprotEin 1462 14.6 

321860 N47474 Hs^12631 ESTs 1462 144 

330187 ■ predicted exon 146 144 

323042 AA463571 Hs. 172550 polypyrimxfine tract bindirg protein (he 1454 144 

20 313636 AA262397 Hs.201366 ESTs 1452 144 

302437 AB024729 Hs.227473 UDP^^tylglucosamin^a-l^mannoskl 145 144 

318197 AJ473096 Hs. 133403 ESTs 1444 144 

302749 M16951 gtoHuman tg mu-chah mRNA VDJ4-fegbn, 5 1444 144 

322357 AI734258 Hs245367 ESTs, WeaWy similar to ALU1 JiUMAN ALU S 1444144 

25 300391 AI927371 Hs288839 hypc*etical protein FU12178 144.4 14.4 

326077 predicted exon 144.4 14.4 

302004 Y18264 Hs.1 23094 sat (DrosophBaHike 1" 144 14.4 

320668 AA805666 Hs.146217 Homo sapiens cONA: FU23077 fis, done L 144 14.4 

331212 T88693 Hs.226410 ESTs 144 14.4 

30 311268 A1969727 Hs^31859 ESTs 143.2 144 

305159 AA659166 Hs275668 EST.WeaxjysbnflartoEFID JWMANELONGATtONF 143 144 

304510 AA457391 Hs.119122 ruxJSoriabiDiBinL13a 1424 144 

320852 AA772920 Hs403527 ESTs 1424 144 

330854 AW291944 Hs.122139 ESTs 1428 144 

35 318275 AW449952 Hs.190125 baslc^ellx-kxpfleRx-PAS protein 1426 144 

314992 AI824879 Hs.211286 ESTs, Weatty slmflar to 1207289A reverse 1422 142 

322631 AA001697 Hs.293565 ESTs, Weakly similar to putative p1 50 [H 1422 142 

332283 R40855 Hs.100839 EST 142 142 

302894 AA719572 Hs.274441 Homo sapiens mRNA; cONA DKFZp434N011 (fr 141.2 14.1 

40 301808 R35391 Hs.252831 ratfcutonS 141 14.1 

318608 A1204491 Hs.151502 ESTs 141 14.1 

316499 AW292947 Hs.122872 ESTs 140.8 14.1 

317011 A1246760 Hs.150276 ESTs 1404 14.1 

321840 N45600 Ks.46534 Homo sapiens mRNA; cONA DKFZp434P0714 (f 140414.1 

45 327365 predicted exon 1404 14.1 

331264 AA278898 Hs.225979 hypothetical protein similar to small G 1404 14.1 

324545 AW501944 Hs. 127243 Homo sapiens mRNA tor K1AA1724 protein. 140.4 14 

312986 AA211586 gb:zn56dG5.s1 Stratagene muscle 937209 H 1402 14 

316053 AA825814 Hs.149065 ESTs 140.2 14 

50 330723 BE247449 Hs.31082 rrypothefical protein FU10525 1402 14 

304876 AA595765 gbmj28g06 .si NCLCGAPJW1 Homo sapiens 1394 14 

311379 AW134766 Hs.202450 ESTs 1394 14 

318265 AW019873 Hs.146840 ESTs 1394 14 

324137 AA393127 Hs.222762 ESTs 1394 14 

55 328262 predicted exon 1394 14 

322349 AK001279 Hs.180171 Homo sapiens cDNA FU10417 fis, done NT 139.4 13.9 • 

323504 AA2S0223 Hs.130865 ESTs 139.4 134 

304261 AA059387 gbzf66d01.s1 Soares retina N2WHR Homo 1392 13.9 

310489 AW451493 Hs.235516 hypoftetlcal protein PRQ2955 1392 13.9 

60 335946 predicted exon 139.2 134 

318155 AI041546 Hs.132133 ESTs 1384 13.9 

313796 AT797169 Hs.208486 ESTs 1384 134 

333977 predicted exon 138.6 134 

324845 AW969635 Hs.283718 ESTs 1382 134 

65 331139 R65706 gb:yii6g12s1 Soares placenta Nb2HP Homo 1382 134 

331131 R54797 gb:yg87b07.s1 Soares Infant brain 1N1B H 669.6 134 

321250 H58539 Hs.1S1692 ESTs 138 134 

312498 AA668782 Hs.191284 ESTs, Weakly slmflar to ALU1_HUMAN ALU S 1374134 

331252 W52470 Hs.34578 a!pha^3^iaJyrtransferase 1374 134 

70 337407 predicted exon 1374 134 

303973 AW512014 gbxx68a03jc1 NCLCGAPJ.ym12Homosapten 137.4137 

314582 AA412258 Hs.188817 ESTs 137.4 137 

327373 predicted exon 1372 137 

323367 AA234591 Hs404123 ESTs 1364 137 

75 316207 AA832065 Hs.120260 ESTs 136.4 134 

315231 AA705809 Hs.1 19922 ESTs 136.2 134 
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318592 T39310 Hs.1139 cold shock domain protein A 136.2 13* 

320908 AW969706 Hs.293332 ESTs 1362 13j6 

328937 predicted exon 1362 116 

329073 predicted exon 1362 13* 

5 318231 AV659082 Hs.134228 ESTs 138 13* 

311992 AL380200 Hs.114145 ESTs 135.8 13j6 

316497 AA768457 Ks.136849 ESTs 135* 13* 

317677 AA968594 Hs.127868 ESTs 135* 13* 

321680 W02B48 HsJ93704 ESTs 135* 13* 

10 326080 predicted exon 135* 13* 

330938 AF036943 Ks.172619 WAA1 106 protein 135* 13* 

306573 AL134878 Hs.1 19500 rtoosomal proteK brae P2 135* 13* 

307383 AG23207 Hs.147888 EST ■ 135* 13* 

311114 AW449382 Hs.195297 ESTs 135* 13* 

15 320579 R15138 Hs.165570 Homo sapiens clone 25052 mRNA sequence 135 13* 

301328 AA884104 Hs.125546 ESTs 134* 13* 

312063 N5B198 Hs.182898 ESTs 134* 13* 

323036 H09604 Hs.13268 ESTs 134* 13* 

332776 AF241850 Hs.151428 ret linger protein 2 134.4 13.4 

20 332494 AA282330 Hs.1 45668 ESTs 134.2 13.4 

334376 predicted exon 1342 13.4 

313264 N93416 Hs.118228 ESTs 133* 13.4 

313669 AA351109 Hs*437 Taxi (human T-cefl leukemia virus type I 1332 13* 

312083 T87398 Hs205816 ESTs 132* 13* 

25 319354 AA993807 Hs.167367 ESTs 132* 13* 

307414 AI242106 gb#i92a02j(1 ScaresJlR^T_GBC__$1 Homo s 1322132 

312771 AA018515 Hs264482 Apg12(aiitoor^y12,S.c«iBVisiaeHike 131* 132 

313004 AI274963 Hs.145900 ESTs 1312 13.1 

300995 AW510641 Hs258018 ESTs 220.6 13 

30 319323 F12650 Hs.13287 ESTs 125.4 12* 

329451 predicted exon 123.4 12* 

337603 predicted exon 572 122 

312480 R68651 Hs.144997 ESTs 121.4 12.1 

324934 AW452051 Hs.147546 ESTs 119.4 11* 

35 320723 BE178025 Hs.7942 hypothetical protein FU20080 117 11.7 

318188 AI792566 gb:ql74f02y5 NCI_CGAP_Ov26 Homo sapiens 116*11.7 

320873 AF238869 Hs283955 Homo sapiens clone GLSH-2 similar to gD 112* 11* 

331005 BE003191 Hs.119555 ESTs 112* 11* 

304969 AA614406 gbmp4Sfl)5.s1 NCI_CGAP_Br11 Homo sapiens 112.4112 

40 319799 AI139253 Hs.227767 a'nc finger protein 41 1112 11.1 

302610 AA347945 Hs256024 ESTs 111 11.1 

309485 AW13O320 Hs.108124 ribosorrral^teInS4,X-Ilnked 111 11.1 

311880 AW419225 Hs256247 ESTs 1102 11 

313981 AW452334 Hs.128148 ESTs 1102 11 

45 322442 W49701 Hs29667 ESTs 109.4 10* 

315099 AA806538 Hs291841 ESTs 109 10* 

304793 AA583264 Hs. 182979 riboscmalproteinl.12 108* 10* 

330815 AA019211 Hs236463 KIAA1238 protein 108* 10* 

304044 T81656 Hs252259 ribosomal protein S3 714* 10* 

50 325222 predicted exon 135 10* 

325889 predicted exon 814.6 10* 

321447 AW891130 Hs.33173 ESTs 107* 10* 

302990 AA496212 Hs.180182 ESTs 1062 10* 

308106 A1476803 gMj77e12jc1 Soares_NSFJ^8_9W_OT_PAJ J _S 270*10* 

55 310536 AD01041 Hs.150174 ESTs 106 10* 

315257 AW157431 Hs.248941 ESTs 233 10* 

318787 Z42313 Hs22657 ESTs 105.8 10* 

312306 AJ927226 Hs.175610 ESTs 1052 10* 

326788 predicted exon 104.4 10.4 

60 312234 AA830840 Hs206934 ESTs 104 10.4 

314482 AW085525 Hs.134182 ESTs 234 10.4 

323597 AI185693 Hs.135119 ESTs 102.4 102 

302623 AW836724 Hs.194110 hypothetical protein PRO2730 162.4 102 

323594 AI791531 Hs.129993 ESTs 101 10.1 

65 324315 N55761 Hs.194718 zinc finger protein 265 100.2 10 

314217 AA256465 Hs.188725 ESTs 992 9.9 

320932 AA554913 Hs.162297 ESTs 982 9* 

327876 predicted exon 982 9* 

319736 R17424 Hs.6650 vacuotar protein sorting 45B (yeast homo 98 9* 

70 327747 predicted exon 97.6 9.8 

327844 predicted exon 97.4 9.7 

318200 AI061192 Hs.166517 ESTs 972 9.7 

329414 predicted exon 972 9.7 

318296 AI089667 Hs270713 ESTs 121.4 .9.7 

75 307010 AIM0014 gb;aa68/D9Jc1 Soares>teU>earLNbHH19W295 9.7 

319792 AI138635 Hs.22968 ESTs 385.4 9* 
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305671 AAB11688 Hs32113 dUTPpymphosphatase 96 9.6 

329440 predicted exon 933 9.4 

310381 AI263059 Hs.145594 ESTs 93.4 9.3 

318824 F06771 Hs.27226 ESTs 93.4 93 

5 3269S7 predicted exon 922 9.2 

318804 Z42S49 Ha.160893 ESTs 92 9.2 

330836 AA055611 Hs226568 ESTs, Moderately sbnDar to AIU4_HUMAN A 92 9.2 

324592 AW752437 HS325708 ESTs 913 9.2 

311820 AW274545 Hs254333 ESTs 91.4 9.1 

10 321614 H86161 gb.ys94b01.r1 Soares retina N2b5HR Homo 91 9.1 

330306 predicted exon 91 9.1 

303096 AL080276 Hs.268562 regulator of G-proteln signalling 17 90 9 

313275 AJ027604 Hs.159650 ESTs 110.4 8.8 

302593 H54855 Hs.36958 ESTs 88 8.8 

15 321421 BE465115 Hs.171688 ESTs 862 8.6 

330832 AJ 133530 Hs.62930 ESTs 456.4 8.6 

311847 AW301807 Hs.297260 ESTs 86 8.6 

322036 BE002723 Hs.301905 Homo sapiens cONA RJ14060 fls, done HE 1453 8.6 

328688 predicted exon 853 . 8.6 

20 325251 predicted exon 85.4 8.5 

329088 predicted exon 85.4 8.5 

322524 W79027 Hs271762 ESTs 84 8.4 

337953 predicted exon 451 8.3 

323529 AA284397 Hs201485 Homo sapiens clone FLC06S4 PR02866 mRNA, 82,6 8.3 

25 307041 AI144243 gkqb35b12.x1 SoareaJetaLhearU4bHH19W 306.88.2 

318285 AI332454 Hs.158412 ESTs 81.4 8.1 

312021 AA759263 Hs.14041 ESTs 81 8.1 

329350 predicted exon 81 8.1 

326169 predicted exon 60.4 8 

30 338038 predicted exon 1024J2 7.9 

312549 A1214510 Hs.146304 ESTs 77.4 7.7 

312542 D60076 gtxHUM084E10A Ctontech human fey brain 763 7.7 

320992 AB026891 Hs225972 solute canter family 7, (cation* amino 76 7.6 

318596 AI470235 Hs.172698 EST 150.6 7.5 

35 315650 AA649042 Hs269615 ESTs 73.4 73 

324328 AA447276 Hs.292020 ESTs 210.4 7.1 

332622 R10674 Hs. 128856 CSR1 protein 702 7 

328229 predicted exon 69.4 6.9 

319110 T75260 Hs38321 hypoftefical protein FU14103 683 63 

40 316133 A1187742 Hs.125562 ESTs 308,6 6.9 

303992 AW515800 gb:hd68g01 j(1 NCLCGAP.GC6 Homo sapiens 673 63 

322675 AA017656 Ha 146580 enoiase I (gamma, neuronal) 377.2 6.7 

325753 predicted exon 1052 6.6 

312539 AKW4377 Hs.200360 Homo sapiens cDNAFU13027fis,ctone NT 922 6.4 

45 302592 AA294921 Hs.250811 vwal simian leukemia viral oncogens horn 361.6 6.3 

314578 AA410183 Hs.137475 ESTs 201.6 6.1 

335986 predicted exon 1083 6 

321478 AW402593 Hs.123253 hypofceficai protein FU22009 523 6 

305192 AA666019 gfcan44a04.s1 Jla bone marrow stroma Horn 58.6 53 

50 304275 AA070605 gb*^m53h09^1 Stratagene fibroblast (937 783 5.6 

302779 AJ235667 gb:Homo sapiens mRNA for Immunoglobulin 2783 5.5 

301976 T97905 Hs.77256 enhancer of zeste (Drosophila) homoiog 2 4792 5.4 

316021 AW293399 Hs.144904 nuctear receptor co-repressorl 792.4 5.3 

320802 BE336699 Hs.185055 BENE protein 24233 5.3 

55 317282 AJ733112 Hs.176101 ESTs 5232 5.1 

316827 AI380429 Hs.172445 ESTs 578 5.1 

303190 BE260787 Hs.16079 hypothetical protefo RJ10233 223 5.1 

315587 A1268399 Hs.140489 ESTs 136.2 5 

333122 predicted exon 399 5 

60 310214 A1220072 Hs.165893 ESTs 234.4 4.9 

320089 043945 Ks.113274 transcription factor EC 68 4.9 

309328 AW024348 Hs.233191 EST, WeaWy similar to A27217 glucose tr 2583 4.8 

318971 Z44067 Hs.10957 ESTs 3763 43 

327220 predicted exon 47.4 4.7 

65 315757 AW0146O5 Hs.179872 ESTs 177.4 4.7 

320730 R68669 Hs.151072 ESTs 205.2 4.6 

313339 A1682536 Hs. 163495 Homo sapiens cONA FL) 13608 fts. done PL 260 4.5 

318634 T49598 Hs.156832 ESTs 4752 4.5 

320955 AW820O35 Hs.278679 a dointegrin and rrutoBoprotelnase <toma 3883 4.4 

70 306605 AI000497 Hs.119500 ribosomalprotein.larg^ 313 4.4 

309349 AW051913 gbm24a09j(1 NCLCGAPJOdU Homosaplen 102.443 

306004 AA889992 Hs.2186 eukaryoflctranslafic^longatcnfactorlfla 4512 42 

330020 predicted exon 612 4.1 

302308 AW327279 Hs.91379 ribosornal protein L26 342 33 

75 314648 AW979268 gbf ST391378 MAGE resequences, MAGP Homo 56.4 33 

315131 AI753709 Hs.152484 ESTs 130.4 3.7 
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313890 AJ493591 Hs.78146 platslsVendolheSal cell adhesion mrJee 3179.6 16 

333565 predicted exon 175.4 3.5 

312911 H93366 Hs.7557 Homo sapiens cDNA: FU21962fis, done H 219 33 

322966 AA633669 Hs,235920 Homo sapiens ceE recognifoi motectite C 3502 3.4 

5 312492 R71072 Hs.191269 ESTs 322.8 3 

318988 Z44203 Hs26418 ESTs 25 2.5 

332363 AI123705 Hs.106932 ESTs 773.4 15 

324181 AI025476 Hs.131628 ESTs 6343 14 

311717 AW2053S9 Hs.312830 ESTs 542 2.4 

10 321342 AA127984 K&222024 transcription factor BMAL2 23.4 13 

308852 AJ829848 Hs. 182937 peptWytpro!ytlsomeras8A(cydophirinA) 92 13 

331466 AA373210 Hs.43047 Homo sapiens cONAFU13585 lis. done PL 494 13 

320279 AB033062 Hs.134970 DKFZP434W178 protein 762 12 

322221 N24236 Hs. 179862 nudeosome assembly proteki 1-Qce 1 2532 11 

15 302925 AL137449 Hs. 126666 homeoboxB4 136.6 11 

331384 AB041035 Hs£3847 NADPH oxidase 4 720 1.8 

300938 AA514418 Hs.152320 ESTs, Weakty similar to 1 605244A erylhro 27 1.8 

312695 AW196663 Hs2Q0242 ESTs 303.8 13 

320223 W35132 Hs.267442 ESTs 189 1.5 

20 332743 AW247977 Hs37595 transk)case^lnr^rr#x*orKir^rr£mbr 14.4 1.4 

331039 AW378685 Hs.18625 M&>chondrial AcyWJoA TWoesterase 5293 1.4 

333123 predicted exon 3962 1.4 

328455 predicted exon 91.6 13 

334456 predicted exon 406.4 1.3 

25 313478 AA643008 Hs.192775 ESTs 413.4 1.1 

309899 AW338564 Hs217493 annexinA2 -303 1 

311735 AW294416 Hs. 144687 Homo sapiens cDNARJ 129818s, done NT -618 1 

312953 NA/L001992HS.126087 coagulation factor II (thrombin) recepto .73.6 1 

313055 AW367295 Hs241175 ESTs 433 1 

30 313291 A1267970 Hs.150614 ESTs, Weakly similar to ALU4 JtiJMAN ALU S -63 1 

315059 AW275110 Hs.271106 ESTs -67 1 

322284 AI792140 Hs.49265 ESTs 4952 1 

322450 AL121278 Hs25144 ESTs -13 1 

324603 AW975183 Hs.292663 ESTs 4.4 1 

35 331495 AW970939 Hs291039 ESTs -2818 1 

333610 predicted exon -1516 1 

335093 predicted exon -232 1 

339403 predicted exon -3312 1 

302820 X04588 Hs35844 neurotrophic tyrosine kinase, receptor, 591.2 1 

40 302270 R56151 Hs.93589 Homo sapiens mRNA; cONA DKFZp564B1 162 (f 276.61 

323755 AW300094 Hs.136252 ESTs 135 0.9 

326946 predicted exon 727.4 0.9 

315343 BE144306 Hs.179891 ESTs, Weakly similar to P4HA JtUMAN PROLY 12180.9 

' 311168 AKD01270 Hs. 196086 hypotheSca! protein FU 10408 304 0.9 

45 329732 predicted exon 109.2 0.9 

321415 BE621807 Hs.3337 transmembrane 4 supertamily member 1 4143 0.7 

333121 predicted exon 873 0.7 

333120 predicted exon 3793 0.7 

330392 AW797956 Hs.75748 proteasome (prosome, macrapain) subunrt, 5892. 0.7 

50 314711 AA769365 Hs.126058 ESTs 47 0.6 

330865 BE409857 Hs.69499 hypothetical protein 347.4 0.6 

333169 predicted exon -1182 03 

335095 predicted exon 1064 03 

335815 predicted exon -156 0.6 

55 330232 predicted exon 102.6 0.6 

330823 AA031565 Hs.221255 ESTs, Moderately similar to ALU5_HUMAN A -62 03 

331704 F04225 . Hs.66032 ESTs -143 0.5 

302642 NM_01642BHs.130719 NESH protein 2673 0.5 

304484 AA432067 Hs.258373 ESTs 85 0.5 

60 310230 AKD00377 Hs.144840 homolog of mouse C2PA -70 0.4 

301531 AJ077462 Hs.134084 ESTs -195.4 0.4 

306337 AA954221 Hs.73742 ribc^lprotein.brge,P0 -33.4 0.4 

331327 N46436 Hs.109221 ESTs 492 0.4 

332961 predicted exon -5.6 0.4 

65 322796 W31178 Hs.154140 Homo sapiens ovary-specific addle prote 480.6 03 

328857 predicted exon 552 0.3 

316342 AA743935 Hs202329 ESTs 43.4 0.3 

331263 AW780192 Hs.267596 ESTs -180.4 0.3 

335987 predicted exon -134 0.3 

70 311923 T60843 Hs.189679 ESTs 112 03 

310522 AW134529 Hs244647 ESTs -1873 0.3 

315363 AA759190 Hs.121454 ESTs. Weakly similar to olfactory recept 60 0.3 

302032 NMJJ01992HS.12S087 coaguiafion factor U (fbrombki) recepto 477 03 

313140 BE265133 Hs217493 annexinA2 954 0.3 

75 310860 AW015920 Hs. 161 359 ESTs -239 03 

317899 AI952430 Hs.150614 ESTs. Weakly simBar to ALU4J<UMAN ALU S -7152 03 
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328520 predicted exon -109-2 02 

302406 NM.012099H&211956 (mepsiJavassodated protein; anfeens 10 02 

311804 AI866921 Rs .203349 Homo sapiens cDMA FU12149 fis, done MA -252.6 02 

315065 AK0Q1122 Hs. 105859 rrypothefoal protein FU10260 4Q2 02 

5 314129 AA228366 Hs.1 15122 ESTs -308.8 0.2 

335697 predicted exon 472 0.2 

335989 predicted exon 89 02 
320606 AW857943 Hs.127216 hypdhefical protein FU 13485 -205.6 0.2 
329745 predicted exon 103 02 

10 313628 AW419069 Hs.209670 ESTs -177.8 02 

334616 predicted exon -836.6 02 

308820 A1821267 Hs.207243 EST -72 02 

320416 A1026984 Hs.293662 ESTs -16.4 02 

335211 predicted exon -142 02 

15 323629 AA375957 Hs.6682 ESTs -100 0.1 

331420 AW452904 gblW«l3^h-11-04JLs1 NC|_CGAP_Su 83 0.1 

315984 AI015862 Hs.131793 ESTs -250.6 0.1 

332833 predicted exon 3742 0.1 

332607 NM_002314Hs.36566 UM domain kinase 1 -21 Jo 0.1 

20 313467 AA004879 Hs.1 8782) ESTs -288.2 0.1 

323333 AV651680 Hs.208558 ESTs -735.6 0.1 

330775 AW247020 Hs.250747 SUMO-1 acfivafing enzyme subuntt 1 53.6 0.1 

333186 predicted exon -1041 J 0.1 

332079 AI306876 Hs.103849 ESTs 19.4 0.1 

25 322724 AF161442 Hs.1 91591 Homo sapiens HSPC324 mRNA, partial cds -123J6 0.1 

303652 AI799111 Hs.64341 ESTs -46.4 0.1 

303131 AW081061 Hs.103180 DC2 protein -156.4 0.1 

320716 A1479439 Hs.171532 ESTs -146.6 0.1 

300454 AA659037 Hs.163780 ESTa -304 0.1 

30 312757 AI285970 Hs.183817 ESTs -445 0.1 

312391 R43707 Hs.133159 ESTs, Weakly similar to PIHUSD salivary -111.8 0.1 

308877 A1832519 gbat69h03^1 Barstead colon HPLRB7 Homo -149.6 0 

311275 AI659166 Hs.207144 ESTs -62.6 0 

302363 AW163799 Hs.198365 2^isphosphog!ycerate rnutase -15 0 

35 321717 AW956580 Hs.42699 ESTs -1059.6 0 

302638 AA463798 Hs. 102696 MCT-1 protein -332.2 0 

306352 AA961367 gb:or52a05.3l NC1_CGAP_GC3 Homo sapiens 21 J 0 

313798 AJ292148 Hs.71622 SW1/SNF related, matrix associated, acti -07.2 0 

320807 AA135370 Hs.188536 Homo sapiens cDNA: FU21635 fis, done C -2222 0 

40 320931 AW262836 Hs.252844 ESTs -881.6 0 

332450 AW288085 Hs.11156 nypoflreScal protein 28.4 0 

332535 Af 167706 Hs.19280 cysteine^ motor neuron 1 -722 0 

335990 predicted exon -421 0 
330746 AB033886 Hs.8619 SRY (sex determining region YJ-box 18 35.4 0 

45 316820 AI627912 Hs.130783 Forssman synthetase -373.6 0 

337429 predicted exon -257 0 

331192 BE622021 Hs.152571 ESTs,HTghr/s^tolGF-llrr^-bind 43 0 

330509 AI346201 Hs.76118 ubiquiUn carooxyUarminal esterase L1 -280 0 

323593 AJ 739435 H&39168 ESTs 4627.6 0 

50 302704 AA531133 Hs.4253 hypothetical protein MGC2574 478.6 0 

330534 NM_004579Hs.82979 rrdt^erKrtivating protein kinase kinase -244 0 

332374 X91195 Hs.100623 prraspt>o^C,beta3.n^hborpseudo -12042 0 

333221 predicted exon -189.6 0 

335988 predicted exon -122.6 0 

55 330574 A1984144 Hs.66713 hepaftis delta anfigerrtrrteracting prat -2257.4 0 

312052 BE621697 Hs.14317 nucleolar protein tamfly A, member 3 (H/ -359.2 0 

319568 AF131781 Hs.84753 hypofiieScai protein FU12442 # 474.6 0 

337113 predicted exon -24.6 0 

335149 predicted exon -191.8 0 
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TABLE 6A 

Table 6A shows the accession numbers for those pkeys lacking unigenelD's for Table 6. The pkeys in Table 7 lacking unigenelO's are represented within 
Tables 1-6A. For each probeset we have fisted tha gene duster number from which (he oQgonucteofides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwfst, Oakland California). The Genbank accession numbers for sequences comprising each duster are fisted In the 'Accession' 



Pkey: Unique Eos probeset identifier number 

CAT number Gene duster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 



320925 1525201.1 D62892 D79755 D62760 

20 321614 87866.1 H86161 AA054308 AA018955 

313952 136885.1 F20956 AA129374 AA133740 AW819878 

314648 293660.1 AW979268 AA878419 AA431342 AM31628 

302749 458 107 M16951 M16952 M 16948 M16949 M16950 

312362 764066.1 AW015994 R39898 AW000978 AJ598202 AB21706 

25 312542 1522649 J D60076 D6Q259 D61037 

312642 1005225.1 AW052128 H51439 H51481 

312986 171879.1 AA211586 F35789AA211641 F29720AW937387AW937408 

329350 cjtjis 

^ 329414 c_y_hs 

30 329440 c_y_h3 

329451 c_y_hs 

338033 CH22^6528FG - JJNK.EMAC00 

338038 CH22^6535FG JJNK_ENtAC00 

338118 CH22J650FG_JJNK.EMAC0O 

35 338158 CH22_6700FG_JJNrCEM:AC0Q 

329732 c14_p2 

329745 c14_p2 

308106 A1476803 

329863 c14_p2 

40 338316 CH22_6944FG_UNK_EMAC00 

308248 AI560919 

338388 CH22J034FG_JJNK^EMAC00 

338442 CH22_7109FG_UNK.EMACOO 

338645 CH22 J41 0FG_JJNK_EM:ACO0 

45 338728 CH22.7527FGjUNK_EKtAC00 

308877 AI832519 

338962 CH22.7838FG.JJNKJW32I10 

308886 AJ 833240 

333120 CH22.349FG_81.3_UNK.EMA 

50 333121 O^350FG.81_4JJNK_EfAA 

333122 CH22.351FG_81.6_UNK.EMA 

333123 CH22_352FG.8U_UNieEMA 

333168 CH22.400FG_94.1_UNK.EMA 

333169 CH22_40irc_94_2JJNK_£MA 
55 333221 CH22.458FG.105.1_UNK_EM: 

326077 c17_hs 

326080 c17_hs 

326169 c17_hs 

326198 c17_hs 

60 326230 c17_hs 

333585 CH22.846FG.203_4JLINK_EM: 

333610 CH22j71FGJ217_5_UNK_tM: 

335093 CH22_2423FG.492.3JJNK_EM 

335095 CH22^2425FG_492_5_UNK_EM 

65 335149 CH22J!484FG_499_5JJNK_EM 

326759 c20_hs 

333977 CH22J254FG_3Q9_6JJNK_EM 

326788 c20_hs 

335211 CH22 W 2550FG 511_2JJNK_EM 

70 305192 AA666019 

303973 AW512014 

303992 AW515800 

326946 c21Jra 

328229 c_6_hs 

75 328262 c_6_hs 
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328418 Qj_hs 
328455 c_7Jis 

335697 CH22_3058FG_596J2JINK^E 
328520 c_7_hs 
5 328548 c_7_hs 

335815 CH22.3187FG_618_3JUNKJM 
328688 c_7_hs 
328695 c_7_h3 
307010 A1140014 
10 337113 CH22_5058FG_493J_ 
307041 AI144243 
328700 c_7_hs 

335946 CK22_3324FG_646J0_L/NKJ> 
335986 CH22_3366FG_654_10_UNK_D 
15 335987 CH22J387FGJ54 J 1JJNKJ) 

335988 CH22_3368FG_654 J 2JJNKJ) 

335989 CH22JJ369FG_655JUJNK_DJ 

335990 CH22_3370FG_655_4JJNK_DJ 
337214 CH22_5288FG_613_7_ 

20 330020 c16_p2 

305989 AA888220 

328857 c_7Jis 

328937 c_8_hs 

32B957 clSJw 
25 330187 c_4_p2 

337407 CH22_5607FG_755_1_ 

337429 CH22_5633FG_762_3_ 

330232 c_5 _p2 

307414 A1242106 
30 330305 cl7j>2 

330306 c_7_p2 

337603 CH22_5896FG_LINK_C20H12. 
337953 CH22^6395FG_LINK.EMAC00 
339236 CH22_8181FG_LINK_BA354I1 
35 339403 CH22_8384FG„LINK„BA232E1 
309349 AW051913 
325222 clOJu 
325251 dOJis 

318188 956161J A1792565 AI053836 AI054127 A1792489 A1288324 
40 309871 AW300366 
325544 c12Ja 
309931 AW341683 

332833 CH22_50FG_17J.UNICC20H1 

302779 33837 J AJ235667 AJ235666 AJ235664AJ235665AJ235668 AJ235669 AJ235670 
45 302790 34168J AJ245245 AJ245247 AJ245257 AJ245248 AJ245254 AJ245256 AJ245253 AJ245203 AJ245250 AJ245252 AJ245243 AJ245204 
AJ245201 AJ245206 AJ245246 AJ245255 AJ245205 AJ245202 AJ245251 AJ245249 AJ245207 AJ245244 

332961 CH22J85FG_48J8_UNK£M: 

325753 c14_hs 

327036 c21_hs 
50 325843 c16Jb 

325889 c16_hs 

304261 AA059387 

304275 AA0706O5 

334376 CH22J67GFG_379JJ.INK.EM 
55 327220 Q_1J» 
304363 AA206045 

334458 CH22J757FGJ91_2JJNK_EM 
327365 Q_1Jw 
327373 c_2J» 
60 334616 CH22J923FG_411J5JUKK_E 
327414 <l2Jis 
327568 OJa 

336034 CH22_3419FG_678_5_UNK_DJ 
336059 CH22_3445FG_684J UNKJW 
65 334834 CH22_2148FGjK»JJJNK_EM 
304782 AA582081 
304876 AA595765 
327747 c_5_hs 

336228 CH22_3626FG_730_4JJNK_DA 
70 329073 cjLhs 

329088 cj<Jw 

304969 AA614406 

327844 c_5_hs 

327876 c_6Ju 
75 306352 AA981367 

331131 gonbanKJl54797 R54797 
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331139 genbank_R65706 R65706 

331420 675963 J AW452904 AW449414 BE467908 A1298565 BE549932 BE326357 PD4362 
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TABLE 6B 

Table 6B shows (he genomic positioning for those pkeys lacking unJgena ID's and accession numbers in Table 6. The pkeys in Table 7 lacking 
unigenetD's are represented wiftln Tables 1-66. For each predicted exon, we have listed the genomic sequence source used for prediction. Nucleotide 
locations of each predicted exon are also listed, 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers tn this cota ere Genbank Identifier (Gi) numbers. "Dunham L et at* refers to (he pubfxation 

entitled The DMA 

sequence of human chromosome 22.' Dunham I. et aL, Nature (1999) 402:489-495. 
Strand Indicates DNA strand from which exons were predicted, 

NLposttlon: Indicates nucleotide positions of predicted exons. 



Pkey 


Ref 


Strand 


NLposIUon 


332961 


Dunham, I. etal 


Plus 


2521424-2521555 


333221 


Dunham, I. etaL 


Plus 


39780703978187 


333585 


Dunham, LetaL 


Pius 


6234778-6234894 


333610 


Dunham, I. etal. 


Plus 


6547007-6547116 


334376 


Dunham, I. etaL 


Plus 


13902218-13902331 


334458 


Dunham. I. etal. 


Plus 


14353496-14353572 


334616 


Dunham, I- etal. 


Plus 


15176123-15176470 


335149 


Dunham I. etal. 


Plus 


21497441-21497587 


335211 


Dunham, I. etaL 


Plus 


21774611-21774680 


335697 


Dunham, L etal. 


Phis 


25481456-25481649 


335966 


Dunham, I. etal. 


Plus 


27967791-27967852 


335967 


Dunham, I. etaL 


Plus 


27971413-27971481 


335988 


Dunham, total. 


Ptus 


27977912-27978013 


335969 


Dunham, I etaL 


Plus 


27983788-27983860 


335990 


Dunham, I. etal 


Plus 


27988532-27988608 


336034 


Dunham, I. etal. 


Phis 


29014404-29014590 


337953 


Dunham, I. etal. 


Plus 


6827029-6827125 


338033 


Dunham, I. etal. 


Pius 


8092128-8092271 


338038 


Dunham LetaL 


Plus 


81362194138392 


338316 


Dunham, 1. etal. 


Plus 


17089711-17089988 


338442 


Dunham, LetaL 


Plus 


19980640-19980698 


338962 


Dunham, LetaL 


Plus 


29581892-29582020 


332833 


Dunham, LetaL 


Minus 


1119848-1119705 


333120 


Dunham, i. etal. - 


Minus 


3307508-3307427 


333121 


Dunham, 1. etal. 


Minus 


3308446-3308356 


333122 


Dunham, 1. etal. 


Minus 


33095963309531 


333123 


Dunham, LetaL 


Minus 


3310817-3310749 


333168 


Dunham, LetaL 


Minus 


3729896-3729788 


333169 


Dunham. LetaL 


Minus 


37308643730767 


333977 


Dunham, 1. etal. 


Minus 


8722928-8722725 


334834 


Dunham, 1. etal. 


Minus 


17182681-17182535 


335093 


Dunham, LetaL 


Minus 


21297367-21297214 


335095 


Dunham. 1. etal. 


Minus 


21292546-21292381 


335815 


Dunham, L etaL 


Minus 


26320518-26320421 


335946 


Dunham, LetaL 


Minus 


27467203-27487035 


336059 


Dunham, LetaL 


Minus 


29184079-29183969 


336228 


Dunham. 1. etaL 


Minus 


30904602-30904497 


337113 


Dunham, Letat 


Minus 


21233344-21233237 


337214 


Dunham, LetaL 


Minus 


26095902-26095502 


337407 


Dunham, 1. etaL 


Minus 


3166665231886567 


337429 


Dunham, LetaL 


Minus 


32066238-32086079 


337603 


Dunham,!, etal 


Minus 


1299296-1299194 


338116 


Dunham, t etaL 


Minus 


10614071-10613814 


338158 


Dunham, 1. etal. 


Minus 


11794465-11794343 


338388 


Dunham, LetaL 


Minus 


16662403-18662305 


338645 


Dunham. LetaL 


Minus 


24063839-24063775 


338728 


Dunham, LetaL 


Minus 


25949039-25948927 


339236 


Dunham, 1. etaL 


Minus 


32773355-32773202 


339403 


Dunham, 1. etaL 


Minus 


34050728-34050625 


325222 


6525287 


Minus 


22332-22473 


325251 


6682448 


Minus 


411693-411751 


325544 


6682452 


Plus 


171228-171286 


325753 


6682474 


Plus 


398512^398621 


329745 


6065779 


Plus 


174774-175142 


329732 


6065783 


Plus 


161252-161322 


329863 


6691797 


Phis 


196801-196971 


325B89 


5867087 


Plus 


223829-223891 
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325843 6552453 

330020 6671887 

326198 5867215 

326230 5867230 

326169 5867255 

326077 6682495 

326080 6682495 

326759 6249610 

326788 6682503 

326946 6004446 

327036 6531965 

327220 5867525 

327365 6552412 

327414 5867750 

327373 5887792 

327568 5887811 

330187 6706138 

327747 5887947 

327844 6249582 

330232 6013526 

326229 5868105 

327876 5868140 

328262 6381906 

328688 5866262 

328700 5868264 

328695 5868264 

328416 5868409 

328455 5868431 

328520 5868477 

328548 5868487 

328857 6381927 

330305 4877982 

330306 4877982 
328937 5868500 
328957 6456773 
329073 5868596 
329088 5868608 
329350 6456785 
329414 5868874 
329440 5868885 
329451 5868887 



Minus 


7126-7232 


Plus 


172397-172491 


Minus 


8029540674 


Minus 


301868-301972 


Minus 


128321-128388 


Minus 


312108-312168 


Phis 


478644478847 


Plus 


97216-97311 


Plus 


277132-277335 


Minus 


116877-116967 


Plus 


319951-320040 


Minus 


65701-65781 


Mhus 


118133-116198 


Plus 


102461-102586 


Minus 


81864742 


Minus 


46152-46287 


Pius 


212923-213020 


Plus 


115322-115498 


Minus 


18895-18956 


Plus 


113655-113830 


Minus 


120936-121053 


Plus 


103882-104034 


Plus 


11867-12027 


Plus 


626030-626094 


Plus 


764089-764203 


Plus 


318632-318695 


Minus 


258811-258894 


Plus 


385576-385633 


Plus 


1942075-1942246 


Plus 


72301-72397 


Minus 


8055741051 


Minus 


5226942365 


Plus 


96161-96233 


Minus 


1448241-1448333 


Plus 


219195-219297 


Plus 


3783847956 


Plus 


116738-116950 


Plus 


98911-98969 


Plus 


942555-942643 


Plus 


21943-22063 


Plus' 


25974-26048 
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TABLE 7: 

Table 7 depicts Seq ED No.. UnkjeneD, UnlgeneTtfle. Ptey, and ExAccn for all of (he sequences In Table 8. SeqtDNo Talks the nucleic add and protein 
sequence information inTabta 8 to Table 7. 
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75 



Pkey; 


Unique Eos probeset identifier number 




ExAccn: 


Exemplar Accession numl 


)ei, uenoanx accession nuinuci 




UnkjeneD: 


Unkjene number 






Unigene Title: Unigene gene tftte 






Seq.tD.No.: Sequence IdentiOcat 


linn Mihnharfaiinrf In Tahb ft 

uon iNumDer round in i ante o 




PKey 


ExAccn 


Unigene (D 


1 lataMA TIM* 

unigene uiue 


SEQ D NO 


101545 


BE246154 


Hs.154210 


endothelial atnerenrjauon, sprung oupi 


Seq ID 1 & 2 


115819 


AA486620 


Hs.41135 


endomucJn-2 


Seq ID 3 & 4 


424503 


NM-002205 


Hs.149609 


Integra), alpha 5 (fibronectin receptor, 


Seq ID5&6 


102917 


AJ016712 


Hs.287797 


integral, beta 1 (nbronecan receptor, 


SeqlD7&8 


102915 


X07820 


Hs.2258 


matrix metasoprotenase iu (stromsiysui 


Sea ID 9 & 10 


105330 


AW338625 


Hs^2120 


ESTs 


Seq ID 11 & 12 


107385 


NM.005397 


Hs.16426 


podocatyxin-like 


Seq ID 13 & 14 


102024 


AA301867 


Hs.76224 


EGF -containing Ebutin-iike extracellula 


Seq 1015 & 16 


102024 


AA301B67 


HsJ6224 


Cf*C jjl.iI !■!.!.> jlj. KW. Jtm lib/- -ml- i 

eGF -containing nouun-uxe extracellula 


Seq ID 17 & 18 


134416 


X68264 


Hs.211579 


melanoma ceil adhesion molecule 


Seq ID 19 & 20 


103036 


M13509 


Hs.83169 


matrix metaOopiotelnase 1 {interstitial 


Seq ID 21 & 22 


104865 


T79340 


Hs.22575 


B-ceO CLUIymphoma 6, member B (zinc fi 


SeqID23&24 


106124 


H93366 


HS7567 


Homo sapiens cdna. rui i y w its. crone n 


SeqtD25&26 


109001 


AI056548 


Hs.72116 


nypotneucai protein rutumsc sunuar 10 


Seq ID 27 & 28 


104764 


A1039243 


Hs.278585 


ESTs 


Seq ID29&30 


133200 


AB037715 


Hs.183639 


hunnHuiHMl nm+ntn CI HIT) 10 

nypomeucai protein ru luziu 


Seq ID 31 &32 


105263 


AW388633 


Hs.6682 


solute earner family 7, (cationic amino 


SeqlD33&34 


102892 


BE440042 


Hs.83326 


matrix metaBoproteinase 3 {stromefysin 


SeqlD35&36 


109456 


AW956580 


Hs.42699 


ESTs 


Seq ID 37 4 38 


110906 


AA035211 


Hs.17404 


ESTs 


SeqlD39&40 


119073 


BE245360 


Hs.279477 


tolS 


Seq ID 41 &42 


132050 


AI267615 


Hs.38022 


ESTs 


SeqlD43&44 


132490 


NIVL001290 


Hs.4980 


UNI domain binding 2 


Seq ID 45 & 46 


102283 


AW161552 


Hs.83381 


guanine nucleotide binding protein 1 1 


Seq ID 47 & 48 


101714 


M68874 


Hs.211587 


phosphoQpase A2, group iVA (cytosoGc, 


SeqlD49&50 


133975 


C18356 


Hs.295944 


tissue factor pathway inhibitor 2 


Seq ID 51 &52 


106793 


H94997 


Hs.16450 


ESTs 


Seq ID 53 & 54 


118511 


N75620 


Hs.43157 


ESTs 


Seq ID 54 & 55 


101447 


M21305 




go.numan aipna saieuito ana saiemie o 


Seq ID56&57 


314941 


AA515902 


Hs.130650 


ESTs 


Seq 1D58&59 


332466 


AB018259 


Hs.1 18140 


K1AA0716 gene product 


SeqlD60&61 


313513 


AW298600 


Hs.141840 


ESTs, Weakly similar to SS9501 interfero 


SeqlD62&63 


313556 


AA628517 


Hs.118502 


ESTs 


SeqlD64&65 


313665 


AW751201 


Hs.51233 


ESTs 


Seq !D 66467 


314372 


AU040178 


Hs.142003 


ESTs 


Seq ID 68869 


429276 


AF056085 


Hs.198612 


G protein-coupled receptor 51 


Seq ID 70 & 71 


101345 


NM.005795 


Hs.152175 


calcitonin receptor-tike 


Seq ID 72473 


418994 


AA296520 


Hs.B9546 


setecfin E (endothelial adhesion molecul 


Seq ID 74 475 


103850 


AA187101 


Hs.213194 


hypothetical protein MGC10895 


Seq ID 76477 


133260 


AA403045 


Hs.6906 


Homo sapiens cONA: FU23197 fis, done R 


Seq ID 78 4 79 


101097 


BE245301 


Hs.89414 


chemoHne (C-X-C motif}, receptor 4 (fus 


Seq ID 80 481 


104786 


AA027167 


Ha.10031 


WAA0955 protein 


Seq ID 82483 


132173 


X89426 


Hs.41716 


endothelial cell-specific molecule 1 


Seq ID 84 485 


100420 


D86983 


Hs.1 18893 


Melanoma associated gene 


Seq ID 86 487 


111018 


AI287912 


Hi3628 


mftogen-adivated protein kinase kinase 


SeqlD88489 


108507 


AI554545 


Hs.68301 


ESTs 


Seq ID 90491 


104894 


AF065214 


Hs.18858 


phosphoijpase A2, group IVC (cytosofic, 


Seq ID 92 4 93 


118511 


N75620 


Hs.43157 


ESTs 


Seq ID 94 4 95 


125609 


AA868063 


Hs.104576 


carbohydrate (keratan sulfate GaW) sui 


Seq ID 96 497 


101543 


M31166 


Hs.2050 


penlaxifweiated gene, rapidly induced b 


Seq ID 98 499 


102241 


NM.Q07351 


Hs.268107 


muWmerta 


Seq ID 100 4 101 


101560 


AW958272 


Hs.347326 


Intercellular adhesion molecule 2 


Seq ID 1024103 


103280 


U84722 


HS76206 


cadherin 5. type 2, VE-cadherin (vascuta 


Seq ID 1044105 


105826 


AA478756 


Hs.194477 


E3 ubtquttln Ikjase SMURF2 


Seq ID 106 4107 


102804 


NM.002318 


Hsi3354 


fysyi oxidase -like 2 


Seq ID 1084109 


131647 


AA359615 


Hs.30089 


ESTs 


Seq ID 110 4111 


103095 


KNL005424 


Hs.78824 


tyrosine kinase with immunoglobulin and 


Seq ID 1124113 


103037 


BE018302 


Hs.2894 


placental growth factor, vascular endoth 


Seq ID 114 4 115 


100405 


AW291587 


Hs.82733 


nldogen2 


Seq ID 116 4 117 


102012 


BE259035 


Hs.118400 


singed prosophflaHke (sea urchin fas 


Seq ID 1184119 
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lUubi 


U.JUM/ 


rtS.0*CJOO 


proiein u receptor, enaouisuai itrvKj 




105729 


H46612 




UnmA M.U.. UCQT9fit: mDMA mitini /vis 

nomo sapiens nbrwoo mKNA, partial cos 


Con m 199 JL 177 
oeq IU \LL a ltd 


107216 


051069 


Uf»911C7Q 


melanoma ceO adhesion motecule 


oc(| iu lift a 1*3 


lolUou 


Mil /YMQcc 




endothelin 1 


Cm in 19fi A 177 
OWJ IU \£Sj owl 


131486 


F06972 


ns.z/or / 


BMX non-receptor tyrosino kinase 


Cm, m 15A A 190 
OUtflU ltD a ICS 


134299 


AW5B0939 


n3.S71S9 


complement component Clq receptor 


Can tn nn a m 

oeq IU lOU a 101 


134983 


028235 


Us ittCldA 


prost^landin-endoparaxkio synthase 2 (p 


Con m 179 A 177 

oeq iu 10/ a uo 


11 SR. 97 


A/M/BUUU 




acun icHPy protein «yj oompiQi, sixoun 


Can tn 17J A 


133614 




Hs.75232 


btoi4 (o. cerevtsaej-ute i 


Can in 17A i 177 

oeq iu loo 6 to/ 


116463 


Al04WUl 


U. 7M40 


ubtquitin carboxyt-terminal esterase L1 


Can in 17A A 17Q 

oeq mj ioo a 109 






Mo 1AA7A7 
rlS. 10O0OO 


Inlan»af1i alar atlhaaLw fwdor^ T^i 1 (f s T\KA\ 

iniercauinar aonesion nwiccuje i iisuoqj 


Can m ijin jl 1^1 


1 77C7H 


HWZlf 43l 


N/\ 


nucleoside phosphtxylase 


Can in 149 A 147 
Oct] IU a HJ 


130184 


H58306 


Hs«15165 


reunotc acid Induced 14 


Can m AAA ft 1/R 

beq IU 144 & 143 


134786 


T29618 


Us Q<xmc\ 


TEK tyrosine kinase, endothelial (venous 


Can in 14fi A 147 

oeq iu iso a i**/ 


129371 


aUCkJZo 


U« Hnft/W 


von WUksbrand factor 


Can In 14fl A 1AQ 
OetJ IU 1W a 1^3 


418506 


AA0o4Z4fl 


Hs.ooooy 


G protetn-coupted receptor 39 


Can in 4CTI ft 1R1 

oeq iu lou & 131 


322262 


AA632012 


Hs. 1Ba74o 


ESTs 


Can m 1R9 £ 1R7 

oeq iu is/ a 130 


312173 


Al 821409 


Us 7/M/171 

118.3044/1 


CCT 


Can in \U A IW 
Oeq IU 134 A 133 


319795 


AB037821 


ns. 140000 


nmfnmdhaidi Ifl 

protocaanenn iu 


Can in 1U ft 1T7 

oeq iu 130 a 13/ 


313978 


A W7fH 7R 

AI07U1/3 


Uo 170K7 

nsioyo/ 


CCTe 

to IS 


Can m 1<\A a i*;q 
oeq iu loo a loa 


306840 


AI077477 


U« 107019 


ESTs 


Can in IRQ ft 1K1 

oeq iu lou a loi 


310272 


ArZio3B9 


Uc 1/DQ79 


sema domain, transmembrane domain (TM), 


Can m 1fi9 A 1R7 

oeq iu loi a loo 


7M979 


fi.P91R7flQ 
rtr/lOOOd 


U, 14-Q99 
rw. IKJ3M. 




<5an IT) 164 A 1S5 


315044 


DC3470/4 


Us OfMIGQ 


to i s, weaKjy similar id 00303/ aipna-iu- 


Can in 1KR Jt 1fi7 

oeq iu loo a 10/ 


791 79e 
oYioZo 




Uo TtYlftifi 


KIAA1 274 protein (similar to mouse palad 


Can in 1Rft A 1RQ 

oeq iu loo a ids 


321325 


AB033100 


Us IftftRAP. 


iMAAiZf 4 protein [simuar to mouse paiaa 


Can in 170 A 171 

oeq iu i/u a i/i 


303251 


AF240635 


Ue 11*^07 

ns.1 looy/ 


nm(>vei<hsi4n 19 

prorxxscrtertn m 


Can in 179 A 177 

oeq iu i/£ a l/j 


302378 


AL1U9712 


Hs296506 


\J/^« A oaa]>»>a _DUa A ill U«j»I4, Ia.«4 jXIAI 

nomo sapiens nwv\ run rengtri Bisert cun 


Can in MA ft 17£ 

oeq iu i/4 a 1/3 


319267 


F11802 


Hs.6818 


ESTs 


Can in 17R ft 177 

oeq iu i/o a i// 


310442 


AW072215 


ii AAA i^n 

HS208470 


ESTs 


sieq IU l/o a i/o 


300469 




Uo ■MOQCC 

TO-^OOJOO 


nypoffiencaj proten rU2040i 


Can in 1 flA ft 1A1 

oeq iu lou a loi 


331237 


W87874 


II. ArnTJ 

H3.25277 


Homo sapiens cONA FU10717 lis; clone NT 


oeq 10 182 & loo 




R44557 


H323748 


ESTs 


oeq ID 184 A 163 


301822 


X17033 


U> 171Cfftft 


uiiBgnn, aipna l \laj49d, aipna l suouni 


Can in Iflfi ft 1ft7 

oeq iu loo a 10/ 


422573 


AW297965 


Hs.295726 


inte^ririj aipna V {vitronectin receptor 


Seq ID 188 & 189 


133061 


AJ186431 


Hs.296638 


prostate differentiation factor 


Seq ID 190 4 191 


135235 


AW298244 


Hs.266195 


ESTs 


Seq ID 192 & 193 


101192 


BE247295 


Hs.78452 


solute carrier family 20 (phosphate tran 


SeqlD194&195 


113195 


H83265 


Hs.8881 


ESTs, WeaWy similar to S41044 ctuomosom 


Seq 10 196 & 197 


101741 


NMJJ03199 


Hs.326198 


transcription (actor 4 


Seq ID 196 a 199 


321911 


AF026944 


Hs.293797 


ESTs 


Seq ID 200 & 201 


320635 


N50617 


H8.B0506 


small nuclear ribornjcteoproteto polype pt 


SeqlD202&203 


326230 






NM_017643:Homo sapiens hypothetical prat 


SeqiD204&205 


132966 


AF234532 


Hs.61638 


myosin X 


SeqtD206 &207 


135073 


W55956 


Hs. 94030 


Homo sapiens mRNA; cONA DKFZp586E1624 (f 


Seq tD 208 & 209 


106937 


AL050107 


Hs.24341 


transcrfpfional co-activator with PDZ-bi 


SeqlD210&211 


116430 


AK001531 


Hs.66048 


hypothetical protein RJ10669 


Seq ID 212 & 213 


104877 


AI138635 


H3.22988 


Homo sapiens done IMA6E:451939, mRNA se 


Seq ID 214 & 215 


122697 


AA420683 


Hs.98321 


hypothetical protein FU14103 


Seq ID 216 & 217 


112522 


R68857 


Hs.265499 


ESTs 


Seq ID 218 a 219 


304782 


AA532081 




gb:nn32h08.s1 NCI_CGAP_Gas1 Homo sapiens 


Seq ID 220 & 221 


312802 


AA644669 


Hs.193042 


ESTs 


Seq ID 222 & 223 


302680 


AW192334 


Hs.38218 


ESTs 


Seq D 224 & 225 


326198 






Phase2&3Exons 


Seq ID 226 & 227 


331019 


NM.006033 


Hs.65370 


lipase; endothelial 


Seq ID226&229 
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TABLE 8 



Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: NMJJ01400 

Coding sequence: 244-2208 (underlined sequences correspond to start and stop codons)) 



1 11 21 31 41 51 

I I I I I I 

GTCGGGGGCA GCAGCAAGAT GCGAAGCGAG CCGTACAGAT CCCGGGCTCT COGAACGCAA 60 

CTTCGCCCTG CTTGAGCGAG GCTGCGGTTT CCGAGGCCCT CTCCAGCCAA GGAAAAGCTA 120 

CACAAAAAGC CTGGATCACT CATCGAACCA CCCCTGAAGC CAGTGAAGGC TCTCTCGCCT 180 

CGCCCTCTAG CGTTCGTCTG GAOTAGCGCC ACCCOGGCTT CCTGGGGACA CAGGGTTGGC 240 

ACCaiGGGGC CCACCAGOGT CCCGCTGGTC AAGGCCCACC GCAGCTCGGT CTCTGACTAC 300 

GTCAACTATG ATATCATCGT CCGGCATTAC AACTACACGG GAAAGCTGAA TATCAGCGCG 360 

GACAAGGAGA ACAGCATTAA ACTGACCTCG GTGGTGTTCA TTCTCATCTG CTGCTTTATC 420 

ATCCTGGAGA ACATCTTTGT CTTGCTGACC ATTTGGAAAA CCAAGAAATT CCACCGACCC 480 

ATGTACTATT TTATTGGCAA TCTGGCCCTC TCAGACCTGT TGGCAOGAGT AGCCTACACA S40 

GCTAACCTGC TCTTGTCTGG GGCCACCACC TACAAGCTCA CTCCCGCCCA GTGGTTTCTG 600 

CGGGAAGGGA GTATGTTTGT GGCCCTGTCA GCCTCOGTGT TCAGTCTCCT CGCCATCGCC 660 

ATTGAGCGCT ATATCACAAT GCTGAAAATG AAACTCCACA ACGGGAGCAA TAACTTCCGC 720 

CTCTTCCTGC TAATCAGCGC CTGCTGGGTC ATCTCCCTCA TCCTGGGTGG CCTGCCTATC 780 

ATGGGCTGGA ACTGCATCAG TGCGCTGTCC AGCTGCTCCA CCGTGCTGCC GCTCTACCAC 840 

AAGCACTATA TCCTCTTCTG CACCACGGTC TTCACTCTGC TTCTGCTCTC CATCGTCATT 900 

CTGTACTGCA GAATCTACTC CTTGGTCAGG ACTCGGAGCC GCCGCCTGAC GTTCCGCAAG 960 

AACATTTCCA AGGCCAGCCG CAGCTCTGAG AAGTCGCTGG CGCTGCTCAA GACCGTAATT 1020 

ATCGTCCTGA GCGTCTTCAT CGCCTGCTGG GCACCGCTCT TCATCCTGCT CCTGCTGGAT 1080 

GTGGGCTGCA AGGTGAAGAC CTGTGACATC CTCTTCAGAG CGGAGTACTT CCTGGTGTTA 1140 

GCTGTGCTCA ACTCCGGCAC CAACCCCATC ATTTACACTC TGACCAACAA GGAGATGCGT 1200 

CGGGCCTTCA TCCGGATCAT GTCCTGCTGC AAGTGCCCGA GCGGAGACTC TGCTGGCAAA 1260 

TTCAAGCGAC CCATCATCGC CGGCATGGAA TTCAGCCGCA GCAAATCGGA CAATTCCTCC 1320 

CACCCCCAGA AAGACGAAGG GGACAACCCA GAGACCATTA TGTCTTCTGG AAACGTCAAC 1380 

TCTTCTTCCT AGAACTGGAA GCTGTCCACC CACCGGAAGC GCTCTTTACT TGGTCGCTGG 1440 

CCACCCCAGT GTTTGGAAAA AAATCTCTGG GCTTCGACTG CTGCCAGGGA GGAGCTGCTG 1500 

CAAGCCAGAG GGAGGAAGGG GGAGAATACG AACAGCCTGG TGGTGTCGGG TGTTGGTGGG 1560 

TAGAGTTAGT TCCTGTGAAC AATGCACTGG GAAGGGTGGA GATCAGGTCC CGGCCTGGAA 1620 

TATATATTCT ACCCCCCTGG AGCTTTGATT TTGCACTGAG CCAAAGGTCT AGCATTGTCA 1680 

AGCTCCTAAA GGGTTCATTT GGCCCCTCCT CAAAGACTAA TGTCCCCATG TGAAAGCGTC 1740 

TCTTTGTCTG GAGCTTTGAG GAGATGTTTT CCTTCACTTT AGTTTCAAAC CCAAGTGAGT 1800 

GTGTGCACTT CTGCTTCTTT AGGGATGCCC TGTACATCCC ACACCCCACC CTCCCTTCCC 1860 

TTCATACCCC TCCTCAACGT TCTTTTACTT TATACTTTAA CTACCTGAGA OTTATCAGAG 1920 

CTGGGGTTGT GGAATGATCG ATCATCTATA GCAAATAGGC TATGTTGAGT AOGTAGGCTO 1980 

TGGGAAGATG AAGATGGTTT GGAGGTGTAA AACAATGTCC TTCGCTGAGG CCAAAGTTTC 2040 

CATGTAAGCG GGATCCGTTT TTTGGAATTT GGTTGAAGTC ACTTTGATTT CTTTAAAAAA 2100 

CATCTTTTCA ATGAAATGTG TTACCATTTC ATATCCATTG AAGCCGAAAT CTGCATAAGG 2160 

AAGCCCACTT TATCTAAATG ATATTAGCCA GGATCCTTGG TGTCCTAGGA GAAACAGACA 2220 

AGCAAAACAA AGTGAAAACC GAATGGATTA ACTTTTGCAA ACCAAGGGAG ATTTCTTAGC 2280 

AAATGAGTCT AACAAATATG ACATCCGTCT TTCCCACTTT TGTTGATGTT TATTTCAGAA 2340 

TCTTGTGTGA TTCATTTCAA GCAACAACAT GTTGTATTTT GTTGTGTTAA AAQTACTTTT 2400 

CTTGATTTTT GAATGTATTT GTTTCAGGAA GAAGTCATTT TATGGATTTT ' TCTAACCCGT 2460 

GTTAACTTTT CTAGAATCCA CCCTCTTGTO CCCTTAAGCA TTACTTTAAC TGGTAGGGAA 2520 

CGCCAGAACT TTTAAGTCCA GCTATTCATT AGATAGTAAT TGAAGATATG TATAAATATT 2580 

ACAAAGAATA AAAATATATT ACTGTCTCTT TAGTATGGTT TTCAGTGCAA TTAAACCGAG 2640 

AGATGTCTTQ TTTTTTTAAA AAGAATAGTA TTTAATAGGT TTCTGACTTT TGTGGATCAT 2700 
TTTGCACATA GCTTTATCAA CTTTTAAACA TTAATAAACT GATTTTTTTA AAG 



Seq ID NO: 2 Protein sequence; 
protein Accession #: Np_001391 



1 11 21 31 41 51 

1 I I I I I 

MGPTSVPLVK AHRSSVSDYV NYDIIVRHYN YTGKLNISAD KENSIKLTSV VPILICCFII 60 

LENIFVLLTI WKTKKFHRPM YYFIGNLALS DUiAGVAYTA NLLLSGATTY KLTPAQWFLR 120 

EGSMFVALSA SVFSLLAIAI ERYXTMLKMK LHNGSNNFRL FTiIiISACWVI SLILGGLPIM 180 

GWNCISALSS CSTVLPLYHK HYILFCTTVP TLLLLSIVTL YCRIYSLVRT RSRRLTFRKN 240 

ISKASRSSEK SLALLKTVTI VLSVFIACWA PLFILLLLDV GCKVICTCDIL FRAEYFIiVUV 300 

VLNSGTNPII YTLTNKEMRR APIRIMSCCK CPSGDSAGKP KRPIIAGMEF SRSKSDNSSH 360 
PQKDEGDNPS TIMSSGNVNS SS 



179 



WO 02/079492 



PCT/US02/04915 



Seq ID NO: 3 Nucleotide sequence: 
Nucleic Acid Accession #: NMJ>16242 

Coding sequence: 79-864 (underlined sequences correspond to start and stop codona)) 



1 11 21 31 41 51 

I I I I I I 

AAGGCCCTOC CAGCTTGGGA GGGAATTGTC CCTGCCTGCT TCTGGAGAAA GAAOATATTQ 60 

ACACCATCTA CGGGCACCAX 2GAACTGCTT CAAGTQACCA TTCTTTTTCT TCTQCCCAGT 120 

ATTTGCAGCA GTAACAGCAC AGOTGTTTTA OAGQCAGCTA ATAATTCACT TOTTGTTACT 180 

ACAACAAAAC CATCTATAAC AACACCAAAC ACAQAATCAT TACAQAAAAA TGTTQTCACA 240 

CCAACAACTQ GAACAACTCC TAAAGGAACA ATCACCAAT6 AATTACTTAA AATGTCTCTG 300 

ATGTCAACAG CTACTTTTTT AACAAGTAAA OATQAAGGAT TGAAAOCCAC AACCACTGAT 360 

GTCAGGAAGA ATGACTCCAT CATTTCAAAC GTAACAGTAA CAAGTGTTAC ACTTCCCAAT 420 

GCTQTTTCAA CATTACAAAG TTCCAAACCC AAGACTGAAA CTCAGAGTTC AATTAAAACA 480 

ACAGAAATAC CAGGTAGTGT TCTACAACCA GATGCATCAC CTTCTAAAAC TGGTACATTA 540 

ACCTCAATAC CAGTTACAAT TCCAGAAAAC ACCTGACAGT CTCAAGTAAT AGACACTGAG 600 

GGTGGAAAAA ATGCAAGCAC TTCAGCAACC AGCCGGTCTT ATTCCAGTAT TATTTTGCCG 660 

GTGGTTATTO CTTTGATTGT AATAACACTT TCAGTATTTG TTCTGGTGGG TTTGTACCGA 720 

ATGTGCTGGA AGGCAGATCC GGGCACACCA GAAAATGGAA ATGATCAACC TCAGTCTGAT 780 

AAAGAGAGCG TGAAGCTTCT TACCGTTAAG ACAATTTCTC ATGAGTCTGG TGAGCACTCT 840 

GCACAAGGAA AAACCAAGAA CTGACAGCTT GAGGAATTCT CTCCACACCT AGGCAATAAT 900 

TAOGCTTAAT CTTCAGCTTC TATGCACCAA GCGTGGAAAA GGAGAAAGTC CTGCAGAATC 960 
AATCCOGACT TCCATACCTG CTGCTGG 



Seq ID NO: 4 protein sequence: 
Protein Accession #: NP_0S7326 

1 11 21 31 41 51 

I I I I I I 

MELIiQVTILF ULPSICSSNS TGVLBAANNS LWTTTKPSI TTPNTESLQK NWTPTTGTT 60 
PKGTITNELL KMSLMSTATP LTSKDEGLKA TTTDVRKNDS IISNVTVTSV TLPNAVSTLQ 120 
SSKPKTBTQS SHCTTBIPGS VLQPDASPSK TGTLTSIPVT IPENTSQSQV IDTEGGKNAS 180 
TSATSRSYSS IILPWIALI VITLSVPVLV GLYRMCWKAD PGTPENGNDQ PQSDKESVKL 240 
LTVKTISHES GEHSAQGKTK N 

Seq ID NO: 5 Nucleotide sequence^ 
Nucleic Acid Accession #: NM_002205 , 

Coding sequence: 24.. 3173 {underlined sequences correspond to start and stop codona) 



1 11 21 31 41 51 

I I I I I I 

CAGGACAGGG AAGAGCGGGC GCTATGGGGA GCCGGACGCC AGAGTCCCCT CTCCACGCCG 60 

TGCAGCTGCG CTGGGGCCCC CGGCGCCGAC CCCCGCTCGT GCCGCTGCTG TTGCTGCTCG 120 

TGCCGCCGCC ACCCAGGGTC GGGGGCTTCA ACTTAGACGC GGAGGCCCCA GCAGTACTCT 180 

CGGGGCCCCC GGGCTCCTTC TTOGGATTCT CAGTGGAGTT TTACCGGCCG GGAACAGACG 240 

GGGTCAGTGT GCTGGTGGGA GCACCCAAGG CTAATACCAG CCAGCCAGGA GTGCTGCAGG 300 

GTGGTGCTGT CTACCTCTGT CCTTGGGGTG CCAGCCCCAC ACAGTGCACC CCCATTGAAT 360 

TTGACAGCAA AGGCTCTCGG CTCCTGGAGT CCTCACTGTC CAGCTCAGAG GGAGAGGAGC 420 

CTGTGGAGTA CAAGTCCTTG CAGTGGTTCG GGGCAACAGT TCGAGCCCAT GGCTCCTCCA 480 

TCTTGGCATG CGCTCCACTG TACAGCTGGC GCACAGAGAA GGAGCCACTG AGCGACCCCG 540 

TGGGCACCTG CTACCTCTCC ACAGATAACT TCACCCGAAT TCTGGAGTAT GCACCCTGCC 600 

GCTCAGATTT CAGCTGGGCA GCAGGACAGG GTTACTGCCA AGGAGGCTTC AGTGCCGAGT 660 

TCACCAAGAC TGGCCGTGTG GTTTTAGGTG GACCAGGAAG CTATTTCTGG CAAGGCCAGA 720 

TCCTGTCTGC CACTCAGGAG CAGATTGCAG AATCTTATTA CCCCGAGTAC CTGATCAACC 780 

TGGTTCAGGG GCAGCTGCAG ACTCGCCAGG CCAGTTCCAT CTATGA TGAC AGCTACCTAG 840 

GATACTCTGT GGCTGTTGGT GAATTCAGTG GTGATGACAC AGAAGACTTT GTTGCTGGTG 900 

TGCCCAAAGG GAACCTCACT TACGGCTATG TCACCATCCT TAATGGCTCA GACATTCGAT 960 

CCCTCTACAA CTTCTCAGGG GAACAGATGG CCTCCTACTT TGGCTATGCA GTGGCCGCCA 1020 

CAGACGTCAA TGGGGACGGG CTGGATGACT TGCTGGTGGG GGCACCCCTG CTCATGGATC 1080 

GGACCCCTGA CGGGCGGCCT CAGGAGGTGG GCAGGGTCTA CGTCTA CCTG CAGCACCCAG 1140 

CCGGCATAGA GCCCACGCCC ACCCTTACCC TCACTGGCCA TGATGAGTTT GGCCGATTTG 1200 

GCAGCTCCTT GACCCCCCTG GGGGACCTGG ACCAGGATGG CTACAATGAT GTGGCCATCG 1260 

GGGCTCCCTT TGGTGGGGAG ACCCAGCAGG GAGTAGTGTT TGTATTTCCT GGGGGCCCAG 1320 
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GAGGGCTGGG CTCTAAGCCT TCCCAGGTTC TGCAGCCCCT GTGGGCAGCC AGCCACACCC 1380 

CAGACTTCTT TGOCTCTGCC CTTCGAGGAG GCCGAGACCT GGATGGCAAT GGATATCCTG 1440 

ATCTGATTGT GGGGTCCTTT GGTGTGGACA AGGCTGTGGT ATACAOGGGC CGCCCCATCG 1500 

TGTCCGCTAG TGCCTCCCTC ACCATCTTCC COOCCATGTT CAACOCAGAG GAGCGGAGCT 1560 

5 GCAGCTTAGA GGGGAACCCT GTGGCCTGCA TCAACCTTAG CTTCTOCCTC AATGCTTCTO 1620 

GAAAACACGT TGCTGACTCC ATTGGTTTCA CAGTGGAACT TCAGCTGGAC TGGCAGAAGC 1680 

AGAAGGGAGG GGTACGGOGG GCACTGTTCC TGGCCTCCAG GGAGGCAACC CTGACCCAGA 1740 

CCCTGCTCAT CCAGAATGGG GCTCGAGAGG ATTGCAGAGA GATGAAGATC TACCTCAGGA IB 00 

ACQAGTCAGA ATTTCGAGAC AAACTCTCGC CGATTCACAT OGCTCTCAAC TTCTCCTTGG I860 

10 ACCCCCAAGC CCCAGTGGAC AGCCACGGCC TCAGGCCAGC CCTACATTAT CAGAGCAAGA 1920 

GCOGGATAGA GGACAAGGCT CAGATCTTGC TGGACTGTGG AGAAGACAAC ATCTGTGTGC 1980 

CTGACCTGCA GCTGGAAGTG TTTGGGGAGC AGAACCATGT GTACCTGGGT GACAAGAATG 2040 

CCCTGAACCT CACTTTCCAT GCCCAGAATO TGGGTGAGGG TGGGGCCTAT GAGGCTGAGC 2100 

TTCGGGTCAC CGCCCCTCCA GAGGCTGAGT ACTCAGGACT CGTCAGACAC CCAGGGAACT 2160 

15 TCTCCAGCCT GAGCTGTGAC TACTTTGCCG TGAACCAGAG CCGCCTGCTG GTGTGTGACC 2220 

TGGGCAACCC CATGAAGGCA GGAGCCAGTC TGTGGGGTGG CCTTOGGTTT ACAGTCCCTC 2280 

ATCTCCGGGA CACTAAGAAA ACCATCCAGT TTGACTTCCA GATCCTCAGC AAGAATCTCA 2340 

ACAACTCGCA AAGCGACGTG GTTTCCTTTC GGCTCTCOGT GGAGGCTCAG GCCCAGGTCA 2400 

CCCTGAAOGG TGTCTCCAAG CCTGAGGCAG TGCTATTCCC AGTAAGCGAC TGGCATCCCC 2460 

20 GAGACCAGCC TCAGAAGGAG GAGGACCTGG GACCTGCTGT CCACCATGTC TATGAGCTCA 2520 

TCAACCAAGG CCCCAGCTCC ATTAGCCAGG GTGTGCTGGA ACTCAGCTGT CCCCAGGCTC 2580 

TGGAAGGTCA GCAGCTCCTA TATGTGACCA GAGTTAGGGG ACTCAACTGC ACCACCAATC 2640 

ACCCCATTAA CCCAAAGGGC CTGGAGTTGG ATCCCGAGGG TTCCCTGCAC CACCAGCAAA 2700 

AACGGGAAGC TCCAAGCCGC AGCTCTGCTT CCTOGGGACC TCAGATCCT3 AAATGCCCGG 2760 

25 AGGCTGAGTG TTTCAGGCTG OGCTGTGAGC TCGGGCCCCT GCACCAACAA GAGAGCCAAA 2820 

GTCTGCAGTT GCATTTCCGA GTCTGGGCCA AGACTTTCTT GCAGCGGGAG CACCAGCCAT 2880 

TTAGCCTGCA GTGTGAGGCT GTGTACAAAG CCCTGAAGAT GCCCTACCGA ATCCTGCCTC 2940 

GGCAGCTGCC CCAAAAAGAG CGTCAGGTGG CCACAGCTGT GCAATGGACC AAGGCAGAAG 3000 

GCAGCTATGG CGTCCCACTG TGGATCATCA TCCTAGCCAT CCTGTTTGGC CTCCTGCTCC 3060 

30 TAGGTCTACT CATCTACATC CTCTACAAGC TTGGATTCTT CAAACGCTCC CTCCCATATG 3120 

GCACCGCCAT GGAAAAAGCT CAGCTGAAGC CTCCAGCCAC CTCTGATGCC TQAQTCCTCC 3180 

CAATTTCAGA CTCCCATTCC TGAAGAACCA GTCCCCCCAC CCTCATTCTA CTGAAAAGGA 3240 

GGGGTCTGGG TACTTCTTGA AGGTGCTGAC GGCCAGGGAG AAGCTCCTCT CCCCAGCCCA 3300 

GAGACATACT TGAAGGGCCA GAGCCAGGGG GGTGAGGAGC TGGGGATCCC TCCCCCCCAT 3360 

35 GCACTGTGAA GGACCCTTGT TTACACATAC CCTCTTCATG GATGGGGGAA CTCAGATCCA 3420 

GGGACAGAGG CCCAGCCTCC CTGAAGCCTT TGCATTTTGG AGAGTTTCCT GAAACAACTG 3480 

GAAAGATAAC TAGGAAATCC ATTCACAGTT CTTTGGGCCA GACATGCCAC AAGGACTTCC 3540 

TGTCCAGCTC CAACCTGCAA AGATCTGTCC TCAGCCTTGC CAGAGATCCA AAAGAAGCCC 3600 

CCAGTAAGAA CCTGGAACTT GGGGAGTTAA GACCTGGCAG CTCTGGACAG CCCCACCCTG 3660 

40 GTGGGCCAAC AAAGAACACT AACTATGCAT GGTGCCCCAG GACCAGCTCA GGACAGATGC 3720 

CACAAGGATA GATGCTGGCC CAGGGCCAGA GCCCAGCTCC AAGGGGAATC AGAACTCAAA 3780 

TGGGGCCAGA TCCAGCCTGG GGTCTGGAGT TGATCTGGAA CCCAGACTCA GACATTGGCA 3840 

CCAATCCAGG CAGATCCAGG ACTATATTTG GGCCTGCTCC AGACCTGATC CTGGAGGCCC 3900 

AGTTCACCCT GATTTAGGAG AAGCCAGGAA TTTCCCAGGA CCTGAAGGGG CCATGATGGC 3960 

45 AACAGATCTG GAACCTCAGC CTGGCCAGAC ACAGGCCCTC CCTGTTCCCC AGAGAAAGGG 4020 

GAGCCCACTG TCCTGGGCCT GCAGAATTTG GGTTCTGCCT GCCAGCTGCA CTGATGCTGC 4080 

CCCTCATCTC TCTGCCCAAC CCTTCCCTCA CCTTGGCACC AGACACCCAG GACTTATTTA 4140 

AACTCTGTTG CAAGTGCAAT AAATCTGACC CAGTGCCCCC ACTGACCAGA ACTAGAAAAA 4200 
AAAA 



50 



Seq ID NO: 6 prgtein sequence t 
Protein Accession #x NP 002196.1 



55 1 11 21 31 41 51 

I I i I I I 

MGSRTPBSPL HAVQLRWGPR RRPPLVPLLL LLVPPPPHVG GPNLDAEAPA VLSGPPGSFP 60 

GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG BBPVEYKSLQ WPGATVRAHG SSIIACAPLY SWRTBKEPLS DPVGTCYLST 180 

60 DNFTRILEYA PCRSDFSWAA GQGYCQGGF8 AEPTKTGRW LGGPGSYFWQ GQILSATQEQ 240 

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNLTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

BVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVFPG GPGGLGSKPS QVLQPLWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480 

65 VDKAWYRGR PIVSASASLT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADS I 540 

GPTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNE5EFRDK 600 

IiSPIHIALNF SLDPQAPVDS KGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE ABYSGLVRHP GNFSSL8CDY 720 

FAVNQSRItLV CDLGNPMKAG ASliWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 780 

70 SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPS8Z 840 

SQGVLELSCP QALBGQQLLY VTRVTGLNCT TNHPINPKGL ELD PEGS LHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQRBH QPPSLQCEAV 960 

YKALKMPYRI LPRQLPQKBR QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 
YKLGFFKRSL PYGTAMEKAQ LKPPATSDA 
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Seq ID NO: 7 Nucleotide sequence; 
Nucleic Acid Accession #: NMJJ02211 

Coding sequence: 104 . .2500 (underlined sequences correspond to Btart and stop codons) 



1 11 21 31 41 SI 

I I I I I I 

GTCCGCCAAA ACCTGCGCGG ATAGGGAAGA ACAGCACCCC GGCGCCGATT GCCGTACCAA 60 

ACAAGCCTAA CGTCCGCTGG GCCCCGGACG CCGCGCGGAA AAGATGAATT TACAACCAAT 120 

TTTCTGGATT GGACTGATCA GTTCAGTTTG CTGTGTGTTT GCTCAAACAG ATGAAAATAG 180 

ATGTTTAAAA GCAAATGCCA AATCATGTGG AGAATGTATA CAAGCAGGGC CAAATTGTGG 240 

GTGGTGCACA AATTCAACAT TTTTACAGGA AGGAATGCCT ACTTCTGCAC GATGTGATGA 300 

TTTAGAAGCC TTAAAAAAGA AGGGTTGCCC TCCAGATGAC ATAGAAAATC CCAGAGGCTC 360 

CAAAGATATA AAGAAAAATA AAAATGTAAC CAACCGTAGC AAAGGAACAG CAGAGAAGCT 420 

CAAGCCAGAG GATATTACTC AGATCCAACC ACAGCAGTTG GTTTTGCGAT TAAGATCAGG 480 

GGAGCCACAG ACATTTACAT TAAAATTCAA GAGAGCTGAA GACTATCCCA TTGACCTCTA 54 0 

CTACCTTATG GACCTGTCTT ATTCAATGAA AGACGATTTG GAGAATGTAA AAAGTCTTGG 600 

AACAGATCTG ATGAATGAAA TGAGGAGGAT TACTTCGGAC TTCAGAATTG GATTTGGCTC 660 

ATTTGTGGAA AAGACTGTGA TGCCTTACAT TAOCACAACA CCAGCTAAGC TCAGGAACCC 720 

TTGCACAAGT GAACAGAACT GCACCACCCC ATTTAGCTAC AAAAATGTGC TCAGTCTTAC 780 

TAATAAAGGA GAAGTATTTA ATGAACTTGT TGGAAAACAG GGCATATCTG GAAATTTGGA 840 

TTCTCCAGAA GGTGGTTT03 ATGCCATCAT GCAAGTTGCA OTTTGTGGAT CACTGATTGG 900 

CTGGAGGAAT GTTACACGGC TGCTGGTGTT TTCCACAGAT GCCGGGTTTC ACTTTGCTGG 960 

AGATGGGAAA CTTGGTGGCA TTGTTTTACC AAATGATGGA CAATGTCACC TGGAAAATAA 1020 

TATGTACACA ATGAGCCATT ATTATGATTA TCCTTCTATT GCTCACCTTG TCCAGAAACT 1080 

GAGTGAAAAT AATATTCAGA CAATTTTTGC AGTTACTGAA GAATTTCAGC CTGTTTACAA 1140 

GGAGCTGAAA AACTTGATCC CTAAGTCAGC AGTAGGAACA TTATCTGCAA ATTCTAGCAA 1200 

TGTAATTCAG TTGATCATTG ATGCATACAA TTCCCTTTCC TCAGAAGTCA TTTTGGAAAA 1260 

CGGCAAATTG TCAGAAGGAG TAACAATAAG TTACAAATCT TACTGCAAGA ACGGGGTGAA 1320 

TGGAACAGGG GAAAATGGAA GAAAATGTTC CAATATTTCC ATTGGAGATG AGGTTCAATT 1380 

TGAAATTAGC ATAACTTCAA ATAAGTGTCC AAAAAAGGAT TCTGACAGCT TTAAAATTAG 1440 

GCCTCTGGGC TTTACGGAGG AAGTAGAGGT TATTCTTCAG TACATCTGTG AATGTGAATG 1500 

CCAAAGCGAA GGCATCCCTG AAAGTCCCAA GTGTCATGAA GGAAATGGGA CATTTGAOTG 1560 

TGGCGCGTGC AGGTGCAATG AAGGGCGTGT TGGTAGACAT TOTGAATGCA GCACAGATGA 1620 

AGTTAACAGT GAAGACATGG ATGCTTACTG CAGGAAAGAA AACAGTTCAG AAATCTGCAG 1680 

TAACAATGGA GAGTGCGTCT GCGGACAGTG TGTTTGTAGG AAGAGGGATA ATACAAATGA 1740 

AATTTATTCT GGCAAATTCT GCGAGTGTGA TAATTTCAAC TGTGATAGAT CCAATGGCTT 180 0 

AATTTGTGGA GGAAATGGTG TTTGCAAGTG TCGTGTGTGT GAGTGCAACC CCAACTACAC 1860 

TGGCAGTGCA TGTGACTGTT CTTTGGATAC TAGTACTTOT GAAGCCAGCA ACGGACAGAT 1920 

CTGCAATGGC CGGGGCATCT GCGAGTGTGG TGTCTGTAAG TGTACAGATC CGAAGTTTCA 1980 

AGGGCAAAOG TGTGAGATGT GTCAGACCTG CCTTGGTGTC TSTGCTGAGC ATAAAGAATG 204 0 

TGTTCAGTGC AGAGCCTTCA ATAAAGGAGA AAAGAAAGAC ACATGCACAC AGGAATGTTC 2100 

CTATTTTAAC ATTACCAAGG TAGAAAGTOG GGACAAATTA CCCCAGCCGG TCCAACCTGA 2160 

TCCTGTGTCC CATTGTAAGG AGAAGGATGT TGACGACTGT TGGTTCTATT TTACGTATTC 2220 

AGTGAATGGG AACAACGAGG TCATGGTTCA TGTTGTGGAG AATCCAGAGT GTCCCACTGG 2280 

TCCAGACATC ATTCCAATTG TAGCTGGTGT GGTTGCTGGA ATTGTTCTTA TTGGCCTTGC 2340 

ATTACTGCTG ATATGGAAGC TTTTAATGAT AATTCATGAC AGAAGGGAGT TTGCTAAATT 2400 

TGAAAAGGAG AAAATGAATG CCAAATGGGA CACGGGTGAA AATCCTATTT ATAAGAGTGC 2460 

CGTAACAACT GTGGTCAATC CGAAGTATGA GGGAAAATGA GTACTGCCCG TGCAAATCCC 2520 

ACAACACTGA ATGCAAAGTA GCAATTTCCA TAGTCACAGT TAGGTAGCTT TAGGGCAATA 2580 

TTGCCATGGT TTTACTCATG TGCAGGTTTT GAAAATGTAC AATATGTATA ATTTTTAAAA 2640 

TGTTTTATTA TTTTGAAAAT AATGTTGTAA TTCATGCCAG GGACTGACAA AAGACTTGAG 2700 

ACAGGATGGT TATTCTTGTC AGCTAAGGTC ACATTGTGCC TTTTTGACCT TTTCTTCCTG 2760 

GACTATTGAA ATCAAGCTTA TTGGATTAAG TGATATTTCT ATAGCGATTG AAAGGGCAAT 2820 

AGTTAAAGTA ATGAGCATGA TGAGAGTTTC TGTTAATCAT GTATTAAAAC TGATTTTTAG 2880 

CTTTACATAT GTCAGTTTGC AGTTATGCAG AATCCAAAGT AAATGTCCTG CTAGCTAGTT 2940 

AAGGATTGTT TTAAATCTGT TATTTTGCTA TTTGCCTGTT AGACATGACT GATGACATAT 3000 

CTGAAAGACA AGTATGTTGA GAGTTGCTGG TGTAAAATAC GTTTGAAATA GTTGATCTAC 3060 

AAAGGCCATG GGAAAAATTC AGAGAGTTAG GAAGGAAAAA CCAATAGCTT TAAAACCTGT 3120 

GTGCCATTTT AAGAGTTACT TAATGTTTGG TAACTTTTAT GCCTTCACTT TACAAATTCA 3180 

AGCCTTAGAT AAAAGAACOG AGCAATTTTC TGCTAAAAAG TCCTTGATTT AGCACTATTT 3240 

ACATACAGGC CATACTTTAC AAAGTATTTG CTGAATGGGG ACCTTTTGAG TTGAATTTAT 3300 

TTTATTATTT TTATTTTGTT TAATGTCTGG TGCTTTCTAT CACCTCTTCT AATCTTTTAA 3360 

TGTATTTGTT TGCAATTTTG GGGTAAGACT TTTTTATGAG TACTTTTTCT TTGAAGTTTT 3420 

AGCGGTCAAT TTGCCTTTTT AATGAACATG TGAAGTTATA CTGTGGCTAT GCAACAGCTC 3480 

TCACCTACGC GAGTCTTACT TTGAGTTAGT GCCATAACAG ACCACTGTAT GTTTACTTCT 3540 

CACCATTTGA GTPGCCCATC TTGTTTCACA CTAOTCACAT TCTTGTTTTA AGTGCCTTTA 3600 
GTTTTAACAG TTCA 

Seq ID NO: 8 Protein sequence; 
Protein Accession #: NP_002202 



1 11 21 31 41 51 

I I I I I 1 

MNLQPIFWIG LISSVCCVPA QTDENRCLKA NAKSCGBCIQ AGPNCGWCTN STFLQEGMPT 60 
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9ARCDDLEAL KKKGCPPDDI ENPRGSKDIK KNKNVTNRSK GTABKLKPED ITQIQPQQLV 120 

LRLRSGBPQT FTLKFKRABD YPIDLYYLMD LSYSMXDDLS NVKSLGTDLM NEMRRITSDF 180 

RIGFGSPVEK TVKFYISTTP AKLRNPCTSB QNCTSPPSYK NVLSLTNKGB VFNELVGKQR 240 

ISGNLDSPBG GFDAIMQVAV CGSLIGWRNV TRLLVF9TDA GFHFAGDGKL GGIVLPNDGQ 300 

CHLENNMYTM SHYYDYPSIA HLVQKIiSENN IQTIFAVTEB FQPVYKELKN LIPKSAVGTL 360 

SANSSNVIQL IIDAYNSLSS BVILBNGKLS EGVTISYKSY CKKGVNGTGE NGRXCSNISI 420 

GDBVQFEISI TSNKCPKKDS DSFKIRPLGF TEEVEVILQY ICECECQSEG IPSSPKCHEG 480 

NGTFECGACR CNEGRVGRHC BCSTDEVNSE DMDAYCRKEN SSEICSHNGB CVCGQCVCRK 54 0 

RDNTNEIYSG KFCECDNFNC DRSNGLICGG NGVCKCRVCB CNPKYTGSAC DCSLDTSTCE 600 

ASNGQICMGR GICECGVCKC TDPKFQGQTC EMCQTCLGVC AEHKECVQCR AFNKGEKKDT 660 

CTQECSYFNI TKVBSRDKLP QPVQPDFVSH CKBKDVODCW PYPTYSVNGN NEVMVHWEN 720 

PECPTGPDII PIVAGWAGI VLIGIALLLI WKLLMIIHDR REFAKFEKEK MNAKWDTGEN 780 
PIYKSAVTTV VNPKYEGK 

Seq ID NO: 9 nucleotide sequence; - 
Nucleic Acid Accession #:NM_00242S 

Coding sequence: 23.. 1453 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I { I I 

AAAGAAGGTA AGGGCAGTGA GAATGATQCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 

AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240 

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 

TCCIGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 

TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 

TGAGAAAGCT CTGAAAGTCT GGQAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 

AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 

TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 

CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 

GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 

GTCCTTCGAT GCCATCAOCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 

TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140 

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 

GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 

GTTACATTGC TAGGCGAQAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560 

GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 

ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTrT GGGCCTGTTC 1740 
CTT 



Seq ID NO: 10 Protsta sequence: 
Protein Accession #: NP_002416 

1 11 21 31 41 51 

I I I I I I 

HMHLAFLVLL CLPVCSAYPL SGAAKEEDSN KDLAQQYLBK YYNLEKDVKQ FRRKDSNLIV 60 
KKIQGMQKPI* GLEVTGKLDT DTLEVMRKPR OGVPDVGKPS SFPGMPKWRK THLTYRIVNY 120 
TPDLPRDAVD SAIEKALKVW EBVTPLTFSR LYEGEADIMI SFAVKEHGDF YSFDGPGHSL 180 
AHAYPPGPGL YGDIHFDDDE KWTEDASGTN LFLVAAHELG HSLGLFHSAN TEAU1YPLYN 240 
SFTELAQPRL 8QDDVNGIQS LYGPPPASTB BPLVPTKSVP SGSBMPAKCD PALS PDA I ST 300 
LRGEYLFFKD RYFWRRSHWN PEPEFHIiISA EWPSLPSYLD AAYEVHSRDT VPIPKGNEFW 360 
AIRGNBVOAG YPRGIHTLGF PPTIRKIDAA VSDKEKKKTY FFAADKYWRF DENSQSMEQG 420 
FPRLIADDFP GVEPKVDAVL QAFGFFYFFS GSSQFEFDPN ARMVTHILKS NSWLRC 



Seq ID NO: 11 Nucleotide sequence: 
Nucleic Acid Accession #: XM_05B1B9 

Coding sequence: 169.. 774 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I 1 1 

GAAGACCAGC TCAGCTCTTC AGTTQTTGAT CATTGTCTAT TGTTCTCCAA ACAGTAAACC 60 



183 



WO 02/079492 



PCMJS02/04915 



AGTATTTCAC ACTGAGATTG TCGGCTGCGG GTATATTCCA ATTCCCCGTC TCCTCATGAA 120 

TATGAAGTGA AGGGCTCTGA CCCTGGAAGT GGTTCTAAGC AGGGCAA AAT GG GGTCTCGQ 180 

AAGTGTGGAG GCTGCCTAAG TTGTTTGCTG ATTCCGCTTG CACTTTCGAG TATAATCGTG 240 

AACATATTAT TGTATTTCCC GAATGGGCAA ACTTCCTATG CATCCAGCAA TAAACTCACC 300 

AACTACGTGT GGTATTTTGA AGGAATCTGT TTCTCAGGCA TCATGATGCT TATAGTAACA 360 

ACAGTTCTTC TGGTACTGGA GAATAATAAC AACTATAAAT GTTGCCAGAG TGAAAACTGC 420 

AGCAAAAAAT ATGTGACACT GCTGTCAATT ATCTTTTCTT CCCTCGGAAT TQCTTTTTCT 480 

GGATACTGCC TGGTCATCTC TGCCTTGGGT CTTGTCCAAQ GGCCATATTG CCGCACCCTT 540 

GATGGCTGGG AGTATGCTTT TGAAGGCACT GCTGOACGTT TCCTTACAGA TTCTAGCATA 600 

TGGATTCAGT GCCTGGAACC TGCACATGTT GTGGAGTGGA ACATCATTTT ATTTTCCATT 660 

CTCATAACCC TCAGTGGGCT TCAAGTGATC ATCTGCCTCA TCAGAGTAGT CATGCAACTA 720 

TCCAAGATAC TGTGTGGAAG CTATTCAGTQ ATCTTCCAGC CTGGAATCAT TTGAA TAAGG 780 

ACAAAATGTT TTCCATTATC AAGACATGGC CATCTATCTA AATATTATAT CAACTGTGTA 840 

GACTTGAGGG CAATATTGAA ATGATGGTGC TTTCTOCATT TGGTGTTTAT TTGTAAAAAA 900 

TTTGCAGTCC TCACTGCACA TGCAAGTATA OCACCCTTCC ATTTAGTATG TTTTTTAAGT 960 

AATATGCATC AGAAACTTCA GAAATACTTC TGCCCTTTGA TCAAACAAAT CCATTTCCAA 1020 

GAATCTOTAC TAGGGAAGTA AATAAGAATA TGAGAGAAAC CTTTATGCAA ATATGTATAT 1080 

TGCAACATTA TTTAATATTC TGGAAAATTG GAAACACCCC AAAATTCTAA ACTCAOAGGA 1140 

AGGATTAAGT AAAGAGTGGT ACATACTGTA AATGTTTTCT GATATTAAAA AAAAAATTAA 1200 
ATAAAAAATA AAGAGTACTA CATGGTTGTA AAA 

Seq ID NO: 12 Protean aftguencq? 
Protein Accession ft: XP_05B189 



1 11 21 

I i I 

MGSRXCGGCL SCLLIPIiALW SIIVNIIiYF 
LIVTTVLLVL ENNNNYKCCQ SENCSKKYVT 
CRTLDGWBYA PEGTAGRFLT OSSIWIQCLE 
VMQLSKILCG SYSVIFQPGI I 



31 41 51 

1 i I 

PNGQTSYASS NKLTNYVWYF EGICPSGIMM 60 

LLSIIPSSLG IAFSGYCLVI SALGLVQGPY 120 

PAHWEWNII LPSILITLSG LQVIICLIRV 180 



Seq ID NO: 13 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005397 

Coding sequence: 251. .1837 (underlined sequences correspond to start and stop codons) 



1 
I 

AAACGCCGCC 
CAGCCCGGCT 
CGGGCCACAG 
TACCGCCCGG 
CGACGACACG 
GCCGCTGCTG 
CCAGACTACT 
CATGGCTACA 
GGCCTCGGTC 
GGCTCAGCAA 
CCCTACTACC 
AACCTCCACA 
AACAAACTCT 
AGAACATCTG 
GCATCCTGTG 
TTCAAGCACT 
ACCGTGATCG 
TACGGCCCCT 
ACCTACCCTG 
CCCCAAAACA 
TGAGACACAG 
TGCAGGGGGC 
CTTCAACCCG 
CGTGGTCGTC 
GCTGAAGGAC 
CCAGGGGCCA 
CTGCATGGCG 
CTCCCAGAGG 
CCATGACAAC 
GGTCAGCCTC 
GGACGACCTG 
CAGCACCACA 
CTGGGGAGGG 
TTTCCCTTTT 
CTAGTGCCTG 
GACTTTATAG 
GAGGGTAAGT 



11 
I 

CAGGACGCAG 
CTGCTGCAGC 
CCTGGCCTCC 
ACGCGCGGAT 
ATGC GCTGCG 
CCGTCGTCGC 
ACGGACTCAT 
QATACAGCCC 
AAOGCGACCA 
GTCTCAGGCC 
ACCATCGAGA 
GCCACAGCTA 
GGGGGGAAAA 
ACGACCCCTC 
GCCACCCCAA 
GTGGCTATCC 
GTTATCTCGC 
TCCTCCCAGG 
CCAGAGACCA 
CCTTCTCCCA 
ACACAGAGTG 
GCTTCGGATG 
GCCCAAGATA 
AAAGAAATCA 
AAATGGGATG 
CGGGAGGAGG 
TCATTCCTGC 
AAGGACCAGC 
CCAACACTGG 
AAOGGGGAGC 
GATGAGGAGG 
GAGCTCCAGA 
AGAGTGAACT 
CAACCTGAAC 
AGCTCAGTGC 
ATGAACTAGT 
GACTTGCCCA 



21 

I 

CCGCCGCCGC 
GGCAGGGAGG 
GGAGCCACCC 
CCTCCGCCGG 
OGCTGGCGCT 
CGTCGCCGTC 
CTAACAAAAC 
AGCAGAGCAC 
CCCTTGGTGT 
CAGTCAACAC 
GCCCCAAGAG 
AACCTAACAC 
GCAGCCACAG 
ACCCTACAAG 
CAAGCTCGGG 
CTGGCTACAC 
AAAGAACTCA 
AGACAGTGCA 
TGAGCTCCAG 
CTGTGGCTCA 
AGAAGCAGCT 
AGAAATTGAT 
AGTGCGGCAT 
CTATTCACAC 
AACTAAAGGA 
CCGAGGACCG 
TCCTCGTGGC 
AGCGGCTAAC 
AAGTGATGGA 
TGGGGGACAG 
AAGACACACA 
CCAACCACCC 
CCGAGGGGTG 
AAATCACATT 
TGCTGGATGA 
GGAATCCCTT 
AGGTCAGAGC 



31 
I 

CGCCGCTCCT 
AAGAGCCGCC 
ACAGGCCTCC 
CACCGCAGCC 
CTCGGCGCTG 
GCCGTCGCCG 
AGCACCGACT 
AGTCCCCACT 
ATCCAGTGAC 
TACCGTGGCT 
CACAAAAAGT 
CACAAGCAGC 
TGTGACCACA 
TCCACTTAGC 
ACATGACCAT 
CTTCACAAGC 
ACAGACCTCC 
GCCCACGAGC 
CCCCACAGCA 
TGAGAGTAAC 
CGTCCTGAAC 
CTCACTGATA 
ACGGCTGGCA 
TAAGCTCCCT 
GGCAGGGGTC 
CTTCAGCATG 
GGCCCTCTAT 
AGAGGAGCTG 
GACCTCTTCT 
CTGGATCGTC 
CCTCTAQTCC 
CAAGTGCOGT 
TCCCCTCCCA 
CTGTCCAGAT 
TGAGGGAGAT 
CATTCTGCAG 
CACTTGGTGA 



41 
I 

CTGCCACTGG 
GCAGCGCGAC 
CCGGGCGGCG 
ACCTGCTCCC 
CTGCTACTGT 
TCGCCCTCCC 
CCAGCATCCA 
TCCAAGGCCA 
TCACCGGGGA 
AGAGGAGGCG 
GCAGACACCA 
CAGAATGGAG 
GACCTCACAT 
CCCOGACAAC 
CTTATGAAAA 
CCGGGGATGA 
AGTCAGATGC 
CCGGCAACGG 
GCATCAACTA 
TGGGCAAAGT 
CTCACAGGAA 
TGCCGAGCAG 
TCTGTTCCAG 
GCCAAGGATG 
AGTGACATGA 
CCCCTCATCA 
GGCTGCTGCC 
CAGACAGTGG 
GAGATGCAGG 
CCTCTGGACA 
GGTCTGCCGG 
TTGGATGGGG 
ATCCCCCCAG 
TCCTCTTGTA 
CAAGAAAAAG 
TGAGATTGCC 
CAGAGCCAGG 



51 
I 

CTCTGCGCCC 
TCGGGAGCCC 
CCCACGCTCC 
GGCCCAGAGG 
TGTCAACGCC 
AGAATGCAAC 
GTGTCACCAT 
ACGAAATCTT 
CTACAACCCT 
GCTCAGGCAA 
CTACAGTTGC 
CAGAAGATAC 
CCACTAAGGC 
CCACTTTGAC 
TTTCAAGCAG 
CCACCACCCT 
CAGCCAGCTC 
CATTGAGAAC 
CCCACCGATA 
GTGAGGATCT 
ACACCCTCTG 
TCAAAGCCAC 
GAAGTCAGAC 
TGTACGAGCG 
AGCTAGGGGA 
TCACCATCGT 
ACCAGCGCCT 
AGAATGGTTA 
AGAAGAAGGT 
ACCTGACCAA 
TGGCCTCCAG 
AAGGGAAAGA 
GGCCTTAATT 
AAATAACCCA 
CCACGTAAGG 
GAGACCTGAA 
ATGAGAACAA 
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AGATTCCATT TGCACCATGC CACACTGCTG TGTTCACATG TGCCTTCOGT CCAGAGCAGT 2280 

CCCQGGCH3Q GGTGAAACTC CAGCAGGTGG CTGGGCTGGA AAGGAGGGCA 6GQCTACATC 2340 

CTCGCTCGGT GGGATCTGAC GACCTGAAAG TCCAGCTCCC AAOTTTTCCT TCTCCTACCC 2400 

CAGCCTOGTG TACCCATCTT CCCACCCTCT ATGTTCTTAC CCCTCCCTAC ACTCAGTGTT 2460 

5 TGTTCCCACT TACTCTGTCC TGGGGCCTCT GGGATTAGCA CAGGTTATTC ATAACCTTGA 2520 

ACCCCTTQTT CTGGATTCGG ATTTTCTCAC ATTTGCTTCG TGAGATGGGG GCTTAACCCA 2580 

CACAGGTCTC CGT6CGTGAA CCAGGTCTOC TTAGGGGACC TGCGTGCAGG TGAGGAGAGA 2640 

AGGGGACACT CGAGTCCAGG CTGGTATCTC AGGGCAGCTG ATGAGGGGTC AGCAGGAACA 2700 

CTGGCCCATT GCCCCTGGCA CTCCTTCCAG AGGCCACCCA OGATCTTCTT TGGGCTTCCA 2760 

10 TTTCCACCAG GGACTAAAAT CTGCTGTAGC TAGTGAGAGC AGCGTGTTCC TTTTGTTGTT 2820 

CACTGCTCAG CTGATGGGAG TGATTCCCTG AGACCCAGTA TGAAAGAGCA GTGGCTGCAG 2880 

GAGAGGCCTT CCCGGGGCCC CCCATCAGCG ATGTGTCTTC AGAGACAATC CATTAAAGCA 2940 

GCCAGGAAGG ACAGGCTTTC CCCTGTATAT CATAGGAAAC TCAGGGACAT TTCAAGTTGC 3000 

TGAGAGTTTT GTTATAGTTG TTTTCTAACC CAGCCCTCCA CTGCCAAAGG CCAAAAGCTC 3060 

15 AGACAGTTGG CAGACGTCCA GTTAGCTCAT CTCACTCACT CTGATTCTCC TGTGCCACAG 3120 

GAAAAGAGGG CCTGQAAAGC GCAGTGCATG CTGGGTGCAT GAAGGGCAGC CTGGGGGACA 3180 

GACTGTTGTG GGAACGTCCC ACTGTCCTGG CCTGGAGCTA GGCCTTGCTG TTCCTCTTCT 3240 

CTGTGAGCCT AGTGGGGCTG CTGCGGTTCr CTTGCAGTTT CTGGTGGCAT CTCAGGGGAA 3300 

CACAAAAGCT ATGTCTATTC CCCAATATAG GACTTTTATG GGCTCGGCAG TTAGCTGCCA 3360 

20 TGTAGAAGGC TCCTAAGCAG TGGGCATGGT GAGGTTTCAT CTGATTGAGA AGGGGGAATC 3420 

CTGTGTGGAA TGTTGAACTT TCGCCATGGT CTCCATCGTT CTGGGCGTAA ATTCCCTGGG 3480 

ATCAAGTAGG AAAATGGGCA GAACTGCTTA GGGGAATGAA ATTGCCATTT TTCGGGTGAA 3S40 

ACGCCACACC TCCAGGGTCT TAAQAGTCAG GCTCOGGCTG TAGTAGCTCT GATGAAATAG 3600 

GCTATCCACT CGGGATGGCT TACTTTTTAA AAGGGTAGGG GGAGGGGCTG GGGAAGATCT 3660 

25 GTCCTGCACC ATCTGCCTAA TTCCTTCCTC ACAGTCTGTA GCCATCTGAT ATCCTAGGGG 3720 

GAAAAGGAAG GCCAGGGGTT CACATAGGGC CCCAGCGAGT TTCCCAGGAG TTAGAGGGAT 3780 

GCGAGGCTAA CAAGTTCCAA AAACATCTGC CCCGATGCTC TAGTGTTTGG AGGTGGGCAG 3840 

GATGGAGAAC AGTGCCTGTT TGGGGGAAAA CAGGAAATCT TGTTAGGCTT GAGTGAGGTG 3900 

TTTGCTTCCT TCTTGCCCAG CGCTGGGTTC TCTCCACCCA GTAGGTTTTC TGTTGTGGTC 3960 

30 CCGTGGGAGA GGCCAGACTG GATTATTCCT CCTTTGCTGA TCCTGGGTCA CACTTCACCA 4020 

GCCAGGGCTT TTGACGGAGA CAGCAAATAG GCCTCTGCAA ATCAATCAAA GGCTGCAACC 4080 

CTATGGCCTC TTGGAGACAG ATGATGACTG GCAAGGACTA GAGAGCAGGA GTGCCTGGCC 4140 

AGGTCGGTCC TGACTCTCCT GACTCTCCAT CGCTCTGTCC AAGGAGAACC CGGAGAGGCT 4200 

CTGGGCTGAT TCAGAGGTTA CTGCTTTATA TTCGTCCAAA CTGTGTTAGT CTAGGCTTAG 4260 

35 GACAGCTTCA GAATCTGACA CCTTGCCTTG CTCTTGCCAC CAGGACACCT ATGTCAACAG 4320 

GCCAAACAGC CATGCATCTA TAAAGGTCAT CATCTTCTGC CACCTTTACT GGGTTCTAAA 4380 

TGCTCTCTGA TAATTCAGAG AGCATTGGGT CTGGGAAGAG GTAAGAGGAA CACTAGAAGC 4440 

TCAGCATGAC TTAAACAGGT TGTAGCAAAG ACAGTTTATC ATCAACTCTT TCAGTGGTAA 4500 

ACTOTGGTTT CCCCAAGCTG CACAGGAGGC CAGAAACCAC AAGTATGATG ACTAGGAAGC 4560 

40 CTACTGTCAT GAGAGTGGGG AGACAGGCAG CAAAGCTTAT GAAGGAGGTA CAGAATATTC 4620 

TTTGCGTTGT AAGACAGAAT ACGGGTTTAA TCTAGTCTAG GCRCCAGATT TTTTTCCCGC 4680 

. TTGATAAGGA AAGCTAGCAG AAAGTTTATT TAAACCACTT CTTGAGCTTT ATCTTTTTTG 4740 

ACAATATACT GGAGAAACTT TGAAGAACAA GTTCAAACTG ATACATATAC ACATATTTTT 4800 

TTGATAATGT AAATACAGTG ACCATGTTAA CCTACCCTGC ACTGCTTTAA GTGAACATAC 4860 

45 TTTGAAAAAG CATTATGTTA GCTGAGTGAT GGCCAAGTTT TTTCTCTGGA CAGGAATGTA 4920 

AATGTCTTAC TGGAAATGAC AAGTTTTTGC TTGATTTTTT TTTTTAAACA AAAAATGAAA 4980 

TATAACAAGA CAAACTTATG ATAAAGTATT TGTCTTGTAG ATCAGGTGTT TTGTTTTGTT 5040 

TTTTTAATTT TAAAATGCAA CCCTGCCCCC TCCCCAGCAA AGTCACAGCT CCATTTCAGT 5100 

AAAGGTTGGA GTCAATATGC TCTGGTTGGC AGGCAACCCT GTAGTCATGG AGAAAGGTAT 5160 

50 TTCAAGATCT AGTCCAATCT TTTTCTAGAG AAAAAGATAA TCTGAAGCTC ACAAAGATGA 5220 

AGTGACTTCC TCAAAATCAC ATGGTTCAGG ACAGAAACAA GATTAAAACC TGGATCCACA 5280 

GACTGTGCGC CTCAGAAGGA ATAATCGGTA AATTAAGAAT TGCTACTCGA AGGTGCCAGA 5340 

ATGACACAAA GGACAGAATT CCTTTCCCAG TTGTTACCCT AGCAAGGCTA GGGAGGGCAT 5400 

GAACACAAAC ATAAGAACTG GTCTTCTCAC ACTTTCTCTG AATCATTTAG GTTTAAGATG 5460 

55 TAAGTGAACA ATTCTTTCTT TCTGCCAAGA AACAAAGTTT TGGATGAGCT TTTATATATG 5520 

GAACTTACTC CAACAGGACT GAGGGACCAA GGAAACATGA TGGGGGAGGC AAGAGAGGGC 5580 

AAAGAGTAAA ACTGTAGCAT AGCTTTTGTC ACGGTCACTA GCTGATCCCT CAGGTCTGCT 5640 

GCAAACACAG CATGGAGGAC ACAGATGACT CTTTGGTGTT GGTCTTTTTG TCTGCAGTGA 5700 

ATGTTCAACA GTTTGCCCAG GAACTGGGGG ATCATATATO TCTTAGTGGA CAGGGGTCTG 5760 

60 AAGTACACTG GAATTTACTG AGAAACTTGT TTGTAAAAAC TATAGTTAAT AATTATTGCA 5820 
TTTTCTTACA AAAATATATT TTGGAAAATT GTATACTGTC AATTAAAGT 



65 



fieq ID NO: 14 Protein sequence; 
Protein Accession #j NPJJ05388 



1 11 21 31 41 51 

I I 1 I i 1 

MRCALALSAL LLLLSTPPLL PSSPSPSPSP 8PSQNATQTT TDSSNKTAPT PASSVTIMAT 60 

DTAQQSTVPT S KANE I LAS V KATTLGVSSD SPGTTTLAQQ VSGPVNTTVA RGGGSGNPTT 120 

70 TIESPKSTKS ADTTTVATST ATAKPNTTSS QNGAEDTTNS OGKSSHSVTT DLTSTKAEHL 180 

TTPHPTSPLS PRQPTLTHPV ATPTSSGHDH LMKISSSSST VAIPGYTETS PGWTTTIiPSS 240 

VISQRTQQTS SQMPASSTAP SSQBTVQPTS PATALRTPTL PBTMSSSPTA ASTTHRYPKT 300 

PSPTVAHESN WAKCEDLETQ TQSEKQLVLN LTGNTLCAGO ASDEKLISLI CRAVKATPNP 360 

AQDKCGIRLA SVPGSQTVW KEITIHTKLP AKDVYERLKD KWDELKEAGV SDMKLGDQGP 420 

75 PEEAEDRFSM PLIITIVCMA SPLLLVAALY GCCHQRLSQR KDQQRLTBEL QTVENGYHDN 480 
PTLEVMETSS EMQEKKWSL NGELGDSWTV PLDNLTKDDL DEEBDTHIi 
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Seq 10 NO: IS Nucleotide sequence: 
Nucleic Acid Accession Ui NH_004105 

Coding sequence: 150.. 1631 (underlined sequences correspond to start and stop codons) 



CTAGTATTCT 
AAAATAAAAA 
CAAGGTAACT 
CTCTGGCQCT 
ACGGATATGA 
TCCCAGACGC 
TTCCGAAAAC 
CAGAAGGAAC 
GAQTGTTGCC 
AGACTGGCCG 
CCAACCCTTC 
GCCAAGACAT 
TCAATTTACG 
AGTGCGTAGA 
CACCAGGCTC 
CCTGOGTAGA 
TTCTTGGTTC 
ACTGTGAAGA 
ATGAACCTGG 
CATGTCAAGA 
GGAATTATCA 
TAACACCAGA 
AGTCAATAGT 
TCCAGATACA 
GAAATGAAAA 
TCGTGAAGTC 
GCAGTATAGG 
TTTCATT TTft 
TTACCTTAAA 
ACACTCACCC 
TTCAAAGTTT 
TTCAAGAGAG 
ACCAAGCAAT 
CTCATAAGGA 
AACTCCTTTG 
■TCTGCCATAT 
ATAGTAATAA 
AGAAATTTTA 
AAAATGAACC 
AGAAAATAAA 
ATAATAACAT 
TTCACTGGAG 
AACTAAAATT 



GGAATATTTT 
TTTAAATAAA 



11 
I 

ACTAGAACTG 
GCTTGAGCAG 
CTGCTAGCTA 
GGTCAAGTCA 
GTGGGATCCT 
TTGTAAAGGT 
AGCCCAGATT 
CTCAGGGGCA 
CGGGGGTGGT 
AAATAACTTT 
CCACCGTATC 
AGACGAGTGC 
GGGATOCTTT 
CATAGATGAA 
ATTTTATTGC 
TATAAATGAA 
ATTCATCTGT 
CATTGATGAA 
GAAATTCTCA 
TATAAATGAG 
TGGOGGCTTC 
GAACCGATGT 
CTACAAATAC 
GGCCACAACT 
TGGAGAGTTC 
ATTATCAGGA 
GACCTTCCGC 
GTCTTTTCTA 
GCACTATTTT 
ATAACAAACA 
GTCTTTATTA 
CTAAGTATAC 
GATGATCTTC 
GGCAGCCATC 
TAAGAAAATG 
TTGTGTTGGT 
ATCATTGCTG 
GATTGTGAAT 
AGATACCCCC 
TTCCTTCTAA 
CCCTGAATTC 
AAGTTTTAAT 
TCTCATCTTC 
ATCCAGAGTG 
ATACGTCTCC 
AATAAATATT 



21 

I 

GAAGATTGCT 
CAATTCATAT 
AGATTCACAA 
CAGGACACCG 
GTGAGACAGC 
GGAATGAAGT 
ATTGTCAATA 
ACCACCGGGG 
TTTGTGGCCA 
GTCATCCGGC 
CAGTGTGCAG 
ACTGGAGGGA 
GCATGTCAGT 
TGTACCATCC 
CAGTGCAGTC 
TGTGATGCCA 
CAGTGCAATC 
TGCAGAACCT 
TGTATGTGCC 
TGTGAGACCA 
CGTTGTTATC 
GTTTGCCCAG 
ATGAGCATCC 
ATTTATGCCA 
TACCTACGAC 
CCAAGAGAAC 
ACAAGCTCTG 
AQAGTCAACC 
ATTTATAGAT 
ATTACACCAT 
CTATATGTAA 
ACTATCTGGT 
TGTGGTGCTT 
ATAACCATTG 
GAAAAGGTCA 
TTTTATTTTC 
TACAACATGC 
ATTTTGTAAA 
TAGAAAATTA 
ACCACATTGG 
AGGCATTCAC 
TTCTAAGTAA 
GTACTTGATG 
ACAGTGAACT 
TTGTTTAAAA 
CCTTTAGAAG 



31 

1 

CTCCGAGTTT 
TACTGTCACA 
TGTTGAAAQC 
AAGAAACCAT 
AATGCAAAGA 
GTGTCAACCA 
ATGAACAGCC 
TTGTAGCTGC 
GTGCTGCTGC 
GGAACCCAGC 
CAGGCTAOGA 
CGCACAACTG 
GCCCTCCTGG 
CTCCATATTG 
CTGGGTTTCA 
GCAATCAATG 
AAGGATATGA 
CAAGCTACCT 
CCCAGGGATA 
CAAATGAATG 
CACGAAATCC 
TCTCAAATGC 
GATCTGATAG 
ACACCATCAA 
AAACAAGTCC 
ATATCGTGGA 
TGTTAAGATT 
ACAGGCATTT 
ATATCTAGTG 
GGTATAAAGT 
ATTAGACATT 
GAAACTTGGA 
AAGGAAACTT 
AATAGCATGC 
ATAAAGATAT 
ATATCCAGCC 
TGGTTTCTGT 
AAACAGTAAG 
TACTATTGAG 
AACTGACCTG 
AAGATGCAGA 
AATTTAAATC 
CTCACAGAGG 
TAAGCAAATT 
TCTGACTGCT 
•ATCACTCTAA 



41 

I 

TTTTTTTGTT 
GGTATTTTTG 
CCTTTTCCTA 
CACGTACACG 
TATTGATGAA 
CTATGGAGGA 
TCAGCAGGAA 
CAGCAGCATG 
AGTCGCAGGC 
TQACCCTCAG 
GCAAAGTGAA 
TAGAGCAGAC 
ATATCAGAAG 
CCACCAAAGA 
ATTGGCAGCA 
TGCTCAGCAG 
GCTAAGCAGT 
GTGTCAATAT 
CCAAGTGGTG 
CCGGGAGGAT 
TTGTCAAGAT 
CATGTGCOGA 
GTCTGTGCCA 
TACTTTTCGG 
TGTAAGTGCA 
CCTGGAGATG 
GACAATAATA 
AAGTCAGCCA 
CATCTACATC 
GGGCATTTAA 
AATCCACTAA 
TTCTTTCCTA 
ACTAGAGCTC 
AAGGGTAAGA 
ATTTCTTTAG 
TAAAGGTGGT 
AGGGTATTTT 
CAAAATTTTC 
AAATCTATGG 
AAGAAGCAAA 
ACAAAATGGA 
CTAACACTTC 
AAGAAAATGA 
ACCCTCCTAC 
TTACTTTGAT 
AA 



51 
I 

ATTTTGTTAA 
CTGTGCTGTG 
ACTATGCTGA 
CAATGCACTG 
TGTGACATTG 
TACCTCTGCC 
ACACAACCAG 
GCAACCAGTG 
CCTGAAATGC 
CGCATTCCCT 
CACAACGTGT 
CAAGTGTGCA 
CGAGGGGAGC 
TGOGTGAATA 
AACAACTATA 
TGCTACAACA 
GACAGGCTCA 
CAATGTGTCA 
AGAAGTAGAA 
GAAATGTGTT 
CCCTACATTC 
GAACTGCCCC 
TCAGACATCT 
ATTAAATCTG 
ATGCTTGTGC 
CTGACAGTCA 
GTGGGGCCAT 
AAGAATATTG 
TCTATACTGT 
TATGTAAAGA 
ACTGGTCTTC 
TAAAAGTGGG 
CACTAACAGT 
ATGAGTTTTT 
AAAATGGGGA 
TGTTTATTAT 
TAATTTTGTC 
CAGAATTCCC 
GGAGGATATG 
CTCGGAAAAT 
TAAAAGGTAT 
ACTAATTTAT 
TGATGGTTTT 
CCAATTCTAT 
GTATCATATT 
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Seq ID NO: 16 Protein sequence: 
Protein Accession #: NP_004096 



1 11 21 31 41 51 

11(111 

MLKALFLTML TLALVKSQDT EETITYTQCT DGYEWDPVRQ QCKDIDECDI VPDACKGGMK 60 
CVNHYGGYLC LPKTAQIIVN NBQPQQETQP AEGTSGATTG WAASSMATS GVLPGGGFVA 120 
SAAAVAGPEM QTGRNNPVIR RNPADPQRIP SNPSHRIQCA AGYEQSEHNV CQDIDECTAG 180 
THNCRADQVC INLRGSFACQ CPPGYQKRGE QCVDIDECTI PPYCHQRCVN TPGSFYCQCS 240 
PGFQLAANNY TCVDINECDA SNQCAQQCYN ILGSPICQCN QGYBLSSDRL NCEDIDECRT 300 
SSYLCQYQCV NEPGKPSCMC PQGYQWRSR TCQDINECET TNECREDEMC WNYHGGFRCY 360 
PRNPCQDPYI LTPENRCVCP VSNAMCRBLP QSIVYKYMSI RSDRSVPSDI FQIQATTIYA 420 
NTINTPRIKS GNENGEFYLR QTSPVSAMLV LVKSLSGPRE HTVDLEMLTV SSIGTFRTSS 480 
VLRLTIIVGP PSF 

Seq ID NO: 17 Nycleotj.de, sequence: 
Nucleic Acid Accession #: NM_018894 

Coding sequence: 27.. 1967 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 

I I I I I t 

AAAACATTCA ACAAATTAAT GGGTGTAAGG AACTGGAAAA CCTGGACTCC TACCACATGC 60 

5 AGATAAAACC AATAGAGTGC AGAATAAGAC TCAAGTCAAO TAAGTAACGT TAAACACCAT 120 

AAAGACACAT GGCCTTCTTT GTGTACATGA CATGCATTCT CAACAATGCA CTOACGGATA 180 

TGACTGGOAT CCTGTGAGAC AGCAATGCAA AOATATTGAT GAATGTGACA TTGTCCCAGA 240 

OOCTTGTAAA GOTGGAATQA AGTGTGTCAA CCACTATOGA GGATACCTCT GCCTTCCGAA 300 

AACAGCCCAG ATTATTGTCA ATAATGAACA GCCTCAGCAG GAAACACAAC CAOCAGAAGG 360 

10 AACCTCAGGG GCAACCACCG GGGTTGTAGC TGCCAGCAGC ATGGCAACCA GTGGAGTOTT 420 

GCCCGGGGGT GGTTTTGTGG CCAGTGCTGC TGCAGTCGCA GGCCCTGAAA TGCAGACTGG 480 

CCQAAATAAC TTTGTCATCC GGCGGAACCC AGCTGACCCT CAGCGCATTC CCTCCAACCC S40 

TTCCCACCGT ATCCAGTGTG CAGCAGGCTA CGAGCAAAGT GAACACAACG TGTGCCAAGA 600 

CATAGACGAG TGCACTGCAG GGACGCACAA CTGTAGAGCA GACCAAGTGT GCATCAATTT 660 

15 ACGGGGATCC TTTGCATGTC AGTGCCCTCC TGGATATCAG AAGCGAGGGG AGCAGTGCGT 720 

AGACATAGAT GAATGTACCA TCCCTCCATA TTGCCACCAA AGATGCGTGA ATACACCAGG 780 

CTCATTTTAT TGCCAGTGCA GTCCTGGGTT TCAATTGGCA GCAAACAACT ATACCTGCGT 840 

AGATATAAAT GAATGTGATG CCAGCAATCA ATGTGCTCAG CAGTGCTACA ACATTCTTGG 900 

TTCATTCATC TGTCAGTGCA ATCAAGGATA TGAGCTAAGC AGTGACAGGC TCAACTGTGA 960 

20 AGACATTGAT GAATGCAGAA CCTCAAGCTA CCTGTGTCAA TATCAATGTG TCAATGAACC 1020 

TGGGAAATTC TCATGTATGT GCCCCCAGGG ATACCAAGTG GTGAGAAGTA GAACATGTCA 1060 

AGATATAAAT GAGTGTGAGA CCACAAATGA ATGCCGGGAG GATGAAATGT GTTGGAATTA 1140 

TCATGGCGGC TTCCGTTGTT ATCCACGAAA TCCTTGTCAA GATCCCTACA TTCTAACACC 1200 

AGAGAACCGA TGTGTTTGCC CAGTCTCAAA TGCCATGTGC CGAGAACTGC CCCAGTCAAT 1260 

25 AGTCTACAAA TACATGAGCA TCCGATCTGA TAGGTCTGTG CCATCAGACA TCTTCCAGAT 1320 

ACAGGCCACA ACTATTTATG CCAACACCAT CAATACTTTT CGGATTAAAT CTGGAAATGA 1380 

AAATGGAGAG TTCTACCTAC GACAAACAAG TCCTGTAAGT GCAATGCTTG TGCTCGTGAA 1440 

GTCATTATCA GGACCAAGAG AACATATCGT GGACCTGGAG ATGCTGACAG TCAGCAGTAT IS 00 

AGGGACCTTC CGCACAAGCT CTGTGTTAAG ATTGACAATA ATAGTGGGGC CATTTTCATT 1560 

30 TTAGTCTTTT CTAAGAGTCA ACCACAGGCA TTTAAGTCAG CCAAAGAATA TTGTTACCTT 1620 

AAAGCACTAT TTTATTTATA GATATATCTA GTGCATCTAC ATCTCTATAC TGTACACTCA 1680 

CCCATAACAA ACAATTACAC CATGGTATAA AGTGGGCATT TAATATGTAA AGATTCAAAG 1740 

TTTGTCTTTA TTACTATATG TAAATTAGAC ATTAATCCAC TAAACTGGTC TTCTTCAAGA 1800 

GAGCTAAGTA TACACTATCT GGTGAAACTT GGATTCTTTC CTATAAAAGT GGGACCAAGC 1860 

35 AATGATGATC TTCTGTGGTG CTTAAGGAAA CTTACTAGAG CTCCACTAAC AGTCTCATAA 1920 

GGAGGCAGCC ATCATAACCA TTGAATAGCA TGCAAGGGTA AGAATGAGTT TTTAACTGCT 1980 

TTGTAAGAAA ATGGAAAAGG TCAATAAAGA TATATTTCTT TAGAAAATGG GGATCTGCCA 2040 

TATTTGTGTT GGTTTTTATT TTCATATCCA GCCTAAAGGT GGTTGTTTAT TATATAGTAA 2100 

TAAATCATTG CTGTACAACA TGCTGGTTTC TGTAGGGTAT TTTTAATTTT GTCAGAAATT 2160 

40 TTAGATTGTG AATATTTTGT AAAAAACAGT AAGCAAAATT TTCCAGAATT CCCAAAATGA 2220 

ACCAGATACC CCCTAGAAAA TTATACTATT GAGAAATCTA TGGGGAGQAT ATGAGAAAAT 2280 

AAATTCCTTC TAAACCACAT TGGAACTGAC CTGAAGAAGC AAACTOGGAA AATATAATAA 2340 

CATCCCTGAA TTCAGGCATT CACAAGATGC AGAACAAAAT GGATAAAAGG TATTTCACTG 2400 

GAGAAGTTTT AATTTCTAAG TAAAATTTAA ATCCTAACAC TTCACTAATT TATAACTAAA 2460 

45 ATTTCTCATC TTCGTACTTG ATGCTGACAG AGGAAGAAAA TGATGATGGT TTTTATTCCT 2520 

GGCATCCAGA GTGACAGTGA ACTTAAGCAA ATTACCCTCC TACCCAATTC TATGGAATAT 2S60 

TTTATACGTC TCCTTGTTTA AAATCTGACT GCTTTACTTT GATGTATCAT ATTTTTAAAT 2640 
AAAAATAAAT ATTCCTTTAG AAGATCACTC TAAAA 

50 Seq ID NO; 18 Protein sequence; 

Protein Accession #: NP_061489.1 

1 11 21 31 

„ I I I I 

55 MHSQQCTDGY EWDPVRQQCK DIDSCDIVPD ACKGGMKCVH 

PQQBTQPAEG TSGATTGWA ASSMATSGVL PGGGFVASAA 

ADPQRIPSNP SHRIQCAAGY EQSEHNVCQD IDECTAGTHN 

GYQKRGEQCV DIDECTIPPY CHQRCVNTPG SFYCOCSPGP 

CAQQCYNILG SFICQCNQGY BLSSDRLNCE DIDECRTSSY 
60 YQWRSRTCQ DINECBTTNE CREDEMCWNY HGGFRCYPRN 

AMCRELPQSI VYKYMSIRSD RSVPSDIFQI QATTIYANTI 

PVSAMLVLVK SLSGPREHIV DLEMLTVSSI GTPRT8SVLR 

Seq ID NO: 19 Nucleotide pegyencflj 
65 Nucleic Acid Accession #t NM_006500 

Coding sequence: 27.. 1967 (underlined sequences correspond to start and stop codons) 



60 
120 
180 
240 
300 
360 



41 51 



HYGGYLCLPK 


TAQIIVNNEQ 


60 


AVAGPEMQTG 


RNNFVIRRNP 


120 


CRADQVCINL 


RGSFACQCPP 


180 


QLAANNYTCV 


DINE CD AS NQ 


240 


LCQYQCVNEP 


GKPSCMCPQG 


300 


PCQDPYILTP 


ENRCVCPVSN 


360 


NTFRIKSGNE 


NGEPYLRQTS 


420 


LTIIVGPPfiP 







1 11 21 31 41 51 

70 | | | | | | 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 

TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 
CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 

75 TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 

TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 
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GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 

TCTACAAAGC TCCGGAGGAG CCAAACATCC AGOTCAACCC CCTGGGCATC CCTGTGAACA 480 

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA CGGGTACCCC ATTCCTCAAG S40 

TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

5 CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTOCA GAGTATTCTG AAGGCACAGC 660 

TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGTGGCTQGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGQ GGACCGCGTG GAAATCAGGT 640 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAQCAGAAC CCCAGCACCA 900 

10 GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTG AGTCCOGCAG 1080 

CCOCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTOTGC 1200 

15 TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA TCGCTGOGTG GCGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

20 TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGOGTGGTCA TOGTCGCTGT GATTGTGTGC ATCCTGGTCC 1740 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

25 GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

rTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

30 GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

35 GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

40 TACAAAAAAA AATTAGCTAG GOGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC ' AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

45 GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

50 AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 



Seq ID NO: 20 Protein sequence: 
Protein Accession #s NP_006491 

60 

1 11 21 31 41 51 

I I I I I I 

NGLPRLVCAF LLAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGNLSHV 60 
DWFSVHKEKR TLIPRVRQGQ GQSEPGEYEQ RLSLQDRGAT LALTQVTPQD BRIPLCQGKR 120 

65 PRSQEYRIQL RVYKAPEBPN IQVNPLGIPV NSKBPEBVAT CVGRNGYPIP QVIWYKNGRP 180 
LKEEKNRVHI QSSQTVESSG LYTLQ3ILKA QLVKEDKDAQ FYCELNYRLP SGNHMKKSRE 240 
VTVPVFYPTB KVWLEVBPVG MLKEGDRVEI RCIADGNPPP HFSISKQtfPS TREAEEETTN 300 
DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LL3EPQELLV NYVSDVRVSP AAPERQEGSS 360 
LTLTCEAESS QDLEFQWLRE ETDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGLNRT 420 

70 QLVKLAIFGP PWMAPKERKV WVKENMVLNL SCEASGHPRP TI5WNVNGTA SEQDQDPQRV 480 
LSTLNVLVTP ELilaETGVECT ASNDLGKNTS ILPLELVNLT TLTPDSNTTT GIiSTSTASPH 540 
TRANSTSTBR KLPEPBSRGV VIVAVIVCIL VLAVLGAVLY FLYKKGKLPC RRSGKQEITL 600 
PPSRKTELW EVKSOKLPEE KGLLCGSSGO KRAPGDQGEK YIOLRH 

75 
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Seq ID HO: 21 Nucleotide sequence: 
Nucleic Acid Accession fi: NM_00242l 

Coding sequence: 72.. 1401 {underlined sequences correspond to start and stop codons) 



1 11 21 

I I t 

GGGATATTGG AGTAGCAAGA GGCTGGGAAG 
CAAAGGCCAG TATGCACAGC TTTCCTCCAC 
CTCACAGCTT CCCAGCGACT CTAGAAACAC 
ACCTGGAAAA ATACTACAAC CTGAAGAATG 
GTGGCCCAGT GGTTGAAAAA TTGAAGCAAA 
GGAAACCAGA TGCTGAAACC CTGAAGGTGA 
TGGCTCAGTT TGTCCTCACT GAGGGGAACC 
GGATTGAAAA TTACACGCCA GATTTGCCAA 
CCTTCCAACT CTGGAGTAAT GTCACACCTC 
CAGACATCAT GATATCTTTT GTCAGGGGAG 
CTGGAGGAAA TCTTGCTCAT GCTTTTCAAC 
TTG ATGAAGA TGAAAGGTGG ACCAACAATT 
CTCATGAACT CGGCCATTCT CTTGGACTCT 
ACCCTAGCTA CACCTTCAGT GGTGATGTTC 
AAGCCATATA TGGACGTTCC CAAAATCCTG 
CGTGTGAGAG TAAGCTAACC TTTGATGCTA 
TTAAAGACAG ATTCTACATG CGCACAAATC 
TTTCTGTTTT CTGGCCACAA CTGCCAAATG 
GAGATGAAGT CCGGTTTTTC AAAGGGAATA 
TACACGGATA CCCCAAGGAC ATCTACAGCT 
TCGATGCTGC TCTTTCTGAG GAAAACACTO 
ACTGGAGGTA TGATGAATAT AAACGATCTA 
ATGACTTTCC TGGAATTGGC CACAAAOTTG 
ATTTCTTTCA TGGAACAAGA CAATACAAAT 
TCCAGAAAGC TAATAGCTGG TTCAACTGCA 
AAAACACATG GTGTGAGTCC AAAGAAGGTG 
CATTTTTAAC CTCTAGAGTC ACTGATACAC 
CATATTTTTT TACTATITAG AATGTAGCCC 
ATGGTGGGTA CAAAAAGTCA AGTTTGTGGC 
GATCTTTTCC AGAGTATGCA ACTCTGACGT 
ATATCCTTTC AAGACAGAAA GAGACAGGAG 
CAAGAACACA TGTGCAGTCA CTGGTGTCAC 
ACAAAATAAG TGTTTTATGT TTGGAATAAA 



31 41 51 

I i I 

CCATCACTTA CCTTGCACTG AGAAAGAAGA 60 

TGCTGCTGCT GCTGTTCTGG GGTGTGGTGT 120 

AAGAGCAAGA TGTGGACTTA GTCCAGAAAT 180 

ATGGGAGGCA AGTTGAAAAG OGGAGAAATA 240 

TGCAGGAATT CTTTGGGCTG AAAGTGACTG 300 

TGAAGCAGCC CAGATGTGGA GTGCCTGATG 360 

CTOGCTGGGA GCAAACACAT CTGACCTACA 420 

GAGCAGATGT GGACCATGCC ATTOAGAAAO 480 

TGACATTCAC CAAGGTCTCT GAGGGTCAAG 540 

ATCATCGGGA CAACTCTCCT TTTGATGGAC 600 

CAGGCCCAGG TATTGGAGGG GATGCTCATT 660 

TCAGAGAGTA CAACTTACAT CGTGTTGCGG 720 

CCCATTCTAC TGATATCGGG GCTTTGATGT 780 

AGCTAGCTCA GGATGACATT GATGGCATCC 840 

TCCAGCCCAT CGGCCCACAA ACCCCAAAAG 900 

TAACTACGAT TCGGGGAGAA GTGATGTTCT 960 

CCTTCTACCC GGAAGTTGAG CTCAATTTCA 1020 

GGCTTGAAGC TGCTTACGAA TTTGCCGACA 1080 

AGTACTGGGC TGTTCAGGGA CAGAATGTGC 1140 

CCTTTGGCTT CCCTAGAACT GTGAAGCATA 1200 

GAAAAACCTA Cn VT TT G TT GCTAACAAAT 1260 

TGGATCCAGG TTATCCCAAA ATGATAGCAC 1320 

ATGCAGTTTT CATGAAAGAT GGATTTTTCT 1380 

TTGATCCTAA AACGAAGAGA ATTTTGACTC 1440 

GGAAAAATXS AACATTACTA ATTTGAATGG 1500 

TTTTCCTGAA GAACTGTCTA TTTTCTCAGT 1560 

AGAATATAAT CTTATTTATA CCTCAGTTTG 1620 

TTTTTGTACT GATATAATTT AGTTCCACAA 1680 

TTATGGATTC ATATAGGCCA GAGTTGCAAA 1740 

TGATCCCAGA GAGCAGCTTC AGTGACAAAC 1800 

ACATGAGTCT TTCCCGGAGG AAAAGCAGCT 1860 

CCTGGATAGG CAAGGGATAA CTCTTCTAAC 1920 
GTCAACCTTG TTTCTACTGT TTT 



Seq ID NO: 22 Protein, sequence: 
protein Accession #: NP_002412 



1 11 21 31 41 51 

I I I I I I 

MHSFPPLLLL LFWGWSHSP PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60 

VEKLKQMQEF PGLKVTGKPD AETLKVMKQP RCGVPDVAQP VLTEGNPRWE QTHLTYRIBN 120 

YTPDLPRADV DHAISKAFQL WSNVTPLTFT' KVSEGQADIM ISFVRGDHRD NSPFDGPGGN 180 

IAHAFQPGPG IGGDAHFDED ERWTNNPREY NLHRVAAHEL GHSLGLSHST DIGALMYPSY 240 

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVELNFISVP WPQLPNGLEA AYEFADRDBV RFFKGNKYWA VQGQNVLHGY 360 

PKDIYSSFGF PRTVKHIDAA LSSENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 
GIGHKVDAVF MKDGFPYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 



Seq ID NO: 23 Nucleotide sequence; 

Nucleic Acid Accession <h FGENBSH predicted ORP 

Coding sequence: 141-1580 {underlined sequences correspond to start and stop codons) 



1 11 21 



TCTGCGTGTG CCGGGGCTAG 




AGGCAAACAG AGGAGGGAAG GCGTCTTAGG 
CTCTACAGGC CTGTGTCGCT ATGGGTTCCC 
TCCGCGAGTT CACTCGCCAC TCCTCCGACG 
GCGGGATCCT CACTGACGTC ACGCTGCTGG 
CAGTTCTCAT CGCCTGCAGT GGCTTCTTCT 
GGGTGGACGT GCTCTCTCTG CCCGGGGGTC 
ACTTCATGTA CACTTCGCGC CTGCGCCTCT 
CCGCCACCTA TTTGCAGATG GAGCACGTGG 
GCTATGAACC TCTGGGCATC TCCCTGCGCC 
CGGCCCCTCC ACCAGGTAGT CCCAGGCGCT 



31 41 51 

III 

TCCTGGCTCT AGTTGCACCT CGGAAGGAAA 60 

ACTGCCTGGA TCCAGAGCAC TTTCCTCGGC 120 

CCGCCGCCCC GGAGGGAGCG CTGGGCTACG 180 

TGCTGGGCAA CCTCAACGAG CTGCGCCTGC 240 

TTGGCGGGCA ACCCCTCAGA GCACACAAGG 300 

ATTCAATTTT CCGGGGCCGT GCGGGAGTCG 360 

CCGAAGCGAG AGGCTTCGCC CCTCTATTGG 420 

CTCCAGCCAC TGCACCAGCA GTCCTAGCGG 480 

TCCAGGCATG CCACCGCTTC ATCCAGGCCA 540 

CCCTGGAAGC AGAACCCCCA ACACCCCCAA 600 

CCGAAGGACA CCCAGACCCA CCTACTGAAT 660 
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CTCGAAGCTG 
QOAAAAAOTA 
GGGAGAGAAG 
CCAGCAGCAG 
GCAGQCTCTC 
CCTACCTCCT 
CACTACCGGG 



10 



15 



20 



25 



30 



35 



40 



45 



50 



GQTCTTOGTT 
AGCCTTACCA 
ACAGCCGCAT 
TACAGGTGGC 
GCCCTACCTG 
ACACCGGAGA 
AACTGCGGCT 
ACATTCTCGO 
GAAAGCTGCA 
CACTTTGGTA 
GATC CTGGCT 
GTTTCTGAGG 
CCTAAGGGAA 
TTTATTGAGG 
TTCCACAAGT 
CAGAGATTAC 
TAACTTTTAT 
TGGAAAAAGG 
TCTAGGGCAG 
TTCTGGATGT 
CAAGGGTGAT 
GAGGCTGTGG 
GGGTGTGGGG 
TCTGCAAGAT 
GCTGTATCCA 
CACGGGGGCC 
TTATCAGAGA 
GGGAAGAATC 
ACTACAGAGG 
TCTTCCTTTA 
TCCTCTAGAA 
TCTTCCCATC 
AAGAAAAGGG 
ATTACAGATG 
TTCAAGTAGG 
GAAAGTGAGG 
GCTGAAGCCT 
OGAGGGTAGA 
GTTCTAGTGG 
TCTCCCCATG 
AGAAGGAATC 
AGAAATGTTA 
AAAAAAAAAA 



CAGTCAAGGC 
CAAGTACATC 
TTCTGGTCAA 
CAGCAGCAGC 
TCCAACTGCT 
CACATCCCAG 
AAGTGAATTT 
CTCCTTGGTT 
CCGCTACAAG 
CTGCTCAATC 
CCATTCGGGA 
ACATCTGOGG 
OGGAACCCGC 
GAAGCCTTAC 
GCATCTGOGC 
GGGGCCCTAQ 
GGCCCAGGCC 
TCAGAAATTG 
AGATCTGCCT 
AGAGAGCTAG 
TAGCCCTCCA 
CCTTTGGGTG 
GTGATTAAAA 
TAGCCCTTGG 
CTTTAGAATT 
GGTTCTCTGT 
CTCTGGGAAC 
TGTAGGTTCT 
AGGAACCATT 
GTGTGGGGGA 
GATTCTGTAT 
GGTCCAGAAT 
TCTATAGTGG 
TGTTCTTAGC 
TGATGTGACC 
ATGAAACTCT 
CATATGGGTT 

GGGATGGTGC 
CCTGCATTCC 
GCTGAGTTCC 
TAAAAGATTG 
ATTAAGAGGT 
AACAGGGTTG 
TGATTGATAG 
AAGTAAGAAG 
CTGTOGCCTG 
GCCCCACTGC 
ATGATTTCTA 
GATCTTGCAA 



CCCCCCAGTC 
GTGCTAAACT 
CCTTGCCCCC 
AGCAGCAGCA 
GCCACTGTGC 
GCTCAAGACA 
TTCAGCTGCC 
CCTGGGGACG 
GGCAACCTTG 
TQCGGAGCCC 
GAGAAGCCGT 
GCGCACGTGC 
TTCCGCCACC 
CACTGCGACC 
CAGAAACACG 
CTGAGCGCAG 
TTGCTTCCCT 
CCACCCTCTT 
CTGTTTTGCT 
CTAGGGGCTG 
CCTGTGGCCC 
GCACCGGGGC 
GTGACCAGAA 
CTCTCTOGTT 
GTTCTTTCTC 
GTTCTGCCCC 
ATGOGGGATT 
CTAGCAGTCT 
ATGTTGAGCC 
TTCTGTATCT 
CTGGATTCCG 
CTAAAATSTC 
TAGAGACCCA 
ACTGAGTTGA 
TTTTCTGACT 
TTAGCTTGAT 
TGAATGTTAC 
GACATCTTCT 
TTGGTAACCT 
TGTCTGGAAC 
ATTCTGGGTT 
ACTAGCCCAT 
TGGTTGAGGG 
CCTCTTGGCT 
TTCTGCCCCT 
CACTTTTGAA 
GGGACTAGTG 
AGAATTAAAG 
TTTAGCAGAT 
CATCAGATCC 



cagccagccc 
ctcaggcctc 
aagccaggct 
gtgaagaagg 

AGTTCAAATG 
CCTCTGGATC 
AGAACTGTGA 
AAGACAAACC 
CCAGTCATCG 
GTTTTAACCG 
ATAAGTGTGA 
TGATCCACAC 
TGCAGACCCT 
CCTGTGGCCT 
GAGCTGCTAC 
GCCCAGGCCC 
ATCAGGCTTG 
AATTTCTCAC 
GGTCAAAACC 
GGAAAGGGGA 
CCATTGCATT 
CTTCATTCGA 
ACACAGAAGG 
TGGCTTGGGT 
CTGTTTGTTT 
TGTAATTCTA 
GTGGAATTGG 
AGAAATGGAT 
CAAAATGGAA 
GGATTCCGTA 
TATCACTCCA 
CCATTAATCT 
CCAGGGCTCA 
TCGCTCCATG 
CTGCCCAGTC 
TAGATGGTAA 
CTGGGGTTCT 
GGCAAGTGTC 
TACCTTTTAA 
CAGTGAATGC 
TGCTGTAGTT 
AGGCCAAAGG 
GTGCAGTTTC 
GGGTGGAGTC 
TGTTGCCCTG 
TTTGTGGGGT 
AGAAAGCTAC 
AAGGAAGAAG 
TGGATGGGCA 
TTGGAATAAA 



TGACCCCAAG 
CCAAGCAGGG 
CCCCAGTGGA 
ACCCATTCCT 
TGGGGCTCCA 
ACCCTCTGAA 
GGCTGTGGCA 
CTATAAGTGT 
TACAGTGCAC 
GGCAGCAAAC 
GACGTGCGGC 
OGGGGAGAAG 
CAAGAGCCAC 
GCATTTCCGG 
CAACACCAAA 
CACTTGCTTC 
GGCATAGGGG 
TGGGGAGAGC 
TCTTCCCCAC 
GAGATTGGAG 
CAGTTTATCT 
TTGCATTTCC 
TGAGATCACA 
ATTTTATATT 
GCTTGTTAGT 
GGTCTGGAAC 
GTCAGGAACC 
ACAGACATTT 
GTAATAATAA 
TCACTCCAAC 
AGTGGAGGCT 
GGTCACTTGG 
AGTGGAGTCC 
GGGGAGAGAT 
TCTATGAATG 
ACAGTGTTAA 
CTCTATTGAG 
CAGATGCCAG 
AAGCTGGGTC 
ATTAGAACCT 
TGGTTGGGAT 
CCTGTTCTAG 
TGGTGTAGGC 
TCTGAAATGT 
GGGCTTATCT 
AGAACTTCAA 
TCTTCTCCCT 
GGAAGGCGGA 
GGTGGAGAAT 
GAAGCCTCTC 



GCCTGCAACT 
AGCCTGGTCG 
GACGAGGCCT 
GGTCCCCAGA 
GCCAGTACCC 
OGGGCTCGTC 
GGGTGCTCAT 
CAGCTGTGCC 
ACAGGGGAAA 
CTGAAAACGC 
TCGCGCTTTG 
CCCTACCCTT 
GTTCGCATCC 
CACAAGAGTC 
GTGCACTACC 
CTGCGGGTGG 
TGTGCCAGGC 
AGGGGTGGCA 
AAGCCAGATT 
TCCTGGTCTC 
GTAAATATAA 
CACTCCCCTC 
GCTCTGCTGG 
ATTTCTGTCA 
TTGTTTAAAA 
CTTTATTTGT 
CTCTCTGQTA 
CTCTGTTCTT 
ATGCCTCCTG 
TGGAGGCTGT 
GGCAGGTTTT 
GTTTGGCTCT 
ATCATCCTCC 
CAGACATTCC 
TTATGGCCTA 
CCCATCCTTT 
TTGAGCCCCT 
AACCTTCTTT 
TGTGACCTGG 
TCCATAGGAA 
TATTGTTGGC 
TTGACCAAGT 
CAGGTAGGTA 
TAGAAGAAGC 
GATTATGGGA 
CAATAAGTCA 
CTTCCCTCTT 
GGAGTCTATA 
GCCTGGGGGT 
TGYGCWRAAA 



720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 



55 



Seq ID NO: 24 Protein sequence: 
Protein Accession #: FGENESH predicted 



60 



65 



i 
I 

MGSPAAPBGA 
GFFYSIFRGR 
EHWQACHRF 
PPSPASPDPK 
S5SSEEGPIP 
PSCQNCEAVA 
CGARFNRPAN 
FRHLQTLKSH 



11 
I 

LGYVREFTRH 
AGVGVDVLSL 
IQA3YE PI/31 
ACNWKKYKYI 
GPQSRLSPTA 
GCSSGLDSLV 
LKTHSRIHSG 
VRIHTGEKPY 



21 
I 

SSDVLGNLNE 
PGGPEARGFA 
SLRPLBAEPP 
VLNSQASQAG 
ATVQFKCGAP 
PGDEDKPYKC 
EKPYKCETCG 
HCDPCGLHFR 



31 
I 

LRLRGILTDV 
PLLDFKYTSR 
TPPTAPPPGS 
SLVGBRSSGQ 
ASTPYLLTSQ 
QLCRSSFRYK 
SRFVQVARTiR 
HKSQLRLHLR 



41 51 

I I 
TLLVGGQPLR AHKAVLIACS 
XiRLSPATAPA VLAAATYLQM 



PCPQARLPSG DEASSSSSSS 
AQDTSGSPSE RARPLPGSEF 
GNLASHRTVH TGEKPYHCSI 
AHVLIHTGEK PYPCPTCGTR 
QKHGAATNTK. VHYHILGGP 



60 
120 
1B0 
240 
300 
360 
420 



70 



75 



Seq ID NO: 25 Nucleotide peqqepcei 
Nucleic Acid Accession «: U21551 

Coding sequence : 1 . , 1155 (underlined sequences correspond to start and stop 



41 



51 



1 U 21 31 

I I I I 1 1 

ATGGATTQCA GTAACGGATC GGCAGAGTGT ACCGGAGAAG GAGGATCAAA AGAGGTGGTG 
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GGGACTTTTA AGGCTAAAGA CCTAATAGTC ACACCAGCTA CCATTTTAAA GGAAAAACCA 120 

GACCCCAATA ATCTGGTTTT TGGAACTGTG TTCACGGATC ATATGCTOAC GQTGGAGTGG 160 

TCCTCAGAGT TTGGATGGGA GAAACCTCAT ATCAAGCCTC TTCAGAACCT GTCATTGCAC 240 

CCTGGCTCAT CAGCTTTGCA CTATGCAGTG GAATTATTTG AAGGATTGAA GGCATTTOGA 300 

GGAGTAGATA ATAAAATTCG ACTGTTTCAG CCAAACCTCA ACATGGATAG AATGTATCGC 360 

TCTGCTGTGA GGGCAACTCT GCCGGTATTT GACAAAGAAG AGCTCTTAGA GTGTATTCAA 420 

CAGCTTGTGA AATTGGATCA AGAATGGGTC CCATATTCAA CATCTGCTAG TCTGTATATT 480 

COTCCTGCAT TCATTGGAAC TGAGCCTTCT CTTGGAGTCA AGAAGCCTAC CAAAGCCCTG 540 

CTCTTTGTAC TCTTOAGCCC AGTGGGACCT TATTTTTCAA GTGGAACCTT TAATCCAGTG 600 

TCCCTGTGGG CCAATCCCAA GTATGTAAGA GCCTGGAAAO GTGGAACTGG GGACT3CAAG €60 

ATGGQAGGGA ATTACGGCTC ATCTCTTTTT GCCCAATGTG AAGAOGTAGA TAATGGGTGT 720 

CAGCAGGTCC TGTGGCTCTA TGGCAGAGAC CATCAGATCA CTGAAGTGGG AACTATGAAT 780 

CTTTTTCTTT ACTGGATAAA TGAAGATGGA GAAGAAQAAC TGGCAACTCC TCCACTAGAT 840 

GGCATCATTC TTCCAGGAGT GACAAGGCGG TGCATTCTOQ ACCTGGCACA TCAGTGGGGT 900 

GAATTTAAGG TGTCAGAGAG ATACCTCACC ATGGATGACT TGACAACAGC CCTGGAGGGG 960 

AACAGAGTGA GAGAGATGTT TAGCTCTGGT ACAGCCTGTG TTGTTTGCCC AGTTTCTGAT 1020 

ATACTGTACA AAGGCGAGAC AATACACATT CCAACTATGG AGAATGGTCC TAAGCTGGCA 1080 

AGCCGCATCT TGAGCAAATT AACTGATATC CAGTATGGAA GAGAAGAGAG CGACTGGACA 1140 
ATTGTGCTAT CCTGA 



Seq ID NO: 26 Protein sequence* 
Protein Accession #: AAB08528 

1 11 21 31 41 SI 

I I I I I I 

MDCSNGSAEC TGEGGSKEW GTFKAKDLIV TPATILKEKP DPNNLVPGTV PTDHMLTVEW 60 
SSEFGWEKPH IKPLQULSLH PGSSALHYAV BLFEGLKAFR GVDNKIRLFQ PNLNMDRMYR 120 
SAVRATLPVF DKEELLECIQ QLVKLDQEWV PYSTSASLYI RPAFIGTBPS LGVKKPTKAL 160 
LFVLLSPVGP YPSSGTFNPV SLWANPKYVR AWKGGTGDCK MGGNYGSSLP AQCEDVDNGC 240 
QQVbWLYGRD HQITEVGTMN LFLYWINEDG EBBLATPPLD GIILPGVTRR CILDLAHQWG 300 
EFKVSERYLT MDDLTTALEG NRVREMFSSG TACWCPVSD ILYKGETIHI PTMENGPKLA 360 
SRILSKLTDI QYGREESDWT IVLS 



Seq ID NO: 27 Nucleotide fleauence: 
Nucleic Acid Accession #: XM_039209 

Coding sequence? 656.. 2758 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I t 

TCGCGCGGGG GCCGCCCCCT CCCCTTCCCT CCACCCTGGQ CGGGGGCGCG CGAGAAGCGG 60 

TGACGTCAAG GGGCGOGCTG TGGCAGCACC TCCCCGCGCG CTAGTTAAAA AGAAGAAGAA 120 

AAGAGGGAAC GAAACATGAG AGGCTGTGTG AGAAGCTGCA GCCGCCGGCA GAGGAGACCT 180 

CAGCATCATC TAGAGCCCAG CGCTGGCCCT GCCTCCGCCT GCCCCGCCGC CGCCGTCGCC 240 

GTTTCTGTTC CTGCTACTGT CCCACCTAAA CAACTCCCGT TACACGGACA AGTGAACATC 300 

TGTGGCTGTC CTCTCCTTTT CTTCCTCCTC TTCCAACTCC TTCTCCTCCT CCCACTTCCC 360 

AGCCGCAGCA GAAAGCCCCC AACCCAACTG ACACTGGCAC AACTGCAAAC GGTGTCATCC 420 

GCACAACTTT ATCTCGCTCC TCGGGCTCCC CTAAGGCATT GGACCCATCG CCGCGTCTTT 480 

TATTTTTTGC AAAGTTGCAT CGCTGTACAT ATTTTTGTCC CCGCCACCTC CCTCTGTCTC 540 

TGGAGTGCCC TACAGCCCCG CAAACTCCTC CTGGAGCTGC GCCCTAGTGC CCCTGCTGGG 600 

CAGTGGCGTT CCCCCCCATC CTCCCGCGCC CAGCCCCTGC TGCTCTGGGC AGACGATGCT 660 

GAAGATGCTC TCCTTTAAGC TGCTGCTGCT GGCCGTGGCT CTGGGCTTCT TTGAAGGAGA 720 

TGCTAAGTTT GGGGAAAGAA ACGAAGGGAG CGGAGCAAGG AGGAGAAGGT GCCTGAATGG 780 

GAACCCCCCG AAGCGCCTGA AAAGGAGAGA CAGGAGGATQ ATGTCCCAGC TGGAGCTGCT 840 

OAGTGGGGGA GAGATGCTGT GCGGTGGCTT CTACCCTCGO CTGTCCTGCT GCCTGOGGAG 900 

TGACAGCCCG GGGCTAGGGC GCCTGGAGAA TAAGATATTT TCTGTTACCA ACAACACAGA 960 

ATGTGGGAAG TTACTGGAGG AAATCAAATG TGCACTTTGC TCTCCACATT CTCAAAGCCT 1020 

GTTCCACTCA CCTGAGAGAG AAGTCTTGGA AAGAGACCTA GTACTTCCTC TGCTCTGCAA 1080 

AGACTATTGC AAAGAATTCT TTTACACTTG COGAGGCCAT ATTCCAGGTT TCCTTCAAAC 1140 

AACTGCGGAT GAGTTTTGCT TTTACTATGC AAGAAAAGAT GGTGGGTTGT GCTTTCCAGA 1200 

TTTTCCAAGA AAACAAGTCA GAGGACCAGC ATCTAACTAC TTGGACCAGA TGGAAGAATA 1260 

TGACAAAQTG GAAGAGATCA GCAGAAAGCA CAAACACAAC TGCTTCTGTA TTCAGGAGGT 1320 

TGTGAGTGGG CTGCGGCAGC CCGTTGGTGC CCTGCATAGT GGGGATGGCT CGCAACGTCT 1380 

CTTCATTCTG GAAAAAGAAG GTTATGTGAA GATACTTACC CCTGAAGGAG AAATTTTCAA 1440 

GGAGCCTTAT TTGGACATTC ACAAACTTGT TCAAAGTGGA ATAAAGGGAG GAGATGAAAG IS 00 

AGGACTGCTA AGCCTCGCAT TCCATCCCAA TTACAAGAAA AATGGAAAGT TGTATGTGTC 1560 

CTATACCACC AACCAAGAAC GGTGGGCTAT CGGGCCTCAT GACCACATTC TTAGGGTTGT 1620 

GGAATACACA GTATCCAGAA AAAATCCACA CCAAGTTGAT TTGAGAACAG CCAGAGTCTT 1680 

TCTTGAAGTT GCAGAACTCC ACAGAAAGCA TCTGGGAGGA CAACTGCTCT TTGGCCCTGA 1740 

CGGCTTTTTG TACATCATTC TTGGTGATGG GATGATTACA CTGGATGATA TGGAAGAAAT 1800 

GGATGGGTTA AGTGATTTCA CAGGCTCAGT GCTACGGCTG GATGTGGACA CAGACATGTG 1860 

CAACGTGCCT TATTCCATAC CAAGGAGCAA CCCACACTTC AACAGCACCA ACCAGCCCCC 1920 

CGAAGTGTTT GCTCATGGGC TCCACGATCC AGGCAGATGT GCTGTGGATA GACATCCCAC 1980 

TGATATAAAC ATCAATTTAA CGATACTGTG TTCAGACTCC AATGGAAAAA ACAGATCATC 2040 
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AGCCAGAATT CTACAGATAA TAAAGGGGAA AGATTATGAA AGTGAGCCAT CACTTTTAGA 2100 

ATTCAAGCCA TTCAGTAATG QTCCTTTGGT TGGTGGATTT GTATACCGGG GCTGCCAGTC 2160 

AGAAAGATTG TATGGAAGCT ACGTGTTTGQ AGATCGTAAT GGOAATTTCC TAACTCTCCA 2220 

GCAAAGTCCT GTGACAAAGC AGTGGCAAGA AAAACCACTC TGTCTCGGCA CTAGTGGGTC 2260 

CTGTAGAGGC TACTTTTCOG GTCACATCTT GGGATTTGGA GAAGATGAAC TAGGTGAAGT 2340 

TTACATTTTA TCAAGCAOTA AAAGTATGAC CCAGACTCAC AATGGAAAAC TCTACAAAAT 2400 

TGTAGATCCC AAAAGACCTT TAATGCCTGA GGAATGCAGA GCCAOGGTAC AACCTGCACA 2460 

GACACTGACT TCAGAGTGCT CCAGGCTCTG TCGAAACGGC TACTGCACCC CCACGGGAAA 2520 

GTGCTGCTGC AGTCCAGGCT GGGAGGGGGA CTTCTGCAGA ACTGCAAAAT OTGAGCCAGC 2560 

ATGTCGTCAT GGAGGTGTCT GTGTTAGACC GAACAAGTGC CTCTGTAAAA AAGGATATCT 2640 

TGGTCCTCAA TGTGAACAAG TGGACAGAAA CATCCGCAGA GTGACCAGGG CAGGTATTCT 2700 

TGATCAGATC ATTGACATGA CATCTTACTT GCTGGATCTA ACAAGTTACA TTGTATAQTT 2760 

TCTGGGACTG TTTGAATATT CTATTCCAAT GGGCATTTAT TTTTTATCCT GTCATTAAAA 2620 

AAAAAAGACT GTTATCCTGC TACACACTCC TGTGATTTCA TTCTCTTTTA TTAATTTAAA 2680 

AATAATTTCC AGAAATGTGC AGATCCTCTG TGTGTATGTC AGCATGTTTG TTCACATATG 2940 

CACATACACA TACTCATAAC CCCTATATGC GTTGTTGCAT AACAGATGAT TTTTTAAAAT 3000 

ATATACTTCC TTATGCAAAG TAATTTACAC AGAAATTCCA TTGTAAATTG ATAATGGATT 3060 

TTTTATGTTA CTAGAAGAGA TTATTTGACT TCCCAGGAAT TTTCTGTCTG TAATCACTAA 3120 

AGTCAACTTT AATAGAGTTT TGAAACAGTA CTGTGCAATC CGATGGATCT AATTAAAAAA 3180 

AAGGCAATAT TTTTATATTA AAGTACTATA CT AG GAG AG A ATGTTTCAGA ACTCCCTGAT 3240 

GAATTTCTAA GTGAGCAACT TGATATAAAA TTGTAATCTT CATTTTTGTC AOTGTATCCA 3300 

GTTACAGAAT GCTACACACT TACCTTTTTA TTGGCTGAGA AATCTGGTTA TTTCATCTTA 3360 

ATCTCAAGAT TGTTTTCAAG TGTTTTATAA TTAAATCATA ATAGCATATT TTAAAATCAA 3420 

TCTTCCTAAA AGGTCTGCTT TTATTGTATA TTTTATTTAA CAATAGGCAC TGGGTTTGTG 3480 
TTACATATTT ATATATTTTA TTTTATTTTT ATAATATAGA CATCACCTAG 

Seq ID NO: 28 Protein sequence; 
Protein Accession if: XP_0392Q9 



1 11 21 31 41 SI 

I I I I I I 

MLKMLSFKLL LLAVALGPFB GDAKFGERNE GSGARRRRCL NGNPPKRLKR RDRRMMSQLB 60 

LLSGGEMLCG GFYPRJjSCGj RSDSPGLGRL ENKIFSVTNN TECGKLLEEI KCALCSPHSQ 120 

SLPHSPEREV LERDLVLPLL CKDYCKEPFY TCRGHIPGFL QTTADEFCFY YARKDGGLCF 180 

PDFPRKQVRG PASNYLDQME EYDKVEEISR KHKHNCFCIQ KWSGLRQPV GALHSGDGSQ 240 

RLFILlEKEGY VKIIiTPEGEI FKEPYLDIHK LVQSGIKGGD ERGLLSLAFH PNYKKNGKLY 300 

VSYTTNQERW AIGPHDHILR WEYTVSRKN PHQVDLRTAR VFLEVAELiHR KHLGGQIiLFG 360 

PDGFLYIILG DGMITLDDME BMDGLSDFTG SVLRLDVDTD MOJVPYSIPR SNPHFNSTNQ 420 

PPEVFAHGLH DPGRCAVDRH PTDININLTI LCSDSNGKNR SSARILQIIK GKDYESEPSL 480 

LEFKPFSNGP LVGGFVYRGC QSERLYGSYV FGDRNGNFIjT LQQSPVTKQW QEKPLCLGTS 540 

GSCRGYFSGH ILGFGEDELC EVYILSSSKS MTQTHNGKLY KIVDPKRPLM PEECRATVQP 600 

AQTLTSECSR LCRNGYCTPT GKCCCSPGWE GDFCRTAKCE PACRHGGVCV RPNKCLCKKG 660 
YLGPQCEQVD RNIRRVTRAG ILDQIIDMTS YLLDLTSYIV 



Seq ID NO: 29 NMCj,e9tide sequence; 
Nucleic Acid Accesaion #: NM_024756 

Coding sequence: 75.. 2924 (underlined aequences correspond to start and stop codons) 

1 11 21 31 41 51 

1 I I 1 I I 

AAGACAACGT -CACTAGCAGT TTCTGGAGCT ACTTGCCAAG GCTGAGTGTG AGCTGAGCCT 60 

GCCCCACCAC CAAGATGATC CTGAGCTTCC TGTTCAGCCT TGGGGGCCCC CTGGGCTGGG 120 

GGCTGCTGGG GGCATGGGCC CAGGCTTCCA GTACTAGCCT CTCTGATCTG CAGAGCTCCA 180 

GGACACCTGG GGTCTGGAAG GCAGAGGCTG AGGACACCAG CAAGGACCCC GTTGGACGTA 240 

ACTGGTGCCC CTACCCAATG TCCAAGCTGG TCACCTTACT AGCTCTTTGC AAAACAGAGA 300 

AATTCCTCAT CCACTCGCAG CAGCCGTGTC CGCAGGGAGC TCCAGACTGC CAGAAAGTCA 360 

AAGTCATGTA CCGCATGGCC CACAAGCCAG TGTACCAGGT CAAGCAGAAG GTGCTGACCT 420 

CTTTGGCCTG GAGGTGCTGC CCTGGCTACA CGGGCCCCAA CTGCGAGCAC CACGATTCCA 480 

TGGCAATCCC TGAGCCTGCA GATCCTGGTG ACAGCCACCA GGAACCTCAG GATGGACCAG 540 

TCAGCTTCAA ACCTGGCCAC CTTGCTGCAG TGATCAATGA GGTTGAGGTG CAACAGGAAC 600 

AGCAGGAACA TCTGCTGGGA GATCTCCAGA ATGATGTGCA CCGGGTGGCA GACAGCCTOC 660 

CAGGCCTGTG GAAAGCCCTG CCTGGTAACC TCACAGCTGC AGTGATGGAA GCAAATCAAA 720 

CAGGGCACGA GTTCCCTGAT AGATCCTTGG AGCAGGTGCT GCTACCCCAC GTGGACACCT 780 

TCCTACAAGT GCATTTCAGC CCCATCTGGA GGAGCTTTAA CCAAAGCCTG CACAGCCTTA 640 

CCCAGGCCAT AAGAAACCTG TCTCTTGACG TGGAGGCCAA CCGCCAGGCC ATCTCCAGAG 900 

TCCAGGACAG TGCCGTGGCC AGGGCTGACT TCCAGGAGCT TGGTGCCAAA TTTGAGGCCA 960 

AGGTCCAGGA GAACACTCAG AGAGTGGGTC AGCTGCGACA GGACGTGGAG GACCGCCTGC 1020 

ACGCCCAGCA CTTTACCCTG CACCGCTCGA TCTCAGAGCT CCAAGCCGAT GTGGACACCA 1080 

AATTGAAGAG GCTGCACAAG GCTCAGGAGG CCCCAGGGAC CAATX3GCAGT CTGGTGTTGG 1140 

CAACGCCTGG GGCTGGGGCA AGGCCTGAGC CGGACAGCCT GCAGGCCAGG CTGGGCCAGC 1200 

TGCAGAGGAA CCTCTCAGAG CTGCACATGA CCACGGCCCG CAGGGAGGAG GAGTTGCAGT 1260 

ACACCCTGGA GGACATGAGG GCCACCCTGA CCCGGCACGT GGATGAGATC AAGGAACTGT 1320 

ACTCCGAATC GGACGAGACT TTCGATCAGA TTAGCAAGGT GGAGCGGCAG GTGGAGGAGC 1380 

TGCAGGTGAA CCACACGGCG CTCCGTGAGC TGCGCGTGAT CCTGATGGAG AAGTCTCTGA 1440 
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TCATGGAGGA GAACAAGGAG GAGGTGGAGC GGCAGCTCCT GGAGCTCAAC CTCACGCTGC 1500 

AGCACCTGCA GGOTGGCCAT GCCGACCTCA TCAAGTACGT GAAGGACTGC AATTGCCAGA 1560 

AGCTCTATTT AGACCTGGAC GTCATCCGGG AGGGCCAGAG GGAOQCCACG OQTGCCCTGG 1620 

AGGAGACCCA GGTGAGCCTG GACGAGCGGC GGCAGCTGGA CGGCTCCTCC CTGCAGGCCC 1680 

TGCAGAACGC CGTGGACGCC GTGTOGCTGG COGTGGACGC GCACAAAGOG GAGGGCGAGC 1740 

GGGCGCGGGC GGCCACGTOG OGGCTCCGGA GCCAAGTGCA GGCGCTGGAT GACGAGGTGG 1800 

GCGCGCTQAA GGCGGCCGCG GCCQAGGCCC GCCACGAGGT GOGCCAGCTG CACAGCGCCT I860 

TOGCOGCCCT GCTGGAGGAC GCGCTGCGGC ACGAGGCGGT GCTGGCOGCG CTCTTCGGGG 1920 

AGGAGGTGCT GGAGGAGATG TCTGAGCAGA CGCCGGGACC GCTGCCCCTG AGCTACGAGC 1980 

AGATCCGCGT GGCCCTGCAG GACGCCGCTA GCGGGCTGCA GGAGCAGGCG CTCGGCTGGG 2040 

AOGAGCTGGC CGCCOGAGTG ACGGCCCTGG AGCAGGCCTC GOAGCCCCCG CGGCOGGCAG 2100 

AGCACCTGGA GCCCAGCCAC GACGCGGQCC GCGAGGAGGC CGCCACCACC GCCCTGGCCG 2160 

GGCTGGCGOG GGAGCTCCAG AGCCTGAGCA ACGACGTCAA GAATGTCGGG CGGTGCTGCG 2220 

AGGCCGAGGC OGGGGCCGGG GCCGCCTCCC TCAACGCCTC CCTTGACGGC CTCCACAACG 2280 

CACTCTTOGC CACTCAGCGC AGCTTGGAGC AGCACCAGCG GCTCTTCCAC AGCCTCTTTG 2340 

GGAACTTCCA AGGGCTCATG GAAGCCAACG TCAGCCTGGA CCTGGGGAAG CTGCAGACCA 2400 

TGCTGAGCAG GAAAGGGAAG AAGCAGCAGA AAGACCTGGA AGCTCCCCGG AAGAGGGACA 2460 

AGAAGGAAGC GGAGCCTTTG GTGGACATAC GGGTCACAGG GCCTGTGCCA GGTGCCTTGG 2520 

GOGOGGCGCT CTGGGAGGCA GGATCCCCTG TGGCCTTCTA TGCCAGCTTT TCAGAAGGGA 2580 

CGGCTGCCCT GCAGACAGTG AAGTTCAACA CCACATACAT CAACATTGGC AGCAGCTACT 2640 

TCCCTGAACA TGGCTACTTC CGAGCCCCTG AGCGTGGTGT CTACCTGTTT GCAGTGAGCG 2700 

TTGAATTTGG CCCAGGGCCA GGCACCGGGC AGCTGOTGTT TGGAGGTCAC CATCGGACTC 2760 

CAGTCTGTAC CACTGGGCAG GGGAGTGGAA GCACAGCAAC GGTCTTTGCC ATGGCTGAGC 2820 

TGCAGAAGGG TGAGCGAGTA TGGTTTGAGT TAACCCAGGG ATCAATAACA AAOAGAAGCC 2880 

TGTOGGGCAC TGCATTTGGG GGCTTCCTGA TGTTTAAGAC CTGAACCCCA GCCCCAATCT 2940 

GATCAGACAT CATGGACTCG CCCAGCTCTC CTCGGCCTGG GGCTCTGGCC AAGGATGGGC 3000 

TGGAGGTCAT TCAGTTGGTC TGTCTCTTCC CTGGAAACCT TCTGCAAAGA TCGTGTGGTG 3060 

TACGTGGCTT CCCTOTAACC ACATGGGGCT TGGCCATTTC TCCATGATGA GAAGGACTGG 3120 

AATGCTTCTC CGGGCAGGAC ATGGTCCTAG GAAGCCTGAA CCTTGGCTTG GCATGCCTTC 3180 

TCAGACAGCA CGGCCTGGGC TCCAACTCTT CACCACACCC TGTATTCTAC AACTTCTTTG 3240 

GTGTTTTGCT CCTCCTGTGG TTGGAAACTT CTGTACAACA CTTTAAACTT TTCTCTTGCT 3300 

TCCTCTTCTC TTCTCCCTTA TCGTATGATA GAAAGACATT CTTCCCCAGG AGGAATGTTT 3360 

AAAATCGAGG CAACATTTTQ GCCAACATTG GAAAGCACTA GAGGGCAATG GGATTAAACC 3420 

AACCTGCTTG GTCTCTATTA GTCAGTAATG AAGACGACAG CCTGGCCAAC CAAGGGAAAG 3480 

GAAATTAGTA TCTTTAGTTT CAGTCATTCC TTGTAGGATA TGOTTEAGCT GTGCCCCCAC 3540 

CTAAAATATC ATCTTGAATT GTAATCCCTA TAATCCCCAC ATCAAGGGAG AGATCAGGTG 3600 

GAGGTAATTG GATCTTGGGG GCGGTTCCCC CATGCTGTTC TTGTGATAGT TCTCACGAGA 3660 

TCTGATGATT TTATAAGTTT GATAGTTCCT CCTGTGTTCA TTCTCCTTCC TGCCACCTTG 3720 

TGAAGATGCC TTGGTTCCTC TTCACTGTCT GCCATGATTG TAAGTTTCCT GAGGCCTCCC 3780 
CAGCCATGTG GAACAGTGAG TCAATTAAAC CTCTTTCCTT TATAAATT 



Seq ID NO: 30 Protein sequence: 
Protein Accession ft; NPJ>79032 

1 11 21 31 41 51 

I I I I I I 

MILS LLPS LG GPLGWGLLGA WAQASSTSLS DLQSSRTPGV WKAEAEDTSK DPVGRNWCPY 60 

PMSKLVTLLA LCKTEKPLIH SQQPCPQGAP DCQKVKVMYR MAHKPVYQVK QKVLTSLAWR 120 

CCPGYTGPNC BHHDSMAIPE PADPGDSHQE PQDGPVSFKP GHLAAVINEV EVQQEQQEHL 180 

LGDLQNDVHR VADSLPGLWK ALPGNLTAAV MEANQTGHEF PDRSLEQVLL PHVDTFLQVH 240 

FSPIWRSFNQ SLHSLTQAIR NLSLDVEANR QAISRVQDSA VARADFQELG AKFEAKVQEN 300 

TQRVGQLRQD VEDRLHAQHF TLHRSISBLQ ADVDTKLKRL HKAQBAPGTN GSLVLATPGA 360 

GARPEPDSLQ ARLGQLQRNL SELHMTTARR EEELQYTLED MRATLTRHVD EIKELYSBSD 420 

ETFDQISKVE RQVEBLQVNH TALRELRVIL MEKSLIMEBN KBEVERQLLE LNLTLQHLOG 480 

GHADLIKYVK DCNCQKLYLD LDVIREGQRD ATRALEETQV SLDERRQLDG SSLQALQNAV S40 

DAVSLAVDAH KAEGERARAA TSRLRSQVQA LDDEVGALKA AAAEARKEVR QLH8AFAALL 600 

EDALRKEAVL AALFGEBVLE EMSBQTPGPL PLSYEQIRVA LQDAASGLQE QALGWDELAA 660 

RVTALEQASE PPRPAEHLEP SHDAGREBAA TTALAQLARE LQSLSNDVKN VGRCCEAEAG 720 

AGAASLNASL DGLHNALPAT QRSLEQHQRL FHSLFGNFQG LMEANVSLDL GKLQTMLSRK 780 

GKKQQKDLBA PRKRDKKEAE PLVDIRVTGP VPGALGAALW BAGSPVAPYA SFSEGTAALQ 640 

TVKFNTTYIN IGSSYFPEHG YFRAPERGVY LPAVSVEFGP GPGTGQLVFG GHHRTPVCTT 900 
GQGSGSTATV FAMAELQKGE RVWFELTQGS ITKRSLSGTA FGGFLMFKT 

Seq ID NO: 31 Nucleotide sequence ? 
Nucleic Acid Accession #: AB037715 

Coding sequence i 370.. 3489 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GAACGCTCAC AGAACAGGCA GTGCAATTCC ATGTTCCTCT TAAGTATGTT AGCCCTACCG 60 

GGAGCTGAGC TGGCCAGTCT ACTTGGAGAG GAAAAGTAGA TCTGGGGAAO GTGGAAGGGT 120 

CAGTTCCTAA GTGACTTCCT CCTCGGGGAT GGTAAGGGCA TTTGCTGATC TCCAGTGACT 180 

GCCTGGTGCC TCATGGTCAG ACTCGGCTGT CTCACTCCCA GATATCTGAT T TTGCA AAAA 240 

GGGACACACC TATCTGCAGC AAAGAAGACA CTGACCAGAT TGCGAGCGGT GCTTTTGGAT 300 
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GCTCTGTAGC CACCCGGQGC CCAGGAGGAC 
TCGGAGACCA. JGGCAGTGCA GCTGGTGCCC 
QAGOGCCGCC GATGTCAAGT ACATCTTCTT 
CCCAAGCTGT TGGCCAAGGA GCTTCTTQAC 
5 AAGGAGTACT TTGGAATAGC ATTCACAGAT 
GATCGAAGAG TATTGGAACA TGACTTCCCT 
TGTGTCAGGT TCTATATAQA AAGCATTTCA 
TTCTTTCTGA ACGCGAAGTC CTGCATCTAC 
GTGTTTGAAT TAGCTTCCTA TATTTTACAG 

10 GTTGTGAGGA GTGACTTGAA GAAGCTGCCA 
CCTTCCCTGG CCTACTGTGA AGACAGAGTC 
• ACAAGAGGTC AAGCAATCGT AAACTACATG 
GTTCACTATT ATGCAGTGAA GGACAAGCAG 
AAAGGGATCT TCCAGTATGA CTACCATGAT 

15 AGACAGTTGG AAAACCTGTA CTTCAGAGAA 
CGCAGGGCTT CAGTGACAAG GAGGACGTTT 
TATGCATGTC CGGCATTGAT CAAGTCCATC 
TATCTGGACA GAAAGCAGAG TAAGTCCAAA 
GCCATCGACC TGACCGAGAC GGGGACGCTG 

20 AAGGGGAAGA TCATCAQCGG CAGCAGCGGC 
GATAGCTCGC AGTCGGCCAA GAAGGACATG 
CTGGAGGAAA CCCTGCGTCA GAGGCTGGAG 
GAGCTCACGG GCAAGCTGCC AGTAGAATAT 
GTTCGGAGAA QAATAGGAAC AGCCTTCAAA 

25 GAGGAAGCTG AGCTGGAACG CCTGGAACGA 
GCCGCCCGCC GCCTAGCCAG TGACCCCAAC 
ACCTCGTATC TGAATGCACT QAAGAAACTG 
CGCATCAAGT CTGGGAAGAA ACCCACCCAG 
ATTGCCAGTG AAGACAGCTC CCTCTCAGAT 

30 GTTACCAGCA CAATATCCCC CCTACATTCT 
TCGCACAACA GGCCTCCTCC TCCCCAGTCC 
CGCAACGACT ATGACAAGTC ACCCATCAAG 
GAACCCTATG AGAAGGTCAA GAAGCGCTCC 
TTCCCCAGCA CAGGAAGCTG TGCGGAAGCC 

35 CCCATCCGCG GCCTCCCGCA CTGGAACTCC 
CGGGTCCGGA GTCCCCACTA CGTCCATTCC 
CTGCACAGCC TCGCACTGCA CTTTAGGCAC 
CTCCTGGGCT CGGAAAACGA CACCGGGAGC 
AGCAACGGCT CAGACCCCAT GGACGACTGC 

40 CACTACTACC CGGCGCAGAT GAACGCCAAC 
AAGGCGCGCC AGAGGCAGAG GCAGCGGCAG 
TCGGGCAGCA TGCCCAACCT GOCGGCGCGC 
GGCGGTGTGT ACCTGCACAG CCAGAGCCAG 
CCGCTGTACA TOGAGGGCGG CGCCACGCCC 

45 GAGTGCCACT ACAGCGTCAA GGCTCAGTTC 
CTGTTCAAGG AGAGCTGGCG CGGCGGCGGC 
CCGTCGCGAT CGCAGATCCT GCGGACTCOG 
GGCGCGGGCC GTGCCGCCGT CTCAGACGAG 
TCGCACAAGG AGCACAGCOG CCTGTCGCAC 

50 CAGTACAGCA CCTCCTCCCA GAGCACCTTC 
CAGATGTGCA AGGCCACGTC AGCTGCCTTA 
AGTGAAATTG GAGCCACCCC CCCAAGCAGC 
GAAGGAACAG AAAACTCACC CATTCTGOAT 
GATGAATAGA GGAGCTACAA TGATAGCTGT 

55 GCTGATGTCC AGTGGTACGG GCAGGAAAAA 
CCGGCCTAAT CTGACCGCCT CAACGCCATT 
TTACCCAGAC GCACCGTCAC CCTGCACCAG 
CTCCGCATTC CCTCCCCCTT GAAAACCTOA 
CACTGTGTGT CCCCTGGCGC TCTTGCCCAT 

60 • CTTGGTGGCT TCCCTCTGCC ATGACAGCCC 
GGCATCCAAT TCCTGCGGAT AAGTAGCGTT 
CAGGGTGACC CAGAAAGACG ATTCAGCTGT 
CAAGCACTTC ATGAAGAGGA GGCCTCGTGG 
GATGGGACAG CTTGTGGGQA TGGCTATGGG 

65 ACACCAGAAA TGCATCGGAG GACCACAATC 
TAAAAACATA AAAAATTAAG AGGGGCCAAG 
TTTTAAATTC TGAACTGCTA CTACACACAA 
CTCTCTCTAG CCCTCTCCCT TACTGGCCCA 
GCCCCAATGC CACGGTAAAG GCGAGGAAGT 

70 ATCCATCTGG ACACAAAGAG AGACCTGTGG 
CATGCAGGGG GTTCAGCCGA GCCCAAGACT 
AACGTAAGGT GATAATGGCC AAAAOTGGTT 
ATCCTATTTT TTTGCATAAG GTGTTTCATT 
ACATTGCGAT CCATTCAGTG TTTAACTOTC 

75 GTGACAAAAG AGCTCAGATC CGACTTCTCC 
TGCCCTTAGG TAGAAAGATT TGACTCGTGT 



TGACTCGGCA GCAGGATTCG TGCATGGGAA 360 

GACTCAGCTC TCGGCCTGCT GATGATGACG 420 

GATGACAGGA AGCTGGAACT CCTAGTACAG 480 

CTTGTGGCTT CTCACTTCAA TCTGAAGGAA 540 

GAAAOGGGAC ACTTAAACTG GCTTCAGCTA 600 

AAAAAGTCAG GACCCGTGGT TTTATACTTT 660 

TACCTGAAGG ATAATGCTAC CATTGAGCTT 720 

AAGGAGCTTA TTGACGTTGA CAGCGAAGTG 780 

GAGGCAAAGG GAGATTTTTC TAGCAATGAA 840 

GCCCTTCCCA CCCAAGCCCT GAAGGAGCAC 900 

ATTGAGCACT ACAAGAAACT GAACGGTCAG 960 

AGCATCGTGG AGTCTCTCCC AACCTACGGG 1020 

GGCATACCAT GGTGGCTGGG CCTGAGCTAC 1080 

AAAGTGAAGC CAAGAAAGAT ATTCCAATGG 1140 

AAGAAGTTTT CCGTGGAAGT TCATGACCCA 1200 

GGGCACAGCG GCATTGCAGT GCACACGTGG 1260 

TGGGCTATGG CCATAAGCCA ACACCAGTTC 1320 

ATCCATGCAG CACGCAGCCT GAGTGAGATC 1380 

AAGACCTCGA AGCTGGCCAA CATGGGTAGC 1440 

AGCCTGCTGT CTTCAGGTTC TCAGGAATCA 1500 

CTGGCTGCCT TGAAGTCCAG GCAGGAAGCT 1560 

GAACTGAAGA AGCTQTGTCT CCGAGAAGCT 1620 

CCCCTGGATC CAGGGGAGGA ACCACCCATT 1680 

CTGGATGAAC AGAAAATCCT GCCCAAAGGA 1740 

GAGTTTGCCA TTCAGTCCCA GATTACGGAG 1800 

GTCAGCAAAA AACTGAAGAA ACAAAGGAAA 1860 

CAGGAGATTG AAAATGCAAT CAATGAGAAC 1920 

AGGGCTTCGC TGATCATAGA OGATGGAAAC 1960 

GCCCTTGTTC TTGAGGATGA AGACTCTCAG 2040 

CCTCACAAGG GACTCCCTCC TCGGCCACOG 2100 

CTGGAGGGAC TCCGACAGAT GCACTATCAC 2160 

CCCAAAATGT GGAGTGAGTC CTCTTTAGAT 2220 

TCTCACAGCC ATTCCAGCAG CCACAAGCGC 2280 

GGCGGAGGAA GCAACTCCTT GCAGAACAGC 2340 

CAGTCCAGCA TGCCGTCCAC GCCAGACCTG 2400 

ACGAGGTCGG TGGACATCAG CCCCACCCGA 2460 

CGGAGCTCCA GCCTGGAGTC CCAGGGCAAG 2520 

CCCGACTTCT ACACCCCGCG GACTCGTAGC 2580 

TCGTCGTGCA CCAGCCACTC GAGCTCGGAG 2640 

TACTCCACGC TGGCCGAGGA CTCGCCGTCC 2700 

CGGGCGGCGG GCGCACTGGG CTCAGCCAGC 2760 

GGGGGTGCGG GGGGCGCGGG GGGCGCGGGG 2820 

CCCAGCTCGC AGTACCGCAT CAAGGAGTAC 2880 

GTGGTGGTGC GCAGCCTGGA GAGCGACCAG 2940 

AAGACGTCCA ACTCCTACAC GGCGGGCGGC 3000 

GGCGACGAGG GCGACAOGGG CCGCCTGACG 3060 

TCGCTGGGCC GCGAGGGCGC CCACGACAAG 3120 

CTGCGCCAGT GGTACCAGCG TTCCACCGCC 3180 

ACCAGCTCCA CCTCCTCGGA CAGCGGCTCG 3240 

GTGGCGCACA GCAGGGTCAC CAGGATGCCC 3300 

CCTCAAAGCC AGAGAAGCTC GACACCGTCA 3360 

CCCCACCACA TCCTAACCTG GCAGACTGGA 3420 

GGGTCTGAGT CTCCACCTCA CCAAAGTACT 3480 

TTCCTGGATT CCTCCCTCTA TCCAGAACTA 3540 

GCCAAGCCCG GGACCCTCGT GTGAGCCAGC 3600 

CTGAGATCAC CTCACTGCCT CTCATTTGCC 3660 

CTTTGGCCCT CAGCACTTTT TTTCTCCTGT 3720 

CTGAGGAGAC ATTCTGGAAG GTTCCGGTCC 3780 

AGAGAGCCAG ACACCAATCC TCAATGGCAC 3840 

CTAGGCCAGG AACCATCAGG GGGGCCAGCC 3900 

GGGAGAGAAC GGGAAAGGGG ACTTGGGTTA 3960 

GTCCAGCCTG CCACCCATAC GTAGGCCAAC 4020 

CATATTCAGT TTACACCTGA AATATTCCTT 4080 

GGAAGGGGAG GTTGAGAAAG GAAGTTCTCG 4140 

AGTTCTATGC TQCCAAAGAT TAAAAATAAA 4200 

AGGAAGACAT TCTTTCTGCA AGGAAATTTC 4260 

GTGAAAGTCA ACCCTATGTA AACTGGTGTC 4320 

CTTCTCTCTC CGTAGAGAGC CTGAAAAACT 4380 

CTTGGCTGGC GTTGCTGACT CACAGTCGCC 4440 

GAGTCATAGA GGGTACTGTT AGCCCCGGTC 4500 

CAAAGCTGCT TTCCTTTCAG GATTTGTAGT 4560 

CTCTCTCATT AAACCAACCA GTAAAAGCGT 4620 

TTCGTTTTTA TGGGAAACCA AGGGAAAAGC 4680 

GTGGCTCATT TTCTGTTCGT TAGCACTTGT 4740 

TATGTGTCAC TTATTCCAAG AACCCAACTA 4800 

GTCTACTAGC CAACAGGCAG AGCAGGGTTG 4860 
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AAAAAAATAT 
ACATTTGTGT 
GCACGTTTAC 
GGCCTCAAAA 
CACAAAAAGG 
GGGGTGGGGG 
CCAGGGACAQ 
AGCATAAAAA 
GGCTCTACCA 
CCTTCTGCTO 
GGACCOOGCA 
GGGCCTGCTT 
TTCATATTCC 
ATTAAATQAT 
AGCTACAAAG 
TGGGCCATTT 
CAGGAAAAGT 
ATGCAQAGTA 
GTTGCAAGGA 
CAGAGAOAAA 
AAGACAOAGA 
TACATAAAAC 
TGTAGTTATT 
AGGTGAAATA 
AAAGAAATTG 
GAGCCAGGTT 
AATTTGTTTT 
TCAGATGACC 
AATAGTTTAT 
AGGTTGCTCC 
TGTAATCAAT 
GCTTTTAATT 
AGTGTTQTCA 



CAGCTCCCAA 
GCAGATACCA 
ATGfTTTTGAG 
ATACTTTTAT 
TTTCCGCAGA 
TGGAATTTTT 
GAGCCAGGGT 
CAAAGAAAAA 
GACCAGGAGG 
CAGCCTGGAG 
GGGACAAGGC 
TCCCCAGCTC 
CAGAACGCTT 
TCTAGGGATT 
AACAGTGATT 
TTCTTTCTCC 
CAAAAGGGAA 
GCTTGAAATC 
ATGAGAGGCA 
TGGAGAGCAG 
TTAAGTAAAA 
GTTAGTOCTT 
GTACACAAGC 
CGAAGTCCTT 
CCTGTTTCAG 
GATTTTTTAT 
CATTCAGTAT 
AGTTACTGCT 
TACAAGTTGT 
TGAAACTGAC 
GAAAAAGATG 
TATTCTTTTT 
ACACTTATTA 



AGGGCCCATG 
AAAGAGGAGG 
CTATGCTTCA 
AGTAACAAGT 
GGTGGTATGC 
TTCTCACTCT 
GGGGGTAGTT 
TCTTOGCTTT 
GTAAGGATGG 
ACCACCGAGA 
GGGCCGTGGC 
CATGCATGGC 
TAAGTGTACA 
CACTGGGGGA 

■ rmrmTi ' 

CAAAGAAGAT 
AAGGCAGCTG 
TAGTCTGGAG 
AAAATTCTAA 
GAATTACAGT 
CAGGTTTTAC 
TGAGACTGAC 
ACTTGCAAAC 
GGTCTGATAT 
CCAGAAGACT 
TTTATTATAT 
TAGTTTAGTT 
TAGTTAACTA 
GTAAAATGGA 
TGTAGAGCAT 
TACGTTGTAG 
GTATTAAGAA 
AAGCATTTTC 



TGTCTACATC 
AAAGAAGAAA 
AACACAACTG 
GCACGACTTT 
TGTGCTTTTG 
AATGACTTCC 
TTGTGGGAAA 
TCATGTATGT 
ACACTAAAAT 
GTCGAGCTGG 
CTCCTCCACC 
TGGACTGQTG 
CCTGCAGGAT 
TATTTTTGTT 
CTCCCTTCCC 
TCATGGATAG 
ATGAGGTTAC 
AAAACTGGAT 
AGATTTGGGT 
TCCAACAAAC 
TGTTTAGCTG 
ATGATTAATG 
TCTTTATCCC 
AAAGCCCCTA 
GGTGAAAACA 
GCAGGTGAGT 
CIAAATATAG 
GGTGTAAAGT 
CTCTAGTTTA 
GTAAAATGAT 
ACAAAGTTGC 
TTTGTATAGT 
AAAATG 



ATCAGTTACT 
AAAATTAATG 
GAAAGCCATC 
AGTTGGGTTA 
GCGCAAGTGG 
TATTGGAAAG 
GCAGAACTGA 
GGAATCCAAG 
GAAACAAATA 
GGCACACACA 
AAGTCTCTCT 
ATTCCAGGGT 
AAAGAGATAC 
GCTTTTACTT 
CATTCAGAAA 
TCAGACTGAA 
ATGGTTACAT 
CAAGATTCTA 
TATATTTTCA 
ATCATGATAG 
AGTTCAGTTA 
ATCAGTGTGG 
TATTTCTTTA 
TTGGATTCTT 
CATACATCAG 
GTTGAAACTG 
CAAACCCCAT 
TTTACATATA 
ATAATGGGGG 
TTTACTGGAT 
AGAATTAAAA 
ATCTTTACAT 



GTCATGCACC 
TGTGGGAGCT 
AATCTTCAAA 
TTCAAGATGG 
TGGGGGGATG 
GCATTGACAG 
AGTTAGCTTA 
AATAACCATA 
CCAAGGTATT 
CACCTGGCOG 
AGACAATTCA 
GCAGAAGGGA 
CGGTTACATT 
TCATGGTTAG 
CATTATACAT 
CTGTGTGCAA 
GTTCTACATC 
GCCCACTGGA 
ACTTGGGGGA 
TCTGGTAGTC 
ATACAAAATG 
TGGGAAATGA 
AAACAAAATA 
CGGATGCGTA 
ACTATGTTGT 
TTAAAATTCC 
CCAGGTGCTA 
CATTAATTTC 
AAAAAAGATT 
TCTGTTCAAC 
AAAGAAATCT 
TTTGCAAAAC 



4920 
4980 
5040 
5100 
5160 
5220 
52B0 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 



Seq ID NOt 32 protein sequence: 
Protein Accession #; BAA92532 



40 



45 



50 



55 



60 



65 



i 
I 

MAVQLVPDSA 
FGIAFTDBTG 
NAKSCIYKEL 
AYCKDRVIEH 
FQYDYHDKVK 
PAIiIKSIWAM 
IISGSSGSLL 
GKLPVEYPLD 
RLASDPNVSK 
BDSSLSDALV 
YDKSPIKPKM 
GLPHWNSQ3S 
SENDTGSPDF 
QRQRQRQRAA 
IEGGATPVW 
SQILRTPSLG 
TSSQSTFVAH 
ENSPILDGSE 



11 
I 

LGLLMMTEGR 
HLNWLQLDRR 
IDVDSBWFE 
YKKLNGCTRG 
PRKIFQWRQL 
AISQHQFYLD 
SSGSQESDSS 
PGEEPPIVRR 
KLKKQRKTSY 
LEDEDSQVTS 
WSESSLDBPY 
MPSTPDLRVR 
YTPRTRSSNG 
GALGSASSGS 
RSLESDQECH 
REGAHDKGAG 
SRVTRMPQMC 
SPPHQSTDE 



21 
I 

RCQVHLLDDR 
VTjKHDFPKKS 
LASYILQEAK 
QAIVNYMSIV 
ENLYFREKKF 
RKQSKSKIHA 
QSAKKDMLAA 
RIGTAFKLDE 
LNALKKLQEI 
TISPLHSPHK 



31 
I 

KLELLVQPKL 
GPWLYFCVR 
GDPSSNEWR 
RSLPTYGVHY 
SVEVHDPRRA 
ARSLSEIAID 
LKSRQEALEE 
QKILPKGEEA 
ENAINENRIK 



SPHYVHSTRS 
SOPMDDCSSC 
NPKLAARGGA 
YSVKAQFKTS 
RAAVSOELRQ 
KATSAALPQS 



VDISPTRLHS 
TSHSSSEHYY 
GGAGGAGGGV 
NSYTAGGLFK 
WYQRSTASHK 
QRSSTPSSEI 



41 
I 

LAKELLDLVA 
FYIESISYLK 
SDLKKLPALP 
YAVKDKQGIP 
SVTRRTFGHS 
LTETGTLKTS 
TLRQRLEELK 
ELERLEREFA 
SGKKPTQRAS 
RPPPPQSLBG 
TGSCAEAGGG 
LALHFRHRSS 
PAQMHANYST 
YLKSQSQPSS 
ESHRGGGGOE 
EHSRLSHTSS 
GATPPSSPHH 



51 
I 

SHFNLKEKEY 
DNATIELFFL 
TQALKEHPSL 
WWLGLSYKGI 
GIAVHTWYAC 
KLANKGSKGK 
KLCLREAELT 
IQSQITEAAR 
LIIDDGNIAS 



SNSLQNSPIR 
SLESQGKLLG 
IAEDSPSKAR 
QYRIKEYPLY 
GDTGRLTPSR 
TSSDSGSQY3 
ILTWQTGEAT 



Seq ID NO: 33 ft^clepWe aeguenc^: 

Nucleic Acid Accession #: NM_014331 

Coding sequence: 1..1506 (underlined sequences 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



correspond to start and stop codons) 



70 



75 



ATGGTCAGAA 
AACGGGAGGC 
AAGAGGAAAG 
GGAATCTTCA 
ACCATCTGGA 
GGAACAACTA 
TTACCAGCTT 
GTGATATCCC 



11 
I 

AGCCTGTTGT 
TGCCTTCCCT 
TCACTTTACT 
TCTCTCCTAA 
CGGTGTGTGG 
TAAAGAAATC 
TTGTACGAGT 
TGGCATTTGG 



21 
I 

GTCCACCATC 
GGGCAACAAG 
GAGGGGAGTC 
GGGCGTGCTC 
GGTCCTGTCA 
TGGAGGTCAT 
CTGGGTGGAA 
ACGCTACATT 



31 
I 

TCCAAAGGAG 
GAGCCACCTG 
TCCATTATCA 
CAGAACACGG 
CTATTTGGAG 
TACACATATA 
CTCCTCATAA 
CTGGAACCAT 



41 

1 

GTTACCTGCA 
GGCAGGAGAA 
TTGGCACCAT 
GCAGCGTGGG 
CTTTGTCTTA 
TTTTGGAAGT 
TACGCCCTGC 
TTTTTATTCA 



51 
I 

GGGAAATGTT 60 

AGTGCAGCTG 120 

CATTGGAGCA 180 

CATGTCTCTG 240 

TGCTGAATTG 300 

CTTTGGTCCA 360 

AGCTACTGCT 420 

ATGTGAAATC 480 
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45 



CCTGAACTTG 
AGCATGAGTG 
GCAATTCTGA 
TTTAAAGACG 
TATGGAATGT 
AACCCTGAAA 
TATGTGCTGA 
AATGCAGTGG 
ATCTTTGTTG 
TTATTCTATG 
CGCAAGCACA 
TTCTCTGGAG 
GGGCTGGCAQ 
TTCAAGGTGC 
CTTTCCCTCT 
GTCCCTGOGT 
TCAGAQAAAA 
TTATGAACTA 
TTTTTACTTC 
CAGTTATTTT 
AACTCTATGT 
GTCTCTGATA 
TCTCTACAAC 
TATATATGGG 
TTTTCAATTC 
ATTTTACATT 
GCTTTAATGG 
TTAAAGAAGA 
AAAAATCCTT 
TTATCTGTCA 
AGCAAGAGTT 
TACCCCTGAT 
TGAGAGAAAT 
CTACATGCAA 
TGAATTTTGA 
CTTCAGATGA 
AAGAAATGTC 
ATCTAGGCTT 
CTGATAAGAA 
GTTTTGCCAG 
GCACTTTGGG 
CAACATGGAG 
GCTGGTAATC 
GAGGTTGCAG 
CCATCTCCAA 



CGATCAAGCT 
TCAGCTGGAG 
TAATTATAGT 
CGTTTTCAGG 
ATGCATATGC 
AAACCATTCC 
CAAATGTGGC 
CAGTGACCTT 
OCCTCTCCTG 
TTGCGTCTCG 
CTCCTCTACC 
ACCTCGACAO 
TTGCTGGGCT 
CACTGTTCAT 
ATTCGGACCC 
ATTATCTCTT 
TAACCAGAAC 
ATGGACTTGA 
ATTTTCTGAA 
TATTCATATA 
AGTTATAGAA 
CCTACCTATT 
ATATGTTAGC 
TTTTGTAAAO 
TGAAAAAAAG 
GACATTGCAT 
ATTATACCCA 
GTTTCTAGGG 
GAGAATTTAT 
TTTTTTTTTT 
AGTTTGGTAT 
GAGTCTATCT 
AACCAACAAA 
TGTTAGTAAT 
CAGTTTGTGC 
AACTGTCCAG 
GCTGTAAATA 
TGTCAGTAAT 
GAAAATTGAA 
TATTAGAAAA 
AGGCTGAGGG 
AAACCCCATC 
TCAGCTATTG 
TGAGCCAAGA 
AAAAAAAAAA 



CATTACAGCT 
OGCOCGGATC 
CCCTGGAGTT 
AAGAGATTCA 
TGGCTGGTTT 
CCTTGCAATA 
CTACTTTACG 
TTCTGAGCGG 
CTTTGGCTOC 
AGAGGGTCAC 
AGCTGTTATT 
TCTTTTGAAT 
GATTTATCTT 
CCCAGCTTTG 
ATTTAGTACA 
TATTATATGG 
ATTACAAATA 
GATCTTGGCA 
AGTCTAGAGA 
TTTTAGCATA 
AGTGAATATG 
GGGGTTAGGA 
ACGGCAAAGA 
ATGGTTTTAC 
CATACATCAT 
TGCTTCCCCT 
GAGCACTTTG 
GCTACTGTTT 
TATGTCAGAT 
TCACATCAGT 
TAAATCCTCA 
AAACATATGC 
GAAGATGTTC 
TCTGAAGTTT 
ATTTTCTTTA 
ATTAATTAGG 
AGATTTACAA 
TTCCACACCT 
ATGAGAATCT 
TACTGTGAGC 
GGTGGATCAC 
TCTACTAAAA 
AGGAGGCTGA 
TTGCACCACT 



GTGGGCATAA 
CAGATTTTCT 
ATGCAGCTAA 
AGTATTACGC 
TACCTCAACT 
TGTATATCCA 
ACCATTAATG 



ATGAACGGTG 
CTTCCAGAAA 
GTTTTGCACC 
TTCCTCAGTT 
OGATACAAAT 
2TTTCCTTCA 
GGGATTGGCT 
GACAAGAAAC 
ATACTGGAAfl 
ATCTOCCCAA 
ATTACAACTT 
TTCGAACTAA 
CAGTTATTCT 
GAAAAGACTA 
ACCTTCAAAT 
ACACTACAGA 
GATTATGGCA 
TAGATACCAA 
AACAAAGGTC 
ATGAGACACA 
GTTTTTTCAT 
TTGATCAGGA 
TTAGAACAAC 
ATTTTAAGCC 
AAAATAATAG 
TTTAAATTTA 
TACATTTTAT 
AAAAGGCATA 
CTGATGTTTC 
TAATTATCAT 
GTGGATAAGT 
CGGGCATGGT 
CTGAGGTCGG 
ATACAAAATT 
GGCAGGAGAA 
GTACTCCAGC 



CTGTAGTGAT 
TAACCTTTTG 
TTAAAGGTCA 
GGTTGCCACT 
TTGTTACTGA 
TGGCCATTGT 
CTGAGGAGCT 
ATTTCTCATT 
GTGTGTTTGC 
TCCTCTCCAT 
CTTTGACAAT 
TTGCCAGGTG 
GCCCAGATAT 
CATGCCTCTT 
TCGTCATCAC 
CCAGGTGGTT 
TTGTACCAGA 
GGGGAGACAC 
TGGTGATAAA 
TTTCTAAGAA 
ATQAGTCGCA 
GACAATTACT 
TGAAGACTGA 
TGTCTATACT 
AAGAGGAGAG 
TTTAGATAAC 
AGTGGGGATT 
TCCAGGAGTT 
TCATTATCAG 
AAGTGTATAA 
CACCTGTTTC 
TTCAAATTAC 
TCCCATATCT 
TGGCTATTTT 
ATTCTTCTGT 
TATTAACATA 
TAGAAAATTT 
TCAACTTGCA 
QTTTGTGTTC 
GGCTTACATC 
GAGTTCTAGA 
AGCTGGGCAT 
TTGCTTGAAC 
CTGGGTGACA 



GGTCCTAAAT 
CAAGCTCACA 
AACGCAOAAC 
GGCTTTTTAT 
AGAAGTAGAA 
CACCATTGGC 
GCTGCTTTCA 
AGCAGTTCCG 
TGTCTCCAGG 
GATTCATGTC 
GATAATGCTC 
GCTTTTTATT 
GCATCGTCCT 
CATGGTTGCC 
TCTGACTGGA 
TAGAATAATG 
AGAAGATAAG 
AAAATAGGGA 
CAAAAGGAGT 
ATTTAGTTAT 
CAATTCTTGA 
ATGTGGTCAT 
GATTTTTCTG 
GTGAAAAGTG 
AAAGAAATTT 
AAACACTCAT 
GTTGAATACA 
ATGTTTAAGT 
GAAGTTTTAG 
CACATCTTAG 
ACTAATAACT 
ATTATCAACA 
GTAATCATAT 
TACACGATGA 
TAAAATATCT 
AAAATTGCAA 
CCACTTCTAT 
AAAGAGACAA 
AGAAGATGTT 
TGTAATCCCA 
CCAGCCTGAC 
GGTGGCACAT 
CCGGGAGGCG 
AAGTCAGACT 



540 
€00 
€60 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
25B0 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 



50 



Seq ID NO: 34 pro^n sequence^ 
Protein Accession ft: NP 055146 



55 



60 



1 

I 

MVRKPWSTI 
GIFISPKGVL 
LPAFVRVWVE 
SMSVSWSARI 
YGMYAYAGWF 
NAVAVTFSER 
RKHTPLPAVI 
FKVPLFIPAL 
SEKITRTLQI 



11 
I 

SKGGYLQGNV 
QNTGSVGMSL 
LLIIRPAATA 
QIFLTFCKLT 
YLNFVPBEVB 
LLGNPSLAVP 
VLHPLTMIML 
PSFTCLFMVA 
ILEWPEEDK 



21 
I 

NGRLPSLGWK 
TIWTVCGVLS 
VISLAFGRYI 
AILIIIVPGV 
NPBKTIPLAI 
IFVALSCFGS 
PSGDLDSLLN 
LSLYSDPFST 
L 



31 
I 

EPPGQSKVQL 
LFGALSYAEL 
LEPPFIQCEI 
MQLIKGQTQN 
CISMAIVTIG 
MNGGVFAVSR 
PLSPARWLPI 
GIGPVITLTG 



41 
I 

KRKVTLLRGV 
GTTIKKSGGH 
PKLAIKLITA 
PKDAFSGRDS 
YVLTNVAYFT 
LPYVASREGH 
GLAVAGLIYL 
VPAYYLPIIW 



51 
I 

SIIIGTIIGA 60 

YTYILBVPGP 120 

VGITWMVLN 180 

SITRLPLAFY 240 

TINAE HULLS 300 

LPBILSMXHV 360 

RYKCPDMHRP 420 

DKKPRWPRIM 480 



65 fieq ID NO: 35 Nucleotide sequence: 
Nucleic Acid Accession #: NMJJ02422 

Coding sequence: 64.. 1497 (underlined sequences correspond to start and stop codons) 



70 l 11 21 31 41 51 

I I I I 1 I 

ACAAGGAGGC AGGCAAGACA GCAAGGCATA GAGACAACAT AGAGCTAAGT AAAGCCAGTG 60 

GAAATGAAGA GTCTTCCAAT CCTACTGTTG CTGTGCGTGG CAGTTTGCTC AGCCTATCCA 120 

TTGGATGGAG CTGCAAGGGG TGAGGACACC AGCATGAACC TTGTTCAGAA ATATCTAGAA 180 

75 AACTACTACG ACCTCAAAAA AGATGTGAAA CAGTTTGTTA GGAGAAAGGA CAGTGGTCCT 240 

GTTGTTAAAA AAATCCGAGA AATGCAGAAG TTCCTTGGAT TGGAGGTGAC GGGGAAGCTG 300 
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GACTCCGACA CTCTGGAGOT GATGCGCAAG CCCAQOTGTO QAGTTCCTGA TSTTGGTCAC 360 

TTCAGAACCT TTCCTGGCAT CCCGAAGTGG AGGAAAACCC ACCTTACATA CAGGATTOTG 420 

AATTATACAC CAGATTTGCC AAAAGATGCT GTTGATTCTG CTGTTGAGAA AGCTCTGAAA 480 

GTCTGG6AAG AGGTGACTCC ACTCACATTC TCCAGGCTGT ATGAAGGAGA GGCTQATATA 540 

ATGATCTCTT TTQCAGTTAG AGAACATQQA GACTTTTACC CTTTTGATGG ACCTGGAAAT 600 

GTTTTGGCCC ATGCCTATGC CCCTGGOCCA GGGATTAATG GAGATGCCCA CTTTOATGAT 660 

GATGAACAAT GGACAAAGGA TACAACAGGG ACCAATTTAT TTCTCGTTGC TGCTCATGAA 720 

ATTGGCCACT CCCTGGGTCT CTTTCACTCA GCCAACACTQ AAGCTTTGAT GTACCCACTC 780 

TATCACTCAC TCACAGAOCT GACTOCGTTC CGCCTGTCTC AAGATGATAT AAATGGCATT 840 

CAGTCCCTCr ATGGACCTCC CCCTGACTCC CCTGAGACCC CCCTGGTACC CACGGAACCT 900 

OTCCCTCCAG AACCTGGGAC GCCAGCCAAC TGTGATCCTG CTTTGTCCTT TGATGCTGTC 960 

AGCACTCTGA GGGGAQAAAT CCTGATCTTT AAAGACAGGC ACTTTTGGOG CAAATCCCTC 1020 

AGGAAGCTTG AACCTGAATT GCATTTOATC TCTTCATTTT GGCCATCTCT TCCTTCAGGC 1080 

GTGGATGCCG CATATGAAGT TACTAGCAAG GACCTOGTTT TCATTTTTAA AGGAAATCAA 1140 

TTCTGGGCCA TCAOAGGAAA TGAGGTACGA GCTGGATACC CAAGAGGCAT CCACACCCTA 1200 

GGTTTCCCTC CAACCGTGAG GAAAATCGAT GCAGCCATTT CTGATAAGGA AAAGAACAAA 1260 

ACATATTTCT TTGTAGAGGA CAAATACTGG AGATTTGATG AGAAGAGAAA TTCCATGGAG 1320 

CCAGGCTTTC CCAAGCAAAT AGCTGAAGAC TTTCCAGGGA TTGACTCAAA GATTGATGCT 1380 

GTTTTTGAAG A AT T T GGGTT CTTTTATTTC TTTACTGGAT CTTCACAGTT GGAGTTTGAC 1440 

CCAAATGCAA AGAAAGTGAC ACACACTTTG AAGAGTAACA GCTGGCTTAA TTOT^SfiAAG 1500 

AGATATGTAG AAGGCACAAT ATGGGCACTT TAAATGAAQC TAATAATTCT TCACCTAAGT 1560 

CTCTGTGAAT TGAAATGTTC GTTTTCTCCT GCCTGTGCTG TGACTOGAGT CACACTCAAG 1620 

GGAACTTGAG CGTGAATCTG TATCTTGCCG GTCATTTTTA TGTTATTACA GGGCATTCAA 1680 

ATGGGCTGCT GCTTAGCTTQ CACCTTGTCA CATAGAGTGA TCTTTCCCAA GAGAAGGGGA 1740 

AGCACTCGTG TGCAACAGAC AAGTGACTGT ATCTGTGTAG ACTATTTGCT TATTTAATAA 1800 
AGACGATTTG TCAGTTGTTT T 



Seq ID NO: 36 Protein sequence j 
Protein Accession #: NP_002413 



1 11 21 31 41 51 

I I I I t ! 

MKSLPILLLL CVAVCSAYPL DGAARGSDTS MNLVQKYLEN YYDLKKDVKQ FVRHKDSGPV 60 

VKKIREMQKP LGLEVTGKLD SDTLBVMRKP RCGVPDVGHF RTFPGIPKWR KTHtiTYRXVN 120 

YTPDLPKDAV DSAVEKALKV WEEVTPLTFS RLYEGEADIM IfiFAVREHGD PYPFDGPGNV 180 

IAHAYAPGPG INGDAHFDDD EQWTKDTTGT NLFLVAAHEI GHSLGLFHSA NTKALMYPLY 240 

HSLTDLTRFR LSQDDINGIQ SLYGPPPDSP BTPLVPTEPV PPEPGTPANC DPAbSFDAVS 300 

TLRGEILIFK DRHFWRKSLR KLBPBLHLIS SFWPSLPSGV DAAYEVTSKD LVFIPKGNQF 360 

WAIRGNEVRA GYPRGIHTLG PPPTVRKIDA AISDKEKNKT YFFVEDKYWR FDEKRNSMBP 420 
GFPKQIAEDF PGIDSKIDAV FEEFGFPYFP TGSSQLEFDP NAKKVTHTLK SNSWLNC 



Seq ID NOi 37 Nucleotide aeauence} 
Nucleic Acid Accession #: NM_003245 

Coding sequence: 112.. 3624 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I 1 I 

GGACGCACAG GCATTCCCCG CGCCCCTCCA GCCCTCGCCG CCCTCGCCAC CGCTCCCGGC 60 

CGCCGCGCTC CGGTACACAC AGGATCCCTG CTGGGCACCA ACAGCTCCAC CATGGGGCTG 120 

GCCTGGGGAC TAGGOGTCCT GTTCCTGATG CATGTGTGTG GCACCAACCG CATTCCAGAG 180 

TCTGGCGGAG ACAACAGCGT GTTTGACATC TTTGAACTCA CCGGGGCCGC CCGCAAGGGG 240 

TCTGGGCGCC GACTGGTGAA GGGCCCCGAC CCTTCCAGCC CAGCTTTCCG CATCGAGGAT 300 

GCCAACCTGA TCCCCCCTGT GCCTGATGAC AAGTTCCAAG ACCTGGTGGA TGCTGTGCGG 360 

GCAGAAAAGG GTTTCCTCCT TCTGGCATCC CTGAGGCAGA TGAAGAAGAC CCGGGGCACG 420 

CTGCTGGCCC TGGAGCGGAA AGACCACTCT GGCCAGGTCT TCAGCGTGGT GTCCAATGGC 480 

AAGGCGGGCA CCCTGGACCT CAGOCTGACC GTCCAAGGAA AGCAGCACGT GGTGTCTGTG 540 

GAAGAAGCTC TCCTGGCAAC CGGCCAGTGG AAGAGCATCA CCCTGTTTGT GCAGGAAGAC 600 

AGGGCCCAGC TGTACATCGA CTGTGAAAAG ATGGAGAATG CTGAGTTGGA CGTCCCCATC 660 

CAAAGCGTCT TCACCAGAGA CCTGGCCAGC ATCGCCAGAC TCCGCATCGC AAAGGGGGGC 720 

GTCAATGACA ATTTCCAGGG GGTGCTGCAG AATGTGAGGT TTGTCTTTGG AACCACACCA 780 

GAAGACATCC TCAGGAACAA AGGCTGCTCC AGCTCTACCA GTGTCCTCCT CACCCTTGAC 840 

AACAACGTGG TGAATGGTTC CAGCCCTGCC ATCCGCACTA ACTACATTGG CCACAAGACA 900 

AAGGACTTGC AAGCCATCTG CGGCATCTCC TGTGATGAGC TGTCCAGCAT GGTCCTGGAA 960 

CTCAGGGGCC TGCOCACCAT TGTGACCACG CTGCAGGACA GCATCCGCAA AOTGACTGAA 1020 

GAGAACAAAG AGTTGGCCAA TGAGCTGAGG CGGCCTCCCC TATGCTATCA CAACGGAGTT 1080 

CAGTACAGAA ATAACGAGGA ATGGACTGTT GATAGCTGCA CTGAGTGTCA CTGTCAGAAC 1140 

TCAGTTACCA TCTGCAAAAA GGTGTCCTGC CCCATCATGC CCTGCTCCAA TGCCACAGTT 1200 

CCTGATGGAG AATGCTGTCC TCGCTGTTGG CCCAGCGACT CTGCGGACGA TGGCTGGTCT 1260 

CCATGGTCCG AGTGGACCTC CTGTTCTACG AGCTGTGGCA ATGGAATTCA GCAGCGCGGC 1320 

CGCTCCTGCG ATAGCCTCAA CAACCGATGT GAGGGCTCCT CGGTCCAGAC ACGGACCTGC 1380 

CACATTCAGG AGTGTGACAA AAGATTTAAA CAGGATGGTG GCTGGAGCCA CTGGTCCCCG 1440 

TGGTCATCTT GTTCTGTGAC ATGTGGTGAT GGTGTGATCA CAAGGATCOG GCTCTGCAAC 1500 
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4 

TCTCCCAGCC CCCAGATGAA TGGGAAACCC TGTGAAGGCG AAGCGCGGQA OACCAAAGCC 1560 

TGCAAGAAAG ACGCCTGCCC CATCAATGGA OGCTGGGGTC CTTGGTCACC ATGGGACATC 1620 

TCTTCTGTCA CCTGTGGAGG AGGGGTACAG AAAOGTAGTC QTCTCTGCAA CAACCCCGCA 1680 

CCCCAGTTTG GAGGCAAGGA CTGCGTTQGT GATGTAACAQ AAAACCAGAT CTGCAACAAG 1740 

5 CAGGACTGTC CAATTGATCG ATGCCTGTCC AATCCCTGCT TTGCCQQGGT GAAGTGTACT 1800 

AGCTACCCTG ATGGCAOCTG QAAATGTG3T GCTTGTCCCC CTGGTTACAQ TGGAAATGGC 1860 

ATCCAGTGCA CAGATGTTGA TGAGTGCAAA GAAGTGCCTG ATGCCTGCTT CAACCACAAT 1920 

GGAGAGCACC GGTGTGAGAA CACGGACCCC GGCTACAACT GCCTGCCCTG CCCCCCACQC 1980 

TTCACCGGCT CACAGCCCTT CGGCCAGGGT GTCGAACATG CCACGGCCAA CAAACAOGTG 2040 

10 TGCAAGCCCC GTAACCCCTG CACGGATGGG ACCCACGACT GCAACAAGAA CGCCAAGTGC 2100 

AACTACCTGG GCCACTATAG CGACCCCATG TACCGCTGCG AGTGCAAGCC TGGCTACGCT 2160 

GGCAATGGCA TCATCTGCGG GGAGGACACA GACCTGGATG GCTGGCCCAA TGAGAACCTG 2220 

GTGTGCGTGG CCAATGCGAC TTACCACTGC AAAAAGGATA ATTGCCCCAA CCTTCCCAAC 2280 

TCAGGGCAGG AAGACTATGA CAAGGATGGA ATTGGTGATG CCTGTGATGA TGACGATGAC 2340 

15 AATGATAAAA TTCCAGATGA CAGGGACAAC TOTCCATTCC ATTACAACCC AGCTCAGTAT 2400 

GACTATGACA GAGATGATOT GGGAGACCGC TGTGACAACT GTCCCTACAA CCACAACCCA 2460 

GATCAGGCAG ACACAGACAA CAATGGGGAA GGAGACGCCT GTGCTGCAGA CATTGATGGA 2520 

GACGGTATCC TCAATGAACG GGACAACTGC CAGTACGTCT ACAATGTGGA CCAGAGAGAC 2560 

ACTGATATGG ATGGGGTTGG AGATCAGTGT GACAATTGCC CCTTGGAACA CAATCCGOAT 2640 

20 CAGCTGGACT CTGACTCAGA CCGCATTGGA GATACCTGTG ACAACAATCA GGATATTGAT 2700 

GAAGATGGCC ACCAGAACAA TCTGGACAAC TGTCCCTATG TGCCCAATGC CAACCAGGCT 2760 

GACCATGACA AAGATGGCAA ' GGGAGATGCC TGTGACCACG ATGATGACAA CGATGGCATT 2820 

CCTGATGACA AGGACAACTG CAGACTOGTG CCCAATCCCG ACCAGAAGGA CTCTGACGGC 2880 

GATGGTCGAG GTGATGCCTG CAAAGATGAT TTTGACCATG ACAGTGTGCC AGACATCGAT 2940 

25 GACATCTGTC CTGAGAATGT TGACATCAGT GAGACCGATT TCGGCCGATT CCAGATGATT 3000 

CCTCTGGACC CCAAAGGGAC ATCCCAAAAT GACCCTAACT GGGTTGTACG CCATCAGGGT 3060 

AAAGAACTCG TCCAGACTGT CAACTGTGAT CCTGGACTOG CTGTAGGTTA TGATGAGTTT 3120 

AATGCTGTGG ACTTCAGTGG CACCTTCTTC ATCAACACCG AAAGGGACGA TGACTATGCT 3180 

GGATTTGTCT TTGGCTACCA GTCCAGCAGC CGCTTTTATG TTGTGATGTG GAAGCAAGTC 3240 

30 ACCCAGTCCT ACTGGGACAC CAACCCCACG AGGGCTCAGG GATACTCGGG CCTTTCTGTG 3300 

AAAGTTGTAA ACTCCACCAC AGGGCCTGGC GAGCACCTGC GGAACGCCCT GTGGCACACA 3360 

GGAAACACCC CTGGCCAQGT GCGCACCCTG TGGCATGACC CTCGTCACAT AGGCTGGAAA 3420 

GATTTCACCG CCTACAGATG GCGTCTCAGC CACAGGCCAA AGACGGGTTT CATTAGAGTG 3480 

GTGATGTATG AAGGGAAGAA AATCATGGCT GACTCAGGAC CCATCTATGA TAAAACCTAT 3540 

35 GCTGGTGGTA GACTAGGGTT GTTTGTCTTC TCTCAAGAAA TGGTGTTCTT CTCTGACCTG 3600 

AAATACGAAT GTAGAGATCC CTAATCATCA AATTGTTGAT TGAAAGACTG ATCATAAACC 3660 

AATGCTGGTA TTGCACCTTC TGGAACTATG GGCTTGAGAA AACCCCCAGG ATCACTTCTC 3720 

CTTGGCTTCC TTCTTTTCTG TGCTTGCATC AGTGTGGACT CCTAGAACGT GCGACCTGCC 3780 

TCAAGAAAAT GCAGTTTTCA AAAACAGACT CATCAGCATT CAGCCTCCAA TGAATAAGAC 3840 

40 ATCTTCCAAG CATATAAACA ATTGCTTTGG TTTCCTTTTG AAAAAGCATC TACTTGCTTC 3900 

AGTTGGGAAG GTGCCCATTC CACTCTGCCT TTGTCACAGA GCAGGGTGCT ATTGTGAGGC 3960 

CATCTCTGAG CAGTGGACTC AAAAGCATTT TCAGGCATGT CAGAGAAGGG AGGACTCACT 4020 

AGAATTAGCA AACAAAACCA CCCTGACATC CTCCTTCAGG AACACGGGGA GCAGAGGCCA 4080 

AAGCACTAAG GGGAGGGCGC ATACCCGAGA CGATTGTATG AAGAAAATAT GGAGGAACTG 4140 

45 TTACATGTTC GGTACTAAGT CATTTTCAGG GGATTGAAAG ACTATTGCTG GATTTCATGA 4200 

TGCTGACTGG CGTTAGCTGA TTAACCCATG TAAATAGGCA CTTAAATAGA AGCAGGAAAG 4260 

GGAGACAAAG ACTGGCTTCT GGACTTCCTC CCTGATCCCC ACCCTTACTC ATCACCTTGC 4320 

AGTGGCCAGA ATTAGGGAAT CAGAATCAAA CCAGTGTAAG GCAGTGCTGG CTGCCATTGC 4380 

CTGGTCACAT TGAAATTGGT GGCTTCATTC TAGATGTAGC TTGTGCAGAT GTAGCAGGAA 4440 

50 AATAGGAAAA CCTACCATCT CAGTGAGCAC CAGCTGCCTC CCAAAGGAGG GGCAGCCGTG 4500 

CTTATATTTT TATGGTTACA ATGGCACAAA ATTATTATCA ACCTAACTAA AACATTCCTT 4560 

TTCTCTTTTT TCCGTAATTA CTAGGTAGTT TTCTAATTCT CTCTTTTGGA AGTATGATTT 4620 

TTTTAAAGTC TTTACGATGT AAAATATTTA TTTTTTACTT ATTCTGGAAG ATCTGGCTGA 4680 

AGGATTATTC ATOGAACAGG AAGAAGCGTA AAGACTATCC ATGTCATCTT TGTTGAGAGT 4740 

55 CTTCGTGACT GTAAGATTGT AAATACAGAT TATTTATTAA CTCTGTTCTG CCTGGAAATT 4800 

TAGGCTTCAT ACGGAAAGTG TTTGAGAGCA AGTAGTTGAC ATTTATCAGC AAATCTCTTG 4860 

CAAGAACAGC ACAAGGAAAA TCAGTCTAAT AAGCTGCTCT GCCCCTTGTG CTCAGAGTGG 4920 

ATGTTATGGG ATTCCTTTTT TCTCTGTTTT ATCTTTTCAA GTGGAATTAG TTGGTTATCC 4980 

ATTTGCAAAT GTTTTAAATT GCAAAGAAAG CCATGAGGTC TTCAATACTG TTTTACCCCA 5040 

60 TCCCTTGTGC ATATTTCCAG GGAQAAGGAA AGCATATACA CTTTTTTCTT TCATTTTTCC 5100 

AAAAGAGAAA AAAATGACAA AAGGTGAAAC TTACATACAA ATATTACCTC ATTTGTTGTG 5160 

TGACTGAGTA AAGAATTTTT GGATCAAGCG GAAAGAGTTT AAGTGTCTAA CAAACTTAAA 5220 

GCTACTGTAG TACCTAAAAA GTCAGTGTTO TACATAGCAT AAAAACTCTG CAGAGAAGTA 5280 

TTCCCAATAA GGAAATAGCA TTGAAATGTT AAATACAATT TCTGAAAGTT ATGTTTTTTT 5340 

65 TCTATCATCT GGTATACCAT TGCTTTATTT TTATAAATTA TTTTCTCATT GCCATTGGAA 5400 

TAGAATATTC AGATTGTGTA GATATGCTAT TTAAATAATT TATCAGGAAA TACTGCCTGT 5460 

AGAGTTAGTA TTTCTATTTT TATATAATGT TTGCACACTG AATTGAAGAA TTGTTGGTTT 5520 

TTTCTTTTTT TTGTTTTTTT TTT TTT T T T T TTTTTTTTTG CTTTTGACCT CCCATTTTTA 5580 

CTATTTGCCA ATACCTTTTT CTAGGAATGT GCTTTTTTTT GTACACATTT TTATCCATTT 5640 

70 TACATTCTAA AGCAGTGTAA GTTGTATATT ACTGTTTCTT ATGTACAAGG AACAACAATA 5700 
AATCATATGG AAATTTATAT TT 

8eq ID NO: 38 Protein s equence; 
Protein Accession #: NP_003237 
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1 11 21 

I I I 

KGLAWGLOVL FLMHVCGTNR IPESGGDNSV 
ISDAMLIPPV PDDKFODLVD AVRASKQPLL 
SNGKAGTLDL SLTVQGKQHV V8VEEALLAT 
VPIQSVPTRD LAS I ARLR1A KGGVNDNFQG 
TLDNNWHOS SPAIRTNYIG HKTKDLQAIC 
VTBENKELAN ELRRPPLCYH NGVQYRNNBE 
ATVPDGECCP RCWPSDSADO GWSFWSEWTS 
RTCHIQBCDK RFKQDGGWSH WSPWSSCSVT 
TKACKXDACP INGGWGPWSP WDICSVTCGG 
CNKQDCPIDG CLSNPCFAGV KCTSYPDGSW 
NHNGEHRCEN TDPGVNCIiPC PPRFTGSQPP 
AKCNYLGHYS DPMYRCECKP GYAGNGIICG 
LPNSGQEDYD KDGIGDACDD DDDNDKIPDD 
HNPDQADTDN NGEGDACAAD IDGDGILNER 
NPDQLDSDSD RIGDTCDNNQ DIDEDGHQNN 
DGIPDDKDNC RLVPNPDQKD SDGDGRGDAC 
QMIPLDPKGT SQNDPNWWR HQGKBLVQTV 
DYAGPVFGYQ SSSRFYWMW KQVTQSYWDT 
WHTGNTPGQV RTLWHDPRHI GWKDPTAYRW 
KTYAGGRLGL FVFSQEMVFF SDLKYECRDP 



31 41 51 

I I i 

FDIFKLTGAA RKGSGRRLVK GPDPSSPAFR 60 

LASLRQMKKT RGTLLALERK DHSGQVFSW 120 

GQWKSITLFV QEDRAQLYXD CEKMENAELD 180 

VLQtJVRFVFG TTPEDILRNK GCSSSTSVLL 240 

GISCDELSSM VLELRGLRTI VTTLQDSIRK 300 

WTVDSCTBCH CQNSVTICKK VSCPIMPCSN 360 

CSTSOONGIQ QRGRSCDSW NRCEGSSVQT 420 

CGDGVITRIR LCNSPSPOMN GKPCEGEARE 4 BO 

GVQKRSRLCN NPAPQFGGKD CVGDVTENQI 540 

KCGACPPGYS GNGIQCTDVD ECKEVPDACF 600 

GQGVEHATAN KQVCKPRNPC TDGTHDCNKN 660 

EDTDLDGWPN ENLVCVANAT YHCKKDNCPN 720 

RDNCPFHYNP AQYDYDRDDV GDRCDNCPYK 780 

DNCQYVYNVD QRDTDMDGVG DQCDNCPLEH 840 

LDNCPYVPNA NQADHDKDGK GDACDHDDDN 900 

KDDFDHDSVP DIDDICPENV DISBTDPRRP 960 

NCDPGIAVGY DBFNAVDPSG TFPINTERDD 1020 

NPTRAQGYSG LSVKWNSTT GPGBHLRNAL 1080 

RLSHRPKTGP IRWMYBGKK IMADSGPIYD 1140 



Seq ID NO; 39 Nucleotide, sequence} 
Nucleic Acid Accession fh BC004299 

Coding sequence: 6 9.. 123 5 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CCCGACCCGT GCGAGGGCCA GGTCCGCGCC TGCCCCGCCA GGCGAAGCGA GGCGACCCGC 60 

GTGCGGCCAT fiGCTTCGCTG CTGGGAGCCT ACCCTTGGCC CGAGGGTCTC GAGTGCCCGG 120 

CCCTGGACGC CGAGCTGTCG GATGGACAAT OGCCGCCGGC CGTCCCCCGG CCCCCGGGGG 180 

ACAAOGGCTC CGAGAGCCGT ATCCGGCGGC CCATGAACGC CTTCATGGTT TGGGCCAAGG 240 

ACGAGAGGAA ACGGCTGGCA GTGCAGAACC CGGACCTGCA CAACGCCGAG CTCAGCAAGA 300 

TGCTGGGAAA GTOGTGGAAG GCGCTGACGC TGTCCCAGAA GAGGCCGTAC GTGGACGAGG 360 

CGGAOCGGCT GCGCCTGCAG CACATGCAGG ACTACCCCAA CTACAAGTAC CGGCCGCGCA 420 

GGAAGAAGCA GGCCAAGCGG CTGTGCAAGC GOGTGGACCC GGGCTTCCTT CTGAGCTCCC 480 

TCTCCCGGGA CCAGAACGCC CTGCCGGAGA AGAGAAGCGG CAGCCGGGGG GCGCTGGGGG 540 

AGAAGGAOGA CAGGGGTGAG TACTCCCCCG GCACTGCCCT GCCCAGCCTC CGGGGCTGCT 600 

ACCACGAGGG GCCGGCTGGT GGTGGCGGCG GCGGCACCCC GAGCAGTGTG GACACGTACC 660 

CGTACGGGCT GCCCACACCT CCTGAAATGT CTCCCCTGGA CGTGCTGGAG CCGGAGCAGA 720 

CCTTCTTCTC CTCCCCCTGC , CAGGAGGAGC ATGGCCATCC CCGCCGCATC CCCCACCTGC 780 

CAGGGCACCC GTACTCACCG GAGTACGCCC CAAGCCCTCT CCACTGTAGC CACCCCCTGG 840 

GCTCCCTGGC CCTTGGCCAG TCCCCCGGCG TCTCCATGAT GTCCCCTGTA CCCGGCTGTC 900 

CCCCATCTCC TGCCTATTAC TCCCCGGCCA CCTACCACCC ACTCCACTCC AACCTCCAAG 960 

CCCACCTGGG CCAGCTTTCC CCGCCTCCTG AGCACCCTGG CTTCGACGCC CTGGATCAAC 1020 

TQAGCCAGGT GGAACTCCTG GGGGACATGG ATCGCAATGA ATTCGACCAG TATTTGAACA 1080 

CTCCTGGCCA CCCAGACTCC GCCACAGGGG CCATGGCCCT CAGTGGGCAT GTTCCGGTCT 1140 

CCCAGGTGAC ACCAACGGGT CCCACAGAGA CCAGCCTCAT CTCOGTCCTG GCTGATGCCA 12 00 

OGGCCACGTA CTACAACAGC TACAGTGTGT CATAGAGCTG GAGGCGCCCC GTCCGGTCAQ 1260 

CCCTCGCGCC CTCTCCTTCT TGTGCCTTGA GTGGCAGAGG AGCCGTCCAG CCACACCAGC 1320 

TTTCCTCCCA COGCTCAGGG CAGGOAGGTC TGAACTGCGG CCCCAGAGCC TTTGGCCTAA 1380 

GCTGGACTCT CCTTATCCGA GTGCCGCCTC TATCCCCTTC CCCACGTTCC AGCCCCTGCA 1440 

GCCCACATTT TAAGTATATT CCTTCAAGTG AGTTTTCCTC CAGCCCCTGA GAGTTGCTGT 1500 

CTCCCAGTGG AATGTTCACT GACGTCTTTT CTTGGTAGCC ATCATCGAAA CTAATGGGGG 1560 

GACAGACTTG ATAGCCAAGG TCCCTTCTGG TCCAGTTTTC TGATTTAGGG TTCTCTCAAG 1620 

ATTAATAAAG GAAGATGGGG AAATTTGACT CATTAATGAG CTCGCTAACC TACGATCTGG 1680 

TGATAATTTT GTGTGCACAG CCCAAGGACC ACGAGGCTTT CTGCACTTTC TGCACCCCCT 1740 

TCCAAAGTGA CCACAAAATT TCAAAGGGAC TCATACAATT TGAGAAAAAA CAGTCAACCT 1800 

GATTTGAGAA ATTAACCAGT ATGGCTAACT ATATCACAGA AAATGGGATT GAGTTAAAAC 1860 

TATTTTATTT TAAATATACA TTTTAAAGCA GTTCTTTTTT TTTGTTAATT TGTTTATTAT 1920 

ACACACACTT CAAGAGCCAC CGCGCCCAGC CTACATTTAT AATTTTCATT CTCTTTTACC 1980 

TATAAAATTC AGTGTATTAG TTTCATTACA TAGGAGAAAT TATATTTCTA AACATTTTAT 2040 
GATGTTTAAA AACAAAACAG GCTGTTGTAA AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 40 Protein sequence; 
Protein Accession 8t AAH04299 

I 11 21 31 41 51 

II I I I I 

MASLLGAYPW PEGLECPALD AELSDGQSPP AVPRPPGDKG SESRIRRPMN AFMVWAKDER 60 

KHIAVQNPDL HNAELSKMLG KSWKALTLSQ KRPYVDEAER LRLQHHQDYP NYKYRPRRKK 120 

QAKRLCKRVD PGFLLSSLSR DQNALPEKRS GSRGALGEKE DRGEYSPGTA LPSLRGCYHB 180 
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GPAGGGGGGT PSSVDTYPYG LPTPPBHSPL DVLEPEQTFF 6SPCQEEHGH PRRIPHLPGH 240 

PYSPBYAPSP LHCSHPI/3SL AUGQSPCVSM HSPVPGCPPS PAYYSPATYH PLHSNLQAHL 300 

GQLSPPPBHP GFDALDQLSQ VELLGDMDRN EFDQYLNTPG HPDSATGAMA LSGHVPVSQV 360 
TPTGPTETSL ISVLADATAT YYNSYSVS 



Seq ID NO: 41 N W c;ieot;Lde sequence^ 
Nucleic Acid Accession #; NM_O04449 

Coding sequence: 1..13 99 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I I 

AT£ATTCAGA CTGTCCCGGA CCCAGCAGCT 
GAGGACCAGT CGTTGTTTGA 6TGTGCCTAC 
ACOGCGTCCT CCTCCAGCGA CTATQGACAG 
CAGGATTGGC TGTCTCAACC CCCAGCCAGO 
CAGGTGAATQ GCTCAAGGAA CTCTCCTQAT 
GTGGGCAGCC CAGACACCGT TGGGATGAAC 
CCACCCCCAA ACATGACCAC GAACGAGCGC 
TGGAGTACAG ACCATGTGOO GCAGT6GCT6 
GACGTCAACA TCTTGTTATT CCAGAACATC 
GACGACTTCC AGAGGCTCAC CCCCAGCTAC 
TACCTCAGAG AGACTCCTCT TCCACATTTG 
AACTCTCCAC GGTTAATGCA TGCTAGAAAC 
TCAGCCTGGA CCGGTCACGG CCACCCCACG 
TCCACAGTGC CCAAAACTGA AGACCAGCGT 
CCAACAAGTA GCCGCCTTGC AAATCCAGGC 
CTGGAGCTCC TGTCGGACAG CTCCAACTCC 
GAGTTCAAGA TGACGGATCC CGACGAGGTG 
CCCAACATGA ACTACGATAA GCTCAGCCGC 
ATGACCAAGG TCCATGGGAA GOGCTACGCC 
GCCCTCCAGC CCCACCCCCC GGAQTCATCT 
ATGGGCTCCT ATCACGCCCA CCCACAGAAG 
CTCCCCGTGA CATCTTCCAG TTTTTTTGCT 
GGGGGTATAT ACCCCAACAC TAGGCTCCCC 
TACTAC TAA 



Seq ID NO: 42 Protein sequence: 
Protein Accession #: NP_004440 



31 41 51 

I I I 

CATATCAAGG AAGCCTTATC AGTTGTGAGT 60 

GGAACGCCAC ACCTGGCTAA GACAGAGATG 120 

ACTTCCAAGA TGAGCCCACG CGTCCCTCAG 180 

GTCACCATCA AAATGGAATG TAACCCTAGC 240 

GAATGCAGTG TGGCCAAAGG CGGGAAGATG 300 

TACGGCAGCT ACATGGAGGA GAAGCACATG 360 

AGAGTTATCG TGCCAGCAGA TCCTACGCTA 420 

GAGTGGGCGG TGAAAGAATA TGGCCTTCCA 4 80 

GATGGGAAGG AACTGTGCAA GATGACCAAG 540 

AACGCCGACA TCCTTCTCTC ACATCTCCAC 600 

ACTTCAGATG ATGTTGATAA AGCCTTACAA 660 

ACAGATTTAC CATATGAGCC CCCCAGGAGA 720 

CCCCAGTCGA AAGCTGCTCA ACCATCTCCT 780 

CCTCAGTTAG ATCCTTATCA GATTCTTGGA 840 

AGTGGCCAGA TCCAGCTTTG GCAGTTCCTC 900 

AGCTGCATCA CCTGGGAAGG CACCAACGGG 960 

GCCCGGCGCT GGGGAGAGCG GAAGAGCAAA 1020 

GCCCTCCGTT ACTACTATGA CAAGAACATC 1080 

TACAAGTTCG ACTTCCACGG GATCGCCCAG 1140 

CTGTACAAGT ACCCCTCAGA CCTCCCGTAC 1200 

ATGAACTTTG TGGCGCCCCA CXXTCCAGCC 1260 

GCCCCAAACC CATACTGGAA TTCACCAACT 1320 

ACCAGCCATA TGCCTTCTCA TCTGGGCACT 1380 



1 11 21 

I I I 

MIQTVPDPAA KIKEALSWS BDQSLFECAY 
QDWLSQPPAR VTIKMBCNPS QVNGSRNSPD 
PPPNMTTNER RVIVPADPTL WSTDHVRQWL 
DDFQRLTPSY NADILLSHLH YLRETPLPHL 
SAWTGHGHPT PQSKAAQPSP STVPKTEDQR 
IiELLSDSSNS SCITWBGTNG EFKMTDPDEV 
MTKVKGKRYA YKFDPHGIAQ ALQPHPPESS 
LPVTSSSFFA APNPYWNSPT GGIYPNTRLP 



31 


41 


51 




1 

GTPHLAKTEH 


1 

TASESSDYGQ 


1 

TSKMSPRVPQ 


60 


BCSVAKGGKM 


VGSPDTVGMN 


YGSYMEEKHM 


120 


EWAVKEYOLiP 


DVNILLFQNI 


DGKELCKMTK 


180 


TSDDVDKALQ 


NSPRLMHARN 


TDLFYEPPRR 


240 


PQLDPYQILG 


PTSSRLANPG 


SGQIQItWQFL 


300 


ARRWGBRKSK 


PNMNYDKLSR 


ALRYYYDKNI 


360 


LYKYPSOLPY 


HGSYHAHPQK 


MNFVAPHPPA 


420 


TSHMPSHLGT 


YY 







Seq ID NOi 43 Nucleotide sequence; 
Nucleic Acid Accession #: NM_005100 

Coding sequence: 192.. 5 53 7 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CCTTCTTTTA AGGAGTTTGC CGCGAGCGCG TCTCCTTCAT TCGCAGGCTG GGCQCGTTCG 60 

CAGTOGGCTG GCGGCGAAGG AAGGCGCTCT CGGGACCTCA OGGGCGCGCG TCTTTTGGCT 120 

CTTGCCCCTG TCCCTGCGGC TTGGGGAAAG CGTAACCCGG CGGCTAGGCG CGGGAGAAGT 180 

GCGGAGGAGC Ca3SGQCGCC GGGAGCTCCA CCGAGCAGCG CAGCCCGGAG CAGCCGCCCG 240 

AGGGGAGCTC CACGCCGGCT GAGCCCGAGC CCAGCGGCGG CGGCCCCTCG GCCGAGGCGG 300 

CGCCAGACAC CACCGCGGAC CCCGCCATOG CTGCCTCGGA CCCCGCCACC AAGCTCCTAC 360 

AGAAGAATGG TCAOCTGTCC ACCATCAATG GCGTAGCTGA GCAAGATGAG CTCAGCCTCC 420 

AGGAGGGTGA CCTAAATGGC CAGAAAGGAG CCCTGAACGQ TCAAGGAGCC CTAAACAGCC 4 80 

AGGAGGAAGA AGAAGTCATT GTCACGGAGG TTGGACAGAG AGACTCTGAA GATGTGAGCG 540 

AAAGAGACTC CGATAAAGAG ATGGCTACTA AGTCAGCGGT TGTTCACGAC ATCACAGATG 600 

ATGGGCAGGA GGAGAACCGA AATATOGAAC AGATTCCTTC TTCAGAAAGC AATTTAGAAG 660 

AGCTAACACA ACCCACTGAG TCCCAGGCTA ATGATATTGG ATTTAAGAAG GTGTTTAAGT 720 

TTGTTGGCTT TAAATTCACT GTGAAAAAGG ATAAGACAGA GAAGCCTGAC ACTGTCCAGC 780 
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TACTCACTGT GAAGAAAGAT GAAGGGGAGG 
CCAGCCTTGG GGCTGGAGAA GCAGCATCCA 
AACCCGAAGA GACCCTGAAG CGTGAGCAAA 
CTGGCCAAGC AGTGGAGGAA TGCAAAGAGG 
5 GCAAGTCTGC AGAATCTCCG ACTAGTCCCG 
AATTCTTCAC TCAAGOTTGG GCCGGCTGGC 
AGGATGAAGT GGAAOCTTCA GAGAAGAAAA 
AAGAAGACGG AAAGGCAGAG GTTGCCTCCG 
CACAGGAGCC GGCAGAAAGT GCCCACGAGC 

10 AGCTGCCCTC AGAGGAGCAA GTCAGTGGCT 
CGTTGGCGAC AGAAGTGTTT GATOAGAAAA 
AAGTCCACGT CAGCACCGTG GAGGAGAGAA 
CAGCAGGGTC TGTGCCAGCT GAAGAATTGG 
AACCTGCCAA GGAGCTGGTG AAGCTCAAAG 

15 AGGGAGCTGA CCTCAGTCCT GATGAGAAGG 
GTGAGGTGGA AATGCTGTCA TCACAGGAQA 
AGCTTTTTAC CAGCACTGGC TTAAAAAAQC 
GAGGAGGAGA CGAGGAATCA GGGGAGCACA 
AGGAGGAGCA AAAGGGCGAG AGCTCTGCCT 

20 GTCTGGAAAA GGGCTTAGCC GAGGTGCAGC 
CCGATGGAGA GAAAAAAAGA GAAGGTGTCA 
CGCCCAAGAA GCGTGTTAGA CGGCCTTCGG 
TCAAGAGCGC TACCTTGTCT TCCACCGAGA 
AAGGGAGCGT GGAAGAGCCA AAGCCGGAAG 

25 . CTTGGGAAGC TTTAATTTGT GTGGGATCAT 
CTGATGAGGA AGGGGGACCA AAAGCAATGG 
GAAAAGACAA AGAGACGGGG ACAGACGGGA 
GGCAGGGAAG TTCCTCCCCG GAGCAAGCTG 
CCTGGGAGTC ATTTAAAAGG TTAGTCACGC 

30 AGAAAAGCGA AGACTCCATA GCTGGGTCTG 
CCGGTAAAGA AGAATCCTGG GTCTCAATCA 
GGCCAGATGG GAAACAAGAA CAAGCCCCTG 
AAGATGACTC TGATGTCCCG GCCGTGGTCC 
AGAAAATGGA GGCACAGCAA GCCCAAAAAG 

35 CTGAGGTGTC CAAGGAGCTC AGCGAGAGTC 
ACGGGACGAG GGCAGCTACC ATTATTGAAG 
TGACAGAACC TCTTGAACAA GTAGAAGCTG 
AAAGAGAAGT AATTGCAOAA GAAGAACCCC 
GAGAGGCCCG GGGCGACACQ GTCGTTAGTG 

40 CTGCAGAAAC TGCAGGGCCA TTGGGTTCCG 
AGACCACAGA AATGOTQTCA GCAGTCTCCC 
AGGCCACTCC GGTGCAGGAG GTGGAAGGTG 
GGACTCAAGA GGTCCTCCAG GCAGTGGCAG 
GCACCGGTGG GCCAGAAGAT GTGCTTCAGC 

45 AAGAGCAGGC TGAAGCGTCG GGTCTGAAGA 
CTCAGGAGGC AAAAACTGAG CCTTTTACAC 
AAAGCTTTGA AAAAGCTCCT CAAGTCACAG 
CTTGTCAAGC CGAAACCTTA GCTGGGGTAA 
TCCCCCCTGA CTCGOTGGAA ACCCCTACAG 

50 CCGACTTTGA CGCACCAGGC ACAACCCAGA 
ATGAGGTCGC ATCTGGTACC CAGTCAGGGG 
AAGAGAGGCC TCCAGCACCT TCCAGTTTTG 
AGATGGAAGA CACTCTAGAG CATACAGATA 
TGTCAAAGAC TGAGGGGACT CAAGAGGCTG 

55 TACCATTTTT CGAAGGACTT GAGGGGTCTA 
AGGTCACTGA AGTTGCCCTT AAAGGTGAAG 
ATGCTCTTGA ACTGCAGAGT CACGCTAAGT 
TAGTTCAAGT CGAAAGGGAG AAAACAGAAG 
TTGAGCACGA AACAGCTGTT ACCGTATCTG 

60 TGAATGTGCC CATCATAGAT GGGGCAAAGG 
CCTGCCTAGG TCAAGAGGAG GCAGTATGCA 
CATTCACTCT AACAGCGGCT GCAGAGGAGG 
TAGAAACAGG TGAAACGTTG GAGCCTGCAG 
CTGAAAAAAA TGAAGACTTT GCCGCTCATC 

65 ACTGTCAGGC AAAATCGACA CCAGTGATAG 
CCGACCTGGA AGGAGAGAAA ACCACATCAC 
AGGTTGCTTG CCAGGAGGTC AAAGTGAGTG 
GGATTTTGGA ACTTGAGACC AAAAGCAGTA 
TTGACCAGTT TGTACGTACA GAAGAAACAG 

70 CACAAGCTCA CGTGATAAAA GCTGACAGCC 
GAGAGGAACC TCAGGCCTCT GCACAGGATG 
CAGAGTCAAC CGCAGTGGGA CAAGCACATT 
CAGAAAAGAC CATGACTGTT GAGGTAGAAG 
AGGTOGTCCT CCCATCTGAG GAAGAGGGAG 

75 ATGATGGTCA TGCCTTGTTA GCAGAAAGAA 
ATGAAAAAGG TGATGATGTT GATGACCCTG 



GAGCAGCAGG GGCTGGCGAC CACCAGGACC 640 

AAGAAAGCGA ACCCAAACAA TCTACAGAGA 900 

GCCACGCAGA AATTTCTCCC CCAGCGGAAT 960 

AAGGAGAAGA GAAACAAGAA AAAGAACCTA 1020 

TGACCAGTGA AACAGGATCA ACCTTCAAAA 1060 

GCAAAAAGAC CAGTTTCAGG AAGCCGAAGG 1140 

AGGAACAAGA GCCAGAAAAA GTAGACACAG 1200 

AGAAACTGAC CGCCTCCGAG CAAGCCCACC 1260 

CCCGGTTATC AGCTGAATAT GAGAAAGTTG 1320 

CGCAGGGACC TTCTGAAGAG AAACCTGCTC 1380 

TAGAAGTCCA CCAAGAAGAG GTTGTGGCCG 1440 

CCGAAGAGCA GAAAACGGAG GTGGAAGAAA 1500 

TTGGAATGGA TGCAGAACCT CAGGAAGCCG 1560 

AAACGTGTGT TTCCGGAGAG GACCCTACAC 1620 

TGCTGTCCAA ACCCCCCGAA GGCGTTGTGA 1680 

GAATGAAGGT GCAGGGAAGT CCACTAAAGA 1740 

TTTCTGGAAA GAAACAGAAA GGGAAAAGAG 1800 

CTCAGGTTCC AGCCGATTCT CCGGACAGCC 1860 

CATCCCCTGA GGAGCCCGAG GAGATCACGT 1920 

AGGATGGGGA AGCTGAAGAA GGAGCTACTT I960 

CTCCCTGGGC ATCATTCAAA AAGATGGTGA 2040 

AAAGTGATAA AGAAGATGAG CTGGACAAGG 2100 

GCACAGCCTC TGAAATGCAA GAAGAAATGA 2160 

AACCAAAGCG CAAGGTGGAT ACCTCAGTAT 2220 

CCAAGAAAAG AGCAAGGAGA AGGTCCTCTT 2280 

GAGGAGACCA CCAGAAAGCT GATGAGGCCG 2340 

TCCTTGCTGG TTCCCAAGAA CATGATCCAG 2400 

GAAGCCCTAC CGAAGGGGAG GGCGTTTCCA 2460 

CAAGAAAAAA ATCAAAGTCC AAGCTGGAAG 2520 

GTGTAGAACA TTCCACTCCA GACACTGAAC 2580 

AGAAGTTTAT TCCTGGACGA AGGAAGAAAA 2640 

TTGAAGACGC AGGGCCAACA GGGGCCAACG 2700 

CTCTGTCTGA GTATGATGCT GTAGAAAGGG 2760 

GCGCAGAGCA GCCCGAGCAG AAGGCAGCCA 2820 

AGGTTCATAT GATGGCAGCA GCTGTCGCTG 2880 

AAAGGTCTCC TTCTTGGATA TCTOCTTCAG 2940 

AAGCCGCACT GTTAACTGAG GAGGTATTGG 3000 

CCACGGTTAC TGAACCTCTG CCAGAGAACA 3060 

AGGCGGAATT GACCCCCGAA GCTGTGACAG 3120 

AAGAAGGAAC CGAAGCATCT GCTGCTGAAG 3180 

AGTTAACCGA CTCCCCAGAC ACCACAGAGG 3240 

GCGTACCTGA CATAGAAGAG CAAGAGAGGC 3300 

AAAAAGTGAA AGAGGAATCC CAGCTGCCTG 3360 

CTGTGCAGAG AGCAGAGGCA GAAAGACCAG 3420 

AAGAGACGGA TGTAGTGTTG AAAGTAGATG 3480 

AAGGGAAGGT GGTGGGGCAG ACCACCCCAG 3540 

AGAGCATAGA GTCCAGTGAG CTTGTAACCA 3600 

AATCACAGGA GATGGTGATG GAACAGGCTA 3660 

ACAGTGAGAC TGATGGAAGC ACCCCCGTAG 3720 

AAGACGAGAT TGTGGAAATC CATGAGGAGA 3780 

GCACAGAAGC AGAGGCAGTT CCTGCACAGA 3840 

TGTTCCAGGA AGAAACTAAA GAACAATCAA 3900 

AAGAGGTGTC AGTGGAAACT GTATCCATTC 3960 

ACCAGTATGC TGATGAGAAA ACCAAAGACG 4020 

TAGACACAGG CATAACAGTC AGTCGGOAAA 4080 

GGACAGAAGA AGCTGAATGT AAAAAGGATG 4140 

CTCCTCCATC CCCCGTGOAG AGAGAGATGG 4200 

CAGAGCCAAC CCATGTGAAT GAAGAGAAGC 4260 

AAGAGGTCAG TAAGCAGCTC CTCCAGACAG 4320 

AAGTCAGCAG TTTGGAAGGA AGCCCTCCTC 4380 

CCAAAATTCA AGTTCAGAGC TCTGAGGCAT 4440 

AAAAGGTCTT AGGAGAAACT GCCAACATTT 4500 

GTGCACATTT AGTTCTGGAA GAGAAATCCT 4560 

CAGGGGAAGA TGCTGTGCCC ACAGGGCCCG 4620 

TATCTGCTAC TACCAAGAAA GGCTTAAGTT 4680 

TGAAGTGGAA GTCAGATGAA GTCGATGAGC 4740 

TAGCAATTGA GGATTTAGAG CCTGAAAATG 4800 

AACTTGTCCA AAACATCATC CAGACAGCCG 4860 

CCACCGAAAT GTTGACGTCT GAGTTACAGA 4920 

AGGACGCTGG ACAGGAAACG GAGAAAGAAG 4980 

AAACACCAAT TACTTCAGCC AAAGAGGAGT 5040 

CTGATATTTC CAAAGACATG AGTGAAGCCT 5100 

GTTCCACTGT AAATGATCAG CAGCTGGAAG 5160 

GTGGAGCTGG AACAAAGTCT GTGCCAGAAG 5220 

TAGAGAAGTC ACTAGTTGAA CCGAAAGAAG 5280 

AAAACCAGAA CTCAGCCCTG GCTGATACTG 5340 
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10 



15 



20 



ATGCCTCAGG 
AGGAGGATGC 
CGATCACACC 
AACTTACAGA 
TGTGAAGACA 
AGAACTTTGA 
CAATCCTGAG 
CTACAATTTT 
GAACTGGAGT 
TTATATGTAT 
AGGAAATOTG 
CATAGTGCAG 
CCTAGATGTA 
CATATTTAOT 
AAAAGCATTT 
CAQTGGAGGT 
ACTTGTTATO 
OTTTTATATO 
GACTGCATTC 
TACTGGATTG 
TCATTTGGAT 



AGGCTTAACC 
CCAGGAAQTA 
CCAAGCAjCAO 
ATCTTAAAAC 
AGTAOTAGAA 
GAATTGGAGA 
GC1TCATCAO 
CCCTTGATAA 
TGGCAATACC 
GTTTTAAGTA 
CAGGATACTA 
GCTTGGGGAG 
ACATTCCTGA 
GGTACTCTGA 
TGAAACATAC 
TAGAAAGAAO 
GTTGTTTGGA 
TTATOAAGAA 
ACACATGGCA 
TGTTTGTGCC 
AAGTTGTGAT 



AAAGAGTCCC 
GAATTGCAGG 
GAGGAGTTAC 
ATCATGCAGT 
GAAAATGAAT 
GCAGGCACAT 
GAGCTAGAGC 
CCATATAAAT 
TAGTTCTGCT 
GTCCTCCTGT 
CATGCTTTTT 
CTTTAAGCCT 
TCAAGGTACA 
AATTGGTCAC 
AGAATGTTCT 
TTATATTCTO 
CCGATAAGTG 
AAGAATTGTT 
TGAAATAAGT 
ATATTTGTGC 
TTGACGACTG 



CAGATACAAA 
AAGGAAAAGT 
AGAAACAAGA 
TAAACTCATT 
GCTGCTGCTG 
CAACTGATCT 
CATTTAACAT 
TCTGATTTAA 
TCTGAAACTG 
ATCTATTGTA 
GTATCACACA 
CAGTTATATA 
ATTCTTTAAA 
TTTCCTATTA 
ATTGTCATTG 
GTAGCAAATT 
TGCTTAATCC 
GTAAGTTTTT 
CAGGTTCTTT 
CATTCCTTTA 
ATTTAAATAA 



TGGACCAAAA 
GCACAGTGAA 
GAGAGAATCT 
GTCTGTTTGG 
AGACTGAAGA 
CATTTCTAGA 
TTCCTCTTTC 
GGTCCTAAAT 
GAGTATCATT 
TATTTTTTTC 
GTATATGATG 
ACCCACAAAA 
ATTCACTAAT 
CACGGAGTGT 
GGAAATTTTG 
AACTTTACAT 
TGAGGCAAAG 
GATTCTACTC 
ACAAATGGTA 
AGAACAATGT 
AATATTTGCT 



CAAAAAGAGA 
TCAGATAAAG 
GCAAAGTCAG 
AAGACCAGAA 
CCAGTATTTC 
GAGCCCCTGA 
CAAGACCAAC 
TCTTAACCTG 
CTTTACATAT 
TTAATGTTTA 
GGGCATGTGC 
AACAGAGCCT 
GATTGAGGTC 
GCCAAAACTA 
CTTTCTAACC 
CCTTTTTCCT 
TAGTGAATAT 
TTATATGCTG 
TTTTGATAGA 
TGCAACACAT 
TCACTTAAAA 



5400 
5460 
5520 
5560 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
64B0 
6540 
6600 



25 



Seq ID NO: 44 
Protein Accession #: 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



i 
I 

MGAGSSTEQR 
QLSTINGVAE 



KFTVKKDKTE 
TLKREQSHAE 



AESAHEPRLS 
STVBERTBEQ 
LSPDEKVIiSK 
EESGEHTQVP 
KKREGVTPWA 
EEPKPBEPKR 
ETGTDGILAG 
DSIAGSGVEH 
DVPAWPLSE 
AATIIEERSP 
GDTWSEAEL 
VQEVEGGVPD 
EASGLKKETD 
ETLAGVKSQE 
SGTQSGGTBA 
EGTQEADOVA 
LQSHAKSPPS 
IIDGAKEVSS 
ETLBPAGAHL 
GEKTTSLKWK 
VRTEETATEM 
AVGQAHSDIS 
ALIiAERIEKS 
QEVELQEGKV 



QDELSIiQEGD 
VHDITDDGQB 
KPDTVQbLTV 
ISPPABSGQA 
SPRKPKEDBV 
AEYEKVELPS 
KTBVEETAGS 
PPEGWSEVE 
ADSPDSQEEQ 
SPKKMVTPKK 
KVDTSVSWEA 



STPDTEPGKE 
YDAVEREKME 
SWISASVTEP 
TPBAVTAAET 
IBEQBRRTQB 
WLKVDAQSA 
MVMEQAIPPD 
EAVPAQKERP 
DEKTKDVPFF 
PVEREMWQV 
IiEGSPPPCLG 
VIiEEKSSEKN 
SDEVDEQVAC 
LTSELQTQAH 



LVEPKEDEKG 
HSESDKAITP 



21 
I 

TPAEPEPSGG 
LNGQKGALNG 
ENRNIEQIPS 
KKDEGEGAAG 
VBBCKEEGEE 
EASEKKKEQE 
EEQVSGSQGP 
VPAEBLVGMD 
KLSSQERMKV 
KGESSASSPE 
RVRRPSESDK 
LICVGSSKKR 
SSPEQAGSPT 
ESWVSIKKPI 
AQQAQKGAEQ 
LEQVEAEAAL 
AGPLGSEEGT 
VLQAVAEKVK 
KTEPFTQGKV 
SVETPTDSET 
PAPSSFVFQE 
EGLEGSIDTG 
ERBKTEAEPT 
QEEAVCTKIQ 
EDPAAHPGED 
QEVKVSVAIE 
VIKADSQDAG 
MTVEVEGSTV 
DDVDDPENQN 
QAQEELQKQE 



31 
1 

GPSAEAAPDT 
QGALNSQEBE 
SESNLEELTQ 



PEKVDTEEDG 
SBBKPAPIAT 
ABPQBABPAK 
QGSPLKKLFT 
EPBEITCZiBK 
EDELDKVKSA 
ARRRSSSDEE 
EGEGVSTWES 
PGRRKKRPDG 
PEQKAATBVS 
LTEEVLEREV 
EASAAEETTE 
BESQLPGTGG 
VGQTTPESPE 
DGSTPVADFD 



41 
I 

TADPAIAA8D 
EVIVTEVGQR 
PTESQANDIG 
AGEAASKESE 
ESPTSPVTSB 
KAEVASEKLT 
EVPDEKIBVH 
ELVKLKBTCV 
STGLKKLSGK 
GLAEVQQDGE 
TL8STESTAS 
GGPKAMGGDH 
FKRLVTPRKK 
KQEQAPVEDA 



51 
I 

PATKLLQKNG 



ITVSREKVTE 



VQSSEASFTL 
AVPTGPDCOA 
DLEPENGILE 
QBTEKEGEEP 
NDQQLEEWL 
SAIADTOASG 



IAEEEPPTVT 
MVSAVSQLTD 
PEDVLQPVQR 
KAPQVTESIE 
APGTTQKDEI 
TLEHTDKEVS 
VALKGEGTEE 
TAVTVSEEV3 
TAAAEEBKVL 
KSTPVIVSAT 
LETKSSKLVQ 
QASAQDBTPI 



GLTKESPDTN 



FKKVFKFVGF 
PKQSTEKPBE 
TGSTFKKFFT 
ASEQAHPQEP 
QEBWAEVHV 
SQEDPTQGAD 
KQKGKRGGGD 
AEEGATSDGB 
EMQEEMKG3V 
QKADEAGKDK 
SKSKLEEKSE 
GPTGANEDDS 
MAAAVADGTR 
EPLPEKREAR 
SPDTTEEATP 
AEAERPEEQA 
SSELVTTCQA 
VEIHEENEVA 
VETVSILSKT 
AECKKDDALE 
KQLLQTVNVP 
GETANILETG 
TKKGLSSDLE 
NIIQTAVDQF 
TSAKEBSEST 
TKSVPEDDGH 
GPKQKEKEDA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



Seq ID NO: 45 Nuc?,eot id? sequence; 
Nucleic Acid Accession #t NM_0Q1290 

Coding sequence: 110. .1231 (underlined sequences correspond to start and stop codons) 



1 
I 

GTGAGCGTGT 
TGGACGTTAC 
ACCACATGAC 
CATGGTACAG 
GGATAGTGAC 
ATTAACCCTT 
CATCCCCCGT 
CAAACACTCG 
CATGGTCACC 



11 
I 

GTGCGTGCGT 
TGATACTCTG 
CCCTTCTATT 
CCAGAGTACC 
AACCTCTGGT 
TCATTTTGTT 
TACTTTAGCA 
AAAGAGTCAT 
CAGCACGGGA 



21 
I 

CTACTTTGTA 
TTTAGCTTGA 
CTTCTCCTTT 
GAATCTATGA 
GGGACGCCTT 
TGGAAGATGG 
CTGTGTTTGA 
ACCACAACTC 
AGCCCATGTT 



31 
I 

CTGGGAAGAA 
TTTTCGAAAA 
CGGCCCATTT 
GATGAACAAG 
TGCCACTGAA 
ACCAAAGCGA 
AGGAGGGGTG 
ATCCATCACG 
TACCAAGGTA 



41 
I 

CACAGCCCAT 
GCAGGCAAGA. 
TATAGGAGGC 
AGACTGCAGT 
TTTTTTGAAG 
TACACTATCG 
ACCGACCTGT 
GTGGACTGCG 
TGTACAGAAG 



51 
I 

GTGCTCTGCA 60 

2GTCCAGCAC 120 

ATACACCATA 180 

CTCGCACAGA 240 

ATGACGCCAC 300 

GCAGGACCCT 360 

ATTACATTCT 420 

ACCAGTGTAC 480 

GCAGACTGAT 540 
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CTTGGAOTTC ACCTTTGATC ATCTCATGAG AATCAAAACA TGGCACTTTA CCATTAGACA 600 

ATACCGAGAG TTAGTCCCGA GAAGCATCCT AGCCATGCAT GCACAAGATC CTCAGGTCCT 660 

GGATCAGCTG TCCAAAAACA TCACCAGGAT GGGGCTAACA AACTTCACCC TCAACTACCT 720 

CAGGTTGTGT GTAATATTGG AGCCAATGCA GGAACTGATG TOGAGACATA AAACTTACAA 700 

CCTCAGTCCC CGAGACTGCC TGAAGACCTG CTTGTTTCAG AAGTGGCAGA GGATGGTGGC 840 

TCCGCCAGCA GAACCCACAA GGCAACCAAC AACCAAACGG AGAAAAAGGA AAAATTCCAC 900 

CAGCAGCACT TCCAACAGCA GCGCTGGGAA CAATGCAAAC AGCACTGGCA GCAAGAAGAA 960 

GACCACAGCT GCAAACCTGA GTCTGTCCAG TCAGGTACCT GATGTGATGG TGGTAGGAGA 1020 

GCCAACTCTG ATGGGAGGTG AGTTTGGGGA OGAGGAOGAA AGGCTAATCA CTAGATTAGA 1080 

AAACACGCAA TATGATCCGG CCAACGGCAT GGAOGACGAG GAGGACTTCA ACAATTCACC 1140 

CGCGCTGGGG AACAACAGCC CGTGGAACAG TAAACCTCCC GCCACTCAAG AGACCAAATC 1200 

AGAAAACCCC CCACCCCAGG CTTCCCAAX& AGATGATCGG CACCAGAATC CACTGTGAAT 1260 

AGGCCOGTGG GTGATCATTA CAATTGCAAA TCTTTACTTA CAGGAGAOGA AACAGAAGAG 1320 

ATAAAAACTT TTCCATGCAA ATATCTATTT CTAAACCACA ATGATCTGAT TTTCTTTCTT 1380 

CTTTCTTTTT TTCTAATTGA GAGGATTATT CCCAGTAAGC TTCCATGACC CTTTCTTGGA 1440 

GGCCTTCACA GGTAATACAG ATACTGGCAC TGATTGTAAT TAAAATGAGA GAAAACTCTA IS 00 

GCGCATCTTC TGGCACGGTT TTAACAACGT GTTTGTGTTG AATTTCCTTT TTATGCATCA 1560 

AACGAAGGCC ATATTGTCCA TAAATGCTCA GTGCTCAGGA TCTCATTAAT ATGCCGAACC 1620 

TAACTACAGA TGACTTTTTA ATATTGTAAA ATATTTTCTG CT3TTTGACT TGCATCTGAG 1680 

AGTTTCTTOT TTCAGTAAAA AAAGAAAAGA CAAAAAAATC AGCTTTGGAA AGTAATTTAA 1740 

ATGTACCTTA r m T mT I' CTTTATGTTT TCTTTCATTG GGCAACAGCT AAGAGGGCCC 1800 

AGCAAGGTAA TTTATGGTTG AGCTGATGTC AATTGGTTCT TGTCTTCAGT CGACTCAATT 1860 

TAGCCCAAGT GCTGAAACAA GAAATGTCAT TTTTTTCATC AAAGACACCA GGGCAGATTT 1920 

TTAAGTAAAG AAAGACAATT GGACCCTTAA GAATTTATGC ATTTGTAAAG TTGCTGTTGA 1980 

TCCAA ATATT TTCAAGCCAT GTAATCCATT GGTTTTGTGG GCAGTTTAAT AAACCTGAAC 2040 

CTTTGTGTGT TTTCTAATTG TACCTGAGTT GACCATCCTT TCTTTTTATA GTATATTTCT 2100 

TGTATGATAT TTTGTAAAGC TCTCACCTGG TTCTTTTATG GGGACTTTTC GTTTTTGGGC 2160 

AACTCCAGTG TATTTATGTG AAACTTTATA AGAGAATTAA TTTTTCCATT TGCATATTAA 2220 

TATGTTCCTC CACACATGTA AAGGCACAGT GGCTCOGTGT GTTAAAAAAC AGCTGTATTT 2280 
TATGTATGCT TTACTGATAA GTGTGCCAAT AATAAACTGT GTTAATGACC 



Seq ID NO j 46 Protein sequence; 
Protein Accession #: NP__001281 



1 11 21 31 41 51 

I I I I 1 I 

MSSTPHDPFY SSPFGPFYRR HTPYKVQPEY RIYEMNKRLQ SRTBDSDNLW WDAFATEFFE 60 
DDATLTIiSFC LBDGPKRYTI GRTLIPRYFS TVFEGGVTDL YYILKHSKES YHNSSITVDC 120 
DQCTMVTQHG KPMFTKVCTE GRLILEFTFD DLMRIKTWHF TIRQYRELVP RSILAMHAQD 180 
PQVLDQLSKN ITRMGLTNFT LNYLRLCVTL EPMQELMSRH KTYNLSPRDC LfCTCLFQKWQ 240 
RMVAPPAEPT RQPTTKRRKR KNSTSSTSMS SAGNNANSTG SKKKTTAANL SLSSQVPDVM 300 
WGEPTLMGG BFGDEDERLI TRLENTQYDA ANGMDDEEDF NNSPALGNNS PWNSKPPATQ 360 
ETKSENPPPQ ASQ 

Seq ID NO: 47 Nucleotide sequence; 
Nucleic Acid Accession #: NM_004126 

Coding sequence: 108.. 329 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

1 I I I I I 

GGCACGAGCT COTGCCGGCC TTCAGTTGTT TGGGGACGCG CCGAGCTTCG CCGCTCTTCC 60 

AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCQAAAATG CCTGCCCTTC 120 

ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 

AAGTGAAGTT GCAGAGACAA CAAGTGTCTA AATGTTCTGA AGAAATAAAG AACTATATTG 240 

AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300 

AAGAAAAAGG CAGCTGTGTT ATTTCATAAA TAACTTGGGA GAAACTGCAT CCTAAGTGGA 3G0 

AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 

TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 480 

GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTT AAGA GAGCAGAGAG TATCAGATGT 540 

ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTCAAATA AAGTTTTGTC TT 

Seq ID NO: 48 Protftj^ sequence; 
Protein Accession &: NP_004117 

1 11 21 31 41 51 

I 1 I I I I 

MPAXiHIEDLP EKEKLKMEVE QLRKEVKLQR QQVSKCSEEI KNYIEERSGB DPLVKGIPED 60 
KNPFKEKGSC VIS 



Seq ID NO: 49 Nucleotid e sequence: 
Kucleic Acid Accession #: xm_0518 96 

Coding sequence: 139.. 2388 (underlined sequences correspond to start and stop codons) 
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l 11 21 31 41 si 

I I I I I I 

GTTTTAAAGA CGCTAGAGTG CCAAAGAAGA CTTTGAAGTG TGAAAACATT TCCTGTAATT €0 

GAAACCAAAA TGTCATTTAT AGATCCTTAC CAGCACATTA TAGTGGAGCA CCAGTATTCC 120 

CACAAGTTTA CGGTAGTGGT GTTAOGTGCC ACCAAAGTGA CAAAGGGGGC CTTTGGTGAC 180 

ATGCTTGATA CTCCAGATCC CTATGTGGAA CTTTTTATCT CTACAACCCC TGACAGCAGG 240 

AAGAGAACAA GACATTTCAA TAATGACATA AACCCTGTGT GGAATGAGAC CTTTGAATTT 300 

ATTTTGGATC CTAATCAGGA AAATGTTTTG GAGATTACGT TAATGGATGC CAATTATGTC 360 

ATGGATGAAA CTCTAGGOAC AGCAACATTT ACTGTATCTT CTATGAAGGT GGGAGAAAAG 420 

AAAGAAOTTC CTTTTATTTT CAACCAAGTC ACTGAAATGG TTCTAGAAAT GTCTCTTGAA 480 

GTTTQCTCAT GCCCAGACCT ACGATTTAOT ATGGCTCTGT GTOATCAGGA GAAGACTTTC 540 

AGACAACAGA GAAAAGAACA CATAAGGGAG AGCATGAAGA AACTCTTGGG TCCAAAGAAT 600 

AGTGAAQGAT TGCATTCTGC ACGTGATGTG CCTGTGGTAG CCATATTGGG TTCAGGTGGG 660 

GGTTTCCGAG CCATGGTGGG ATTCTCTGGT GTGATGAAGG CATTATACGA ATCAGGAATT 720 

CTGGATTGTQ CTACCTACGT TGCTGGTCTT TCTGGCTCCA CCTGGTATAT GTCAACCTTG 780 

TATTCTCACC CTGATTTTCC AGAGAAAGGG CCAGAGGAGA TTAATGAAGA ACTAATGAAA 840 

AATOTTAGCC ACAATCCCCT TTTACTTCTC ACACCACAGA AAGTTAAAAG ATATGTTGAG 900 

TCTTTATGGA AGAAGAAAAG CTCTGGACAA CCTGTCACCT TTACTGATAT CTTTGGGATG 960 

TTAATAGGAG AAACACTAAT TCATAATAGA ATGAATACTA CTCTGAGCAG TTTGAAGGAA 1020 

AAAGTTAATA CTGCACAATG CCCTTTACCT CTTTTCACCT GTCTTCATGT CAAACCTGAC 1080 

GTTTCAGAGC TGATGTTTGC AGATTGGGTT GAATTTAGTC CATACGAAAT TGGCATGGCT 1140 

AAATATGGTA CTTTTATGGC TCCOGACTTA TTTGGAAGCA AATTTTTTAT GGGAACAGTC 1200 

GTTAAGAAGT ATGAAGAAAA CCCCTTGCAT TTCTTAATGG GTGTCTGGGG CAGTGCCTTT 1260 

TCCATATTGT TCAACAGAGT TTTGGGCGTT TCTGGTTCAC AAAGCAGAGG CTCCACAATG 1320 

GAGGAAGAAT TAGAAAATAT TACCACAAAG CATATTGTGA GTAATGATAG CTCGGACAGT 1380 

GATGATGAAT CACACGAACC CAAAGGCACT GAAAATGAAG ATGCTGGAAG TGACTATCAA 1440 

AGTGATAATC AAGCAAGTTG GATTCATCGT ATGATAATGG CCTTGGTGAG TGATTCAGCT 1500 

TTATTCAATA CCAGAGAAGG ACGTGCTGGG AAGGTACACA ACTTCATGCT GGGCTTGAAT 1560 

CTCAATACAT CTTATCCACT GTCTCCTTTG AGTGACTTTG CCACACAGGA CTCCTTTGAT 1620 

GATGATGAAC TGGATGCAGC TGTAGCAGAT CCTGATGAAT TTGAGCGAAT ATATGAGCCT 1680 

CTGGATGTCA AAAGTAAAAA GATTCATGTA GTGGACAGTG GGCTCACATT TAACCTGCCG 1740 

TATCCCTTGA TACTGAGACC TCAGAGAGGG GTTGATCTCA TAATCTCCTT TOACTTTTCT 1800 

GCAAGGCCAA GTGACTCTAG TCCTCCGTTC AAGGAACTTC TACTTGCAGA AAAGTGGGCT 1860 

AAAATGAACA AOCTCCCCTT TCCAAAGATT GATCCTTATG TGTTTGATCG GGAAGGGCTG 1920 

AAGGAGTGCT ATGTCTTTAA ACCCAAGAAT CCTGATATGG AGAAAGATTG CCCAACCATC 1980 

ATCCACTTTG TTCTGGCCAA CATCAACTTC AGAAAGTACA GGGCTCCAGG TGTTCCAAGG 2040 

GAAACTGAGG AAGAGAAAGA AATCGCTGAC TTTGATATTT TTGATGACCC AGAATCACCA 2100 

TTTTCAACCT TCAATTTTCA ATATCCAAAT CAAGCATTCA AAAGACTACA TGATCTTATG 2160 

CACTTCAATA CTCTGAACAA CATTGATGTG ATAAAAGAAG CCATGGTTGA AAGCATTGAA 2220 

TATAGAAGAC AGAATCCATC TCGTTGCTCT GTTTCCCTTA GTAATGTTGA GGCAAGAAGA 2280 

TTTTTCAACA AGGAGTTTCT AAGTAAACCC AAAGCATAGT TCATGTACTG GAAATGGCAG 2340 

CAGTTTCTGA TGCTGAGGCA GTTTGCAATC CCATGACAAC TGGATTTAAA AGTACAGTAC 2400 

AGATAGTOGT ACTGATCATG AGAGACTGGC TGATACTCAA AGTTGCAGTT ACTTAGCTGC 2460 

ATGAGAATAA TACTATTATA AGTTAGGTTG ACAAATGATG TTGATTATGT AAGGATATAC 2520 

TTAGCTACAT TTTCAGTCAG TATGAACTJC CTGATACAAA TGTAGGGATA TATACTGTAT 2580 

TTTTAAACAT TTCTCACCAA CTTTCTTATG TGTGTTCTTT TTAAAAATTT TTTTTCTTTT 2640 

AAAATATTTA ACAGTTCAAT CTCAATAAGA CCTCGCATTA TGTATGAATG TTATTCACTG 2700 

ACTAGATTTA TTCATACCAT GAGACAACAC TATTTTTATT TATATATGCA TATATATACA 2760 
TACATGAAAT AAATACATCA ATATAAAAAT 

Seq ID NO: 50 Protein sequence; 
Protein Accession #: XP_051896 

1 11 21 31 41 51 

I I I I I I 

MSFIDPYQHI IVEHQYSHKP TVWLRATKV TKGAFGDMLD TPDPYVBLFI STTPDSRKRT 60 

RHFNNDINPV WNETFEFILD PNQKNVLBIT LMDANYVMDB TLGTATFTVS SMKVGEKKEV 120 

PFIFNQVTEM VLEMSLEVCS CPDLRFSMAL CDQEKTFRQQ RKEHIRESMK KXLGPKNSEG 180 

LHSARDVPW AILGSGGGFR AMVGFSGVNK ALYESGILDC ATYVAQLSGS TWYMSTLYSH 240 

PDFPEKGPBE INEELMKNVS HNPLLLLTPQ KVKRYVESLW KKKSSGQPVT FTDIFGMLIG 300 

' ETLIHNRMNT TLSSLKEKVN TAQCPLPLFT CLHVKPDVSE LMFADWVEFS PYEIGMAKYG 360 

TFMAPDLFGS KFFKGTWKK YEENPLHFLM GVWGSAPSIL FKRVLGVSGS QSRGSTMEEE 420 

LENITTKHIV SNDSSOSDDE SHEPKGTENE DAGSDYQSDN QASWIHRMIM ALVSDSALFN 480 

TREGRAGKVH NFMLGLNLNT SYPLSPLSDF ATQDSFDDDB LDAAVADPDE FERIYBPLDV S40 

KSKKIHWDS GLTFNLPYPL ILRPQRGVDL IISFDFSARP SDSSPPFKEL LLAEKWAKKN 600 

KLPFPKIDPY VFDREGLKEC YVFKPKKPDM EKDCPTIIHP VLANINFRKY KAPGVPRETB 660 

EEK8IADFDI FDDPBSPFST FNFQYPNQAF KRLHDLMKFN TLNNIDVIKE AMVESIEYRR 720 
QNPSRCSVSL SNVEARRFFN KBFLSKPKA 



Seq ID NOt 51 Nucleotide sequence; 
Nucleic Acid Accession #: NM_006528 

Coding sequence: 57.-764 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 

I I I I I I 

GCCGCCAGCG OCTTTCTOGG ACGCCTTGCC CAGCGGGCCG CCOGACCCCC TGCACCATGG 60 

ACCOCGCTOG CCCCCTGGGG CTGTCGATTC TGCTGCTTTT CCTGACGGAG GCTGCACTGQ 120 

GOGATGCTGC TCAGGAGCCA ACAGGAAATA ACGCGGAGAT CTGTCTCCTG CCCCTAGACT 180 

AOGGACCCTG CCGGGCCCTA CTTCTCCGTT ACTACTACGA CAGGTACAOG CAGAGCTGCC 240 

GCCAGTTCCT GTAOGGGGGC TGCGAGGGCA ACGCCAACAA TTTCTACACC TGGGAGGCTT 300 

GCGACGATGC TTGCTGGAGG ATAGAAAAAG TTCCCAAAGT TTGCOGGCTG CAAGTGAGTG 360 

TGGACGACCA GTGTGAGGGG TCCACAGAAA AGTATTTCTT TAATCTAAGT TCCATGACAT 420 

GTGAAAAATT CTTTTCCGGT GGGTGTCACC GGAACCGGAT TGAOAACAGG TTTCCAGATG 460 

AAGCTACTTG TATGGGCTTC TGCGCAOCAA AGAAAATTCC ATCATTTTGC TACAGTCCAA 540 

AAGATGAGGG ACTGTGCTCT GCCAATGTGA CTCGCTATTA TTTTAATCCA AGATACAGAA 600 

CCTGTGATGC TTTCACCTAT ACTGOCTGTG GAGGGAATOA CAATAACTTT GTTAGCAGGG 660 

AGGATTGCAA ACGTGCATGT GCAAAAGCTT TGAAAAAGAA AAAGAAGATG CCAAAGCTTC 720 

GCTTTGCCAG TAGAATCCGG AAAATTCGGA AGAAGCAATT TTAAACATTC TTAATATGTC 760 

ATCTTGTTTG TCTTTATGGC TTATTTGCCT TTATGGTTGT ATCTGAAGAA TAATATGACA 840 

GCATGAGGAA ACAAATCATT GGTGATTTAT TCACCAGTTT TTATTAATAC AAGTCACTTT 900 

TTCAAAAATT TGGATTTTTT TATATATAAC TAGCTGCTAT TCAAATGTGA GTCTACCATT 960 

TTTAATTTAT GGTTCAACTG TTTOTGAGAC GAATTCTTGC AATGCATAAG ATATAAAAGC 1020 

AAATATGACT CACTCATTTC TTGGGGTCGT ATTCCTGATT TCAGAAGAGG ATCATAACTG 1080 

AAACAACATA AGACAATATA ATCATGTGCT TTTAACATAT TTGAGAATAA AAAGGACTAG 1140 
CC 

Seq ID NO: S2 Protein, sequence j 
Protein Accession #: NP_006519 

1 11 21 31 41 51 

I I I I 1 I 

MDPARPLGLS ILLLFLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALIiL RYYYDRYTQS 60 

CRQPLYGGCE GNANNFYTWB ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNLSSM 120 

TCEKFFSGGC HRNRIENRFP DEATCKGFCA PKKIPSFCY3 PKDEGLCSAN VTRYYFNPRY 180 
RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQP 



Seq ID NO: 53 Nucleotide sequence; 
Nucleic Acid Accession #> AA476778 
Coding sequence: no ORP found 



1 11 21 

I I I 

TATTTTTGTA CGTAAAATGA TTCTATTATG 
TGATCCTTCA TTATCACGGT ACACTATTGT 
ACTTTTTTAA AATGAATTTT TTTAAAACAA 
GTATAAAAGA TATTTTTGGC ATTTCTAGGC 
TCACAGATTG TACCAACTAT TAACTATGTT 
GGAAAAAAAT ATGCTGCCTT GGTGCTAATA 
GAAATATAAA CACTTTTAAT GAAAGGGAGG 
CACTTGGATG AAATAAGACC AGCTCTTTAC 
AGACTTAGAC TTTATCCTTA TTGTTGTTAG 
GTGCCTTGGT CTCTCCACAA TCAAATGGAG 
TATTGGGAAA GTGAGATCCT CTCACCATTT 
TTACCAGTAG AAAGACACAG GATGCACAGA 
CTGGAGAAAT TCAGAACCAG GTTCTGAATC 
GCTGGTGATG TGACTTCTCT TCAGGCCATG 
CTGCAGTAAT GGAOGTTTGT GTGAAGAAAT 
TTCCGATTGC TCATTAATTC ACTTTTTTGT 
CATGGTCTTT CTGCCCCTCC AAGCTGATGA 
ACACTTGGTT TGAGAAACCC TGCCCACTTC 
CAGTATTCTC CAACTCCAAA CAAGCTCTAG 
TGAATAAGTG TTATTCTCCA TTATTAATGT 
CACCACACCC AAAAAAAAAA AAAAAAAAAA 



31 41 51 

I I I 

ACTGCCTTTG CATGTAGTAA TATGACAAAG 60 

TTACTTTTCA TCTGTAAATG TTTTATTGTT 120 

TCTAGCCATC ATCAAGGTGC TATAAGAGTT 180 

AAGTATCAGC CAATAAGTAT GTTAGTGATA 240 

AAATAAGTAT TCAOTTTCAT GTGATCTCTG 300 

TTGTATGTAT TTAAATGATC ATCTGACTCA 360 

AAOGGAAGGA CAATTTCCAG TGCACAGAAT 420 

CCTTATTTTT GGATATGCCT TTTTTGGAAG 480 

TGTTGTTAAT ATTCGTTGCT TCAGCCCACG 540 

GATCCCCCAA GCAGCTTCAT TACAGAGTGA 600 

TGCCAAGATA CTCTAAAATG ACATCCAAGT 660 

ATGGGCATGA CCTTCAGCTC ACGAGCACAC 720 

ATCACGATTG CCTTTTGCAT GAAAACATCG 780 

AGCCTAACAY CCTGCOGGTT TTCATGCCCG 840 

GAACTGTGGA GTACAAAATG CTTTGAGTCT 900 

TACTTCTTTC CAAAATGGAA GTGCTGAAGC 960 

AGGGAAGCCT TTGCCAATGG CCCATGGAAG 1020 

CAAAGACCAA AGAGATTAGG AAAAGCCTGG 1080 

AGTGCTCCAG GAAAAGTTAT ATTCAGTATA 1140 

GTTCTGAAAA TATATTATGA ATAAATACAT 1200 
AAAA 



Seq ID NO: 54 Nucleotide sequence s 
Nucleic Acid Accession #.- NMJ>20663 

Coding sequence: 1..645 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I 1 I 

ATGAACTGCA AAGAGGGAAC TGACAGCAGC TGCGGCTGCA GGGGCAACGA CGAGAAGAAG 60 

ATGTTGAAGT GTGTGGTGGT GGGGGACGGT GCCGTGGGGA AAACCTGCCT GCTGATGAGC 120 

TACGCCAACG ACGCCTTCCC AGAGGAATAC GTGCCCACTG TGTTTOACCA CTATGCAGTT 180 

ACTGTGACTG TGGGAGGCAA GCAACACTTG CTCGGACTGT ATGACACCGC GGGACAGGAG 240 

GACTACAACC AGCTGAGGCC ACTCTCCTAC CCCAACACGG ATGTGTTTTT GATCTGCTTC 300 
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TCTOTCGTAA ACCCTGCCTC TTACCACAAT GTCCAGGAGG AATGGGTCCC CGAGCTCAAG 3S0 

GACTGCATGC CTCACGTGCC TTATGTCCTC ATAGGGACCC AGATTGATCT CCGTQATQAC 420 

CCAAAAACCT TOGCC OOlTf GCTQTATATG AAAGAGAAAC CTCTCACTTA CGAGCATGOT 480 

GTGAAGCTCG CAAAAGCOAT CGGAGCACAG TOCTACTTGG AATOTTCAGC TCTGACTCAG 540 

AAAGGTCTCA AAGCGGTTTT TGATGAAGCA ATCCTCACCA TTTTCCACCC CAAGAAAAAG 600 
AAGAAAOGCT OTTCTGAGGG TCACAGCTOC TGTTCAATTA T CTGA 



Seq ID NO: 55 
Protein Accession #: 



NP 065714 



1 11 21 31 41 51 

I I I I I I 

MNCKEGTDSS CGCRGNDEKK MLKCVWGDG AVGKTCLLMS YANDAFPEEY VPTVPDHYAV 
TVTVGGKQHL LGLYDTAGQS DYNQLRPLSY PNTDVFLICF SWNPASYHN VQEBWVPBLK 
DCMPHVPYVL IGTQIOIiRDO PKTLARLLYM KEKPLTYEHG VKLAKAIGAQ CYLECSAI/TQ 
KGLKAVFDEA ILTIPHPKKK KKRCSEGHSC CSII 



60 
120 
180 



Seq ID NO: 56 Nucleotide sequence: 

Nucleic Acid Accession ft: fgeneah prediction 

Coding sequence: 1-546 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

A TQ GCCTTGG GCAGCTCCGC CCCTGTGGCT TTGCAGGGTA ATGCCCACTT CCCTGCTGCT 60 

TTCATGGCTG GCATTAAGTG TCTGTGGCTT TTCCAGGTAG TCCCCCTGGG GCTCCCCGAG 120 

TTGGTGCAAA GGCTCCTGGG TGGAGCTCGA ACTGAAACTC GCTTTGTGCC CGCAGCCCTG 180 

CAGCTCGCCG GTGCCCTCGA CCTGCCCGCT GGGTCCTOTG CCTTTGAAGA GAGCACTTOC 240 

GGCTTTGACT CCGTGTTGGC CTCTCTGCOG TGGATTTTAA ATGAGGAAGG CCAGCAACCT 300 

TTCTGGTCCT CAGGAGACAT GTCTGACTGG GACTACTGGG TTGGCTGGCG GAAOTTAATT 360 

CATTCTCCTC TGAGCACTCC AGGGTOGAGC AGGCAGGTTA GGCTCCAGTT GTTCCAGCTT 420 

CAGTTTGTCA AAGGCCAGAA CTTGGACGTA ACAGTGTACT GCAGGCTCCA GGGCAGTGAG 480 

AAACCCTTTG AAACTGGTTC CATGGTTCCA TTCACCTTCA TGTACTGGAT CCACCATGGA 54 0 
AAGTAG 



Seq ID NO; 57 Protein sequence 

Protein Accession #t fgenesh prediction 

1 11 21 31 41 51 

I I I I I I 

MALGSSAPVA LQGNAHPPAA FMAGIKCLWL FQWPLGLPE LVQRLLGGAR TETRPVPAAL 60 

QLAGALDLPA GSCAFEESTC GPDSVLASLP WILNEEGQQP FWSSGDMSDW DYWVGWRKLI 120 

HSPLSTPGWS RQVRLQLFQL QFVKGQNLDV TVYCRLQGSB KPFETGSMVP FTFMYWIHHQ 180 
K 

Seq ID NO: 58 Nucleotide sequence; 
Nucleic Acid Accession #: XM_050478 

Coding sequence: 27.. 4508 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

I I I 1 1 I 

CCGGCGGCGC CTGAGCCCAG CCGAGGATGG AGAACCGGCC TGGGTCCTTC CAGTACGTCC 60 

CTGTGCAGCT GCAAGGGGGG GCACCCTGGG GCTTCACCCT TAAGGGGGGT CTGGAACACT 120 

GTGAGCCGCT CACAGTGTCT AAGATTGAAG ATGGAGGCAA GGCAGCTTTG TCCCAGAAGA 180 

TGAGGACTGG TGATGAGCTG GTGAATATCA ATGGCACTCC ATTATATGGC TCCCGCCAAG 240 

AGGCCCTCAT TCTCATCAAA GGCTCCTTCC GGATTCTCAA GCTGATTGTC AGGAGGAGGA 300 

ACGCCCCTGT CAGTAGGCCG CACTCATGGC ATGTGGCCAA GCTGCTGGAG GGATGCCCTG 360 

AAGCAGCCAC CACCATGCAT TTCCCTTCTG AAGCCTTCAG CTTGTCCTGG CATTCTGGCT 420 

GCAACACAAG TGACGTGTGT GTGCAGTGGT GTCCACTCTC CCGGCATTGC AGCACCGAGA 480 

AAAGCAGCTC CATTGGCAGC ATGGAGAGCC TGGAGCAACC AGGCCAAGCC ACCTATGAGA 540 

GCCATCTGTT GCCTATTGAC CAGAACATGT ACCCTAACCA GCGTGACTCA GCCTACAGCT 600 

CCTTCTCGGC CAGCTCAAAT GCTTCTGACT GTGCCCTITC CCTCAGGCCA GAGGAGCCAG 660 

CCTCTACAGA CTGCATCATG CAAGGCCCAG GGCCAACTAA GGCCCCCAGT GGCCGGCCTA 720 

ATGTGGCTGA GACCTCAGGA GGTAGTCGGC GCACCAATGG GGGCCACCTQ ACCCCCAGCT 780 

CTCAGATGTC ATCCCGTCCA CAGGAGGGAT ACCAGTCAGG GCCCGCCAAA GCAGTCAGGG 84 0 

GCCCACCACA ACCTCCAGTG AGGCGGGACA GCCTTCAGGC CTCCAGAGCC CAACTCCTCA 900 

ATGGAGAGCA GCGCAGGGCA TCTGAGCCTO TGGTCCCCTT GCCACAGAAG GAGAAACTGA 960 

GCTTAGAGCC TGTGCTACCC GCAAGGAACC CTAATAGGTT CTGTTGCCTC AGTGGGCATG 1020 

ACCAAGTGAC AAGTGAGGGC CATCAGAACT GTGAGTTCAG TCAGCCTCCT GAATCCAGCC 1080 

AACAGGGCTC TGAGCATCTA CTGATGCAGG CCTCAACCAA AGCTGTTGGA TCCCCAAAAG 1140 

CCTGTGACAG AGCTTCCAGC GTGGATTCCA ACCCACTCAA TGAGGCTTCT GCAGAGCTAG 1200 

CTAAGGCTTC TTTTGGCAGA CCTCCACATC TCATAGGACC CACAGGGCAT CGCCATAGTG 1260 

CCCCTGAACA GCTGCTCGCA TCCCACCTGC AGCATGTX3CA CCTTGATACC AGGGGCAGCA 1320 

AAGGGATGGA GCTCCCACCC GTACAGGATG GGCACCAGTG GACTCTGTCC CCTTTGCACA 1380 



206 



WO 02/079492 



PCTAJS02/04915 



GCAGCCACAA AGGGAAGAAA AGTCCATGCC 
GCAAAGAAAG AAAGACCAGA CAAGTGGATG 
AAAGCAQTCC CCCACATGGA GAGGCTGATC 
CAAACAGAAC AAGCAQAGCA GCCAGTGAAT 
5 CCCTTGTTCA ACAAGCCACG GACTGTTCTT 
CAGQTGAAGA AGGGGACAGC GAGCCCAAOG 
GAGGGACCOG GGGCCGCTOG ATCCAAAACC 
TGCGTAATGA AATTCAGAGG AGGAAGGCCC 
AGCTGTGTGA CACTAAGGAG CCAGTGGAAG 

10 TCACTCCCTC TAACACATCT CTTCTATCTT 
AGCTCTTCAA CAAAAGCATG ATGCTCAGAG 
CTGAGAGCCA TGAATCTAGG ACAGGCTTAG 
GCCAGTCCTC TTTGGGCCTG AACACCTGGT 
CTGAGAAAGC ACATGCTCAC TGTGGAGTCC 

15 ATAATTCACA GCCACTTGTG GCAGCAGCCA 
AGGAATTGAA GGCTTCTACT GCTCAAGCTG 
ACAGAAGAAA GTTCTTTGAA GAGAGTAGCA 
TAACCACTCA TAGCAACAAG ACTTTTACCC 
AGCCAATGAG CTCCAGCTGT AGGGAATTGA 

20 CCGCAGACCA ACCATATCAT GCCACAGACC 
CAGAAACTCC CACTTACTCA GAATGTTTTG 
GTAAGCCACT ACACTGTGGT GATTTTGATT 
TTCAAGGAGC TCTAGTCCAT GATCCTTGCA 
TGCTAAAGAG AAATATGATG CCAAATTGCT 

25 TTCGGTGTTC AGTTTGCTAT CATAATCCTC 
CACCTGGCAA CACTTGGAAA CCCAGGAAGC 
GGAATCCAAT AACAGGAAAC AGGAAGACCA 
AGACTAGCTT TTCATGGGCA ACCCCTTTCC 
TGTCAAGCTA CCGAGCAATT TCTTCTCTTG 

30 AAAAATCAGA GGAAACTTCA GTTTATGAGG 
CACTGCGCAG CCGTGCCTTC TCAGAGAGTC 
CCTGGGGGCA GCATAGGAGG GAGCTCTTTA 
TCGGAGCCAG GAAGAAGGCC TTTCCTCCTC 
ACAGGCTCTT TOGTGCAGCC CAGCAGCAGA 

35 AGGAGGAGGA GGAGGAGGAA GAAGAAGAAG 
CAGAGGAGGA GGAAGAGGAG CTGCCACCCC 
GTGCTCTCAA TCCTOAGGAG GTCCTAGAGC 
AGGGCTCGAG ACAGGGTTCA CAAAGTGTCC 
CCAGTGATTT CTTGCCTCCA ATAAGGGGTC 

40 CCCCTTGCTA CTATGGCATT GGTGGGCTTT 
CCGCCAAACA AGAGTTTCAG CACTTTTCGC 
CTTACTCAGC TTATTACAAT ATTTCTGTGG 
ACCAACCTGA GATGGCAGAG ATTGGCCTAG 
AAAAAAAGAT ACAGCTTATC GAAAGCATCA 

45 AGCGAGGGCT GCTAGAGGAC ATCAATGCCA 
ACTTAAAAGC CGTCTGCAAA TCCAATGAAT 
TGGACAAAGT GGTCAACCTG TTGCTGTCAC 
CTCTGAACAG CATCGATTCA GAGGCCAACC 
AGCAGCTGAC GGGGCAGTTG GCAGATGCCA 

50 AGAAGTTGGT GTTTGGCATG GTCTCCCGCT 
AGCACTTTGT CAAGATGAAA TCTGCTCTCA 
TCAAGCTCGG GGAAGAGCAA CTCAAATGTC 
ATTT CTAA TT CTACCAGCAC TCTGCCACAG 
TCAATCTTCT TTGTTAGCAG TTTCTCAGCA 

55 CCTCTACCCT GGATGTCTCT CACTACCCCT 
ACCCTGGGGA AGCCACAAGC TTCTACCCAA 
CACACTCTCC TTCCCACAGT TGCCAAGGGC 
TGCCTTCATT CTGCTTTGTA CTAGGACACC 
ATCATCAACA GCCTCTAAAG GCTCAGAGGG 

60 GGCTTGTGGC CAGCCATTTC TCACAGAGAG 
CAGTTTCAGG GCCTCACCCA AGCTTTGCAG 
AAAAAATGCA AGCAAAGGTT GAGTACCCCC 
ATAGGCTCTA CCCTTACCTT TCCCAGCAGC 
CTGGCTAGTG TGACCCTCTT CCTGTCCTAA 

65 TTTCCTTTAC ATTGCTGGGG GTTACCGCAG 
CATTAATAGC TCTACTAAAA CTGACTTCTA 
TCTTATTGTT ATATTTTAAA TGGCCTTTTG 
TTTTCTTTTT TAACTAATAA GGCGAGAAGA 
AAGGAAAGCA TTTTCTGCAG ATCAGCCTGA 

70 ' TCTCGTGTTG CTCACAACTA CCTGCCTGGA 
TAAAACACAA GATCAAATGA ACAATCCGAA 
CATGGTGGCT CACGCCTGAA ATCCCAGCAC 
GTCAGGAGAT CAAGACCATC CTGGCTAACA 
AAAAATTAGC CAGGTGTGGT GGCACGCACC 

75 AGGAGAATTG CTTGAACCTG GAAGGCAGAG 
CTCCATCCTG GGCAACAGAG TGAGACTTTG 



CCCCTACAGG AGGAACCCAT GACCAGTCCA 1440 

ACAGGTCTTT AGTTTTGGGA CACCAGAGCC 1500 

GACACCCCTC AGAAAAAGGT TTCCTGQACC 1560 

TGGCCAACCA GCAACCCTCT GCCTCTGGCT 1S20 

CAACCACTAA AGCAGCTAGT GGCACAGAGG 1680 

AGTGCAGCCG GATGGGTGGT AGGCGAAGTG 1740 

GGCGGAAGAG TGAGCGTTTT GCTACCAATC 1800 

AGCTCCAGAA AAGCAAGGGT CCCTTGTCAC 1860 

AGACCCAGGA GCCCCCAGAA AOTCCTCCAC 1920 

CATGTAAAAA ACCTCCCAGC CCCAGAGACA 1980 

CTAGGTCTTC CGAGTGCCTC AGCCAAGCCC 2040 

AGGGAOGAAT AAGCCCTGGC CAGAGGCCTG 2100 

GGAAAGCACC TGACCCATCC TCCTCAGACC 2160 

GTGGAGGTCA TTGGAGATGG TCTCCAGAGC 2220 

TGGAAGGCCC TTCCAACCCA GGTGACAACA 2280 

GGGAGGATGC CATCCTCTTG CCTTTTGCAG 2340 

AATOCTTATC TACATCTCAT TTGCCAGGTT 2400 

AGAGACCAAA ACCTATAGAC CAAAACTTCC 2460 

GGCGCCATCC CATGGACCAA TCATATCATT 2520 

AATCATATCA TTCCATGTCA CCCCTTCAGT 2580 

CAAGCAAAGG TCTAGAAAAT TCCATGTGTT 2640 

ACCACAGGAC CTGCTCTTAC TCCTGCAGTG 2700 

TTTATTGTTC TGGGGAAATC TGCCCTGCCT 2760 

ACAACTGCCG GTGCCACCAC CACCAATGCA 2820 

AGCACAGTGC CCTCGAGGAC AGCAGCTTGG 2880 

TGACAGTGCA GGAATTTCCT GGGGACAAAT 2940 

GCCAGTCAGG GAGGGAAATG GCTCATTCCA 3000 

ATCCTTGCCT TGAGAACCCA GCACTGGACT 3060 

ACCTCCTTGG AGACTTCAAA CATGCTTTGA 3120 

AGGGGAGCTC CCTTGCCTCC ATGCCCCACC 3180 

ACATCAGCTT GGCGCCCCAA AGCACCCGGG 3240 

GCAAAGGTGA TGAGACCCAG TCGGATCTTC 3300 

CTOGCCCTCC TCCTCCCAAC TGGGAGAAGT 3360 

AGCAGCAACA GCAGCAGCAG AAGCAACAGG 3420 

AAGAGGAAGA GGAAGAGGAG GAGGAGGAGG 3480 

AGTATTTCAG TTCAGAAACC TCTGGTTCCT 3540 

AGCCACAACC CCTCAGCTTT GGCCACCTGG 3600 

CAGCAGAGCA AGAATCCTTT GCACTCCATT 3660 

ACTTGGGATC TCAACCTGAG CAGGCTCAGC 3720 

GGAGGACATC GGGACAGGAA GCCACTGAAT 3780 

CTCCTTCAGG GGCCCCAGGA ATCCCTACCT 3640 

CCAAGGCAGA GCTGCTGAAC AAACTGAAAG 3900 

GAGAGGAGGA AGTTGACCAT GAACTGGCTC 3960 

GCAGAAAACT TTCTGTCTTG CGGGAGGCCC 4020 

ATTCTGCCCT TGGGGAGGAG GTGGAGGCCA 4080 

TTGAAAAGTA CCACTTGTTT GTTGGGGACC 4140 

TCTCTGGACG ACTGGCCOGG GTGGAGAATG 4200 

AGGAGAAGTT GGTACTGATA GAGAAGAAGC 4260 

AGGAGCTGAA GGAGCACGTG GACCGCCGGG 4320 

ACCTGCCTCA GGACCAGCTC CAAGATTACC 4380 

TCATTGAACA GCGAGAGCTG GAGGAGAAGA 4440 

TCAGGGAGAG TCTACTCCTG GGGCCCAGCA 4500 

CATCCCTGCC CAGCCATGTG GGAAGTGCTT 4560 

AGTAGATAGC AATTAGCAGT TTGTTCCAGC 4620 

TCCCTAGCAG TGGTCCTAAC CAGCTAGGAG 4680 

GGGAGCT6CA GCAAGGTGTG ATCTTAGAAC 4740 

AAGTACTTGC TGCACAGAGA ACCAAGGAAG 4800 

AAAGACATCA AGTACTCATC ACCCACCCAT 4860 

AATCTGCCTT GCAGCTCTAC TCTGCCCCAG 4920 

CTGGCTGCCT TGAGGGCATT CACCTGGCAC 4980 

GGGAAAGCAC AGAGGGAGGA ATTACACTGA 5040 

AGGTGCCCCT TAGGAAGGAA CCAGGTTTAA 5100 

AAGTTCAGGG GAAGAGGCCT ACTCTTAGCC 5160 

GACTTTGGTC CTACCACCTC TTOTTTCATC 5220 

GTGCCTACCC CAGGGCTTCA CCATATGGGC 5280 

GATGTAGGTT TCATTATTGG GGGAGGGGGT 5340 

ATTTTATTTA TTTTTATGTT TTGATTATTT S400 

GGGAAGTTGG AGAGGGAAAA GTTAGCCCAG 5460 

ATCCACCGTG GCTAGGCATA TTCTTGCTCT S520 

TGAATTTAGG AAAGTTGCAG GATACAAGGT 5590 

AATGTTATTA AGAAAACAGT TCCGGCCGGG 5640 

TTTGGGAGGC CGAGGCAGGT GGATCACGAG 5700 

CGGTGAAACC CTATCTCTAC TAAAAATACA 5760 

AGTAGTCCCA GCTACTCGGG AGGCTGAGGC 5820 

ATTGCAGTGA GCTGAGACCA CACCACTGCA 5880 

TCTCAAAAAG AAAGAAAGAA AGAAAGAAAG 5940 
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AAAGAAAGAA AGAAAAGAAA GAAAGAAAGA AAGAAAGAAA ACAGTTCCAT TTACAATAGC 6000 
ATC 

Seq ID NO: 59 Protean sequence: 
Protein Accession #t XPJJ50478 

1 11 21 31 41 51 

I I I I I I 

MENRPGSFQY VPVQLQGGAP WGFTLKGGLE HCEPMVSKI EDGGKAALSQ KMRTGDBLVN 60 

INGTPLYGSR QEALILIKGS FRILKLIVRR RNAPVSRPHS WHVAtOtLBGC PBAATTMHFP 120 

SEAPSLSWHS GCNTSDVCVQ WCPLSRHCST BKSflSIGSMB SLBQPGQATY ESHLLPIDQN 160 

MYPNQRDSAY SSPSASSNAS DCALSLRPEB PASTDCTMQG PGPTKAPSGR PNVAETSGGS 240 

RRTNGGHLTP SSQMSSRPQE GYQS6PAKAV RGPPQPPVRR DSLQASRAQL LNGEQRRASE 300 

PVVPLPQKEK LSLEPVLPAR NPNRFCCLSG HDQVTSEGHQ NCBPSQPPBS 8QQGSBHLLM 360 

QAGTKAVGSP KACDRASSVD SHPLNBASAE LAKASFGRPP HLIGPTGHRH SAPBQUiASH 420 

LQHVHLDTRG SKGMELPPVQ DGHQWTLSPL KSSHKGKKSP CPPTGGTHDQ SSKERKTRQV 480 

DDRSLVLGHQ SQSSPPHGEA DGHPSBKGFL DPNRTSRAAS ELANQQPSAS GSLVQQATDC 540 

SSTTKAASGT EAGEEGDSEP KECSRHGGRR SGGTRGRSZQ NRRKSERFAT NLRMSIQRRK 600 

AQLQKSKGPL SQLCDTKEPV EETQEPPSSP PLTASNTSLL SSCKKPPSPR DKLFNKSMML 660 

RARSSBCLSQ APESHESRTG LEGRISPGQR PGQSSLGLNT WWKAPDPSSS DPEKAHAHCG 720 

VRGGHWRWSP EHNSQPLVAA AMEGPSNPGD NKELKASTAQ AGEDAILLPF ADRRKPPEBS 780 

SKSLSTSHLP GLTTHSNKTP TQRPKPIDQN FQPMSSSCRE LRRHPMDQSY HSADQPYHAT 840 

DQSYHSHSPL QSBTPTYSEC PASKGIiBNSM CCKPLHCGDF DYHRTCSYSC SVOjGALVHDP 900 

CIYCSGEICP ALLKRNMMPN CYMCRCHHHQ CIRCSVCYHN PQHSALBDSS IAPGNTWKPR 960 

KLTVQEFPGD KWNPITGNRK TSQSGREMAH SKTSPSWATP PHPCLENPAL DIiSSYRAISS 1020 

LDIiLGDFKHA LKKSEBTSVY EBGSSLASMP HPLRSRAPSB SHISIAPQST RAWGQHRREL 1080 

FSKGDETQSD LLGARKKAFP PPRPPPPNWE KYRLFRAAQQ QKQQQQQQKQ QEEBEEBEEB 1140 

EEBEEEEEBE EAEBEBBELP PQYPSSETSG SCALUPEEVL EQPQPLSFGH LEGSRQGSQS 1200 

VPAEQESFAL HSSDFLPPIR GHLGSQPEQA QPPCYYGIGG LWRTSGQRAT ESAKQEPQHF 1260 

SPPSGAPGIP TSY8AYYMIS VAKABLLNKL KDQFEMAEIG LGEEEVDHEL AQKKIQLIES 1320 

ISRKLSVLRE AQRGLLEDIN ANSALGEEVE ANLKAVCKSN EPEKYRItPVG DLDKWNLLL 1380 

SLSGRIiARVE NALNSIDSBA NQEKLVLIEK KQQLTGQLAD AKBLKBHVDR REKLVFGMVS 1440 
RYLPQDQLQD YQHFVKMKSA LIIBQRBLEB KIKLGBEQLK CLRBSLLIGP SNF 1 



Seq ID NO: 60 Nucleotide aeouence: 
Nucleic Acid Accession #: NMJU4705 

Coding sequence: 192.. 2489 (underlined sequences correspond to start and stop codonB) 



1 11 21 31 41 51 

1 I I I I I 

GGGAGAAGCT AGGAAAAAAT GTCTTTGAGC TGTGAGATGC TTGTATATTT TGAAAATATG 60 

ATTATATGCA TGTGTTTGTA TTTTATGACT TGGATAATCT OAAAATCAAT TTGCTTTGTC 120 

AATGCTTCCT GGATTAGAAT TCCACTATTT GGTCCCTATC CTAGTCTACT AAAGAAAATT 180 

GAGCGGGAAA CATGGCGGGA AAGTGGCGTT TCATTAATTG CTACTGTAAC TCGTCTAATG 240 

GAGAGGTTGT TAGATTACAG AACTTCTATA AGACTGAACT GAACAAGGAG GAGATGTATA 300 

TACGCTACAT TCACAAACTC TATGATCTGC ATCTCAAAGC ACAGAACTTT ACAGAAGCTG 360 

CATATACCCT CCTCTTATAT GACGAGCTAC TGGAATGGTC TGATCGGCCC CTCAGGGAGT 420 

TCCTGACCTA CCCCATGCAA ACAGAATGGC AGCGCAAAGA GCACCTGCAC CTCACCATCA 480 

TCCAGAACTT TGACAQAGGC AAATGTTGGG AGAATGGCAT TATCTTGTGC CGGAAGATTG 540 

CAGAGCAGTA TGAGAGTTAT TATGACTACA GAAACCTGAG CAAGAIGCGG ATGATGGAAG 600 

CCTCTTTGTA TGACAAAATT ATGGACCAGC AACGTCTTGA ACCAGAGTTC TTCAGAGTTG 660 

GATTTTATGG AAAAAAATTT CCATTTTTCT TAAGAAATAA GGAGTTTGTG TGTCGAGGGC 720 

ATGACTACGA GAGGCTGGAA GCCTTCCAAC AGAGAATGCT GAACGAGTTC CCCCATGCCA 780 

TCGCCATGCA GCACGCCAAC CAGCCCGATG AGACCATCTT CCAGGCAGAA GCTCAGTATT 840 

TGCAGATATA TGCTGTGACT CCCATTCCAO AGAGCCAGGA GGTCCTGCAG AGAQAGGGTG 900 

TTCCOGACAA CATCAAAAGC TTCTATAAAG TGAATCACAT CTGGAAATTC CGCTATGACC 960 

GACCATTTCA CAAAGGCACA .AAAGATAAAG AGAATGAATT CAAGAGTCTC TGGGTGGAGA 1020 

GAACGTCATT ATACTTGGTG CAGAGTTTGC CTGGCATCTC TCGCTGGTTT GAAGTGGAAA 1080 

AGCGTGAAGT GGTAGAAATG AGTCCTCTGG AAAATGCAAT TGAAGTGCTA GAAAATAAGA 1140 

ATCAGCAGCT GAAGACTCTG ATTAGTCAGT GTCAGACAAG ACAGATGCAG AATATTAATC 1200 

CCCTGACTAT GTGCCTGAAT GGAGTTATAG ATGCTGCAGT TAATGGTGGC OTTTCCAGGT 1260 

ATCAAGAGGC ATTCTTTGTC AAAGAATATA TCTTAAGTCA CCCTGAAGAT GGGGAGAAAA 1320 

TTGCACGATT AAGAGAGCTG ATGCTTGAGC AGGCACAGAT TCTGGAATTT GGTTTGGCCG 1380 

TGCATGAGAA GTTTGTACCT CAAGATATGA GACCCCTTCA CAAAAAGCTG GTTGACCAAT 1440 

TCTTTGTGAT GAAGTCGAGC TTAGGGATAC AGGAGTTCTC TGCTTGTATG CAAGCCAGTC 1500 

CTGTCCATTT TCCTAATGGA AGCCCTOGTG TGTGTAGAAA CTCAGCACCT GCTTCTGTGA 1560 

GCCCAGATGG TACCAGGGTA ATTCCTAGAC GCAGCCOGTT AAGTTACCCA OCTGTCAACC 1620 

GATATTCTTC CTCCTCACTG TCCTCACAAG CTTCTGCTGA AGTAAGCAAT ATTACAGGGC 1680 

AATCAGAAAG CTCTGATGAA GTCTTTAACA TGCAGCCAAG TCCATCTACC TCAAGCTTGA 1740 

GTTCTACTCA CTCGGCTTCA CCTAATGTGA CAAGTTCTGC TCCATCGAGT GCCAGAGCTT 1800 

CTCCTTTGTT GTCTGACAAA CACAAACATT CCCGAGAAAA CTCTTGCCTG TCACCAAGAG 1860 

AGAGACCATG CAGTGCCATC TATCCAACAC CTGTGGAGCC TTCGCAGAGG ATGCTGTTTA 1920 

ATCATATTGG AGACGGGGCC TTGCCACGCA GTGACCCAAA TCTCTCTGCA CCTGAAAAAG 1980 

CTTCACCAGC AAGACACACG ACATCAGTAT CCCCCTCGCC TGCCGGGCGA TCTCCATTGA 2040 
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AGGGCTCTGT GCAGTCTTTC ACCCCCTCTC CAGTGGAGTA CCACTCGCCA OGACTCATCT 2100 

CCAACTCCCC TGTCTTGTCG GGCAGCTACA GCAGTGGGAT TTCTTCTCTC AGCCGGTGCA 2160 

GCACGTCGGA AACCTCAGGC TTTGAAAATC AGGTGAATGA ACAGTOQGCC CCCCTGCCGG 2220 

TGCCAGTGCC GGTGCCCGTG CCGA6CTACG GCGGGGAGGA GCCAGTGCGC AAGGAGAGCA 2280 

AGACTCCGCC CCOGTACAGC GTCTACGAGC GGACTCTGCG GCGCCCCGTC CC3GCTACCTC 2340 

ACAGCCTCTC CATCCCCGTC ACQTCGGAGC GGCCOOCGCT GCCCCOCAAO CCTCTGQCAG 2400 

CGCGATCCAG CCACCTGGAG AATOGGGCCC OGAGGACTGA CCCCGGCCCG CGGCCCAGGC 2460 

CCCTGCCCOG CAAGGTCTCT CAGTT ATAAQ TCACTTTTCT ATGTACCTGC QATGCATTCT 2S20 

TTGCCCGTTT ACAAAATAAG AAGTATGATC3 AOAAGACATT TAOTGTAGGC ACTTTAATAA 2560 

CTTACTCAGC TCCTTCGATG AATGGAATTA AAACTTGCTT ATTAAATATC ATGTTGCACA 2640 

ATATTAAAAG TTGCTGATCT AAAACGCCAG ATGTTAAATG AAGTATGGCT GAATTTCATT 2700 

AAAAOGTTTC TCATTTGGAA GTGGTAAATA GTGATAAAGA CTCCTTTTGT ACCTTTTTAT 2760 

GTTCACTTTT TTTTATATAG TTTAATCTTA AAACCAATAC GATATTGTCA AACGATACAA 2820 

TGTGTGACAA TGTTGTATCG TTTTTACTGA ATACTTGATA CTTGGAGAAA GCTTATTAAG 2880 

TCAGTGCACA TCCTAACACA GTGGTCCTTA TTTTAGAAGA CTTCTGTAAA TAAGGCAAGG 2940 

TTTATCAGTG CAGATCATCA GAATTAAAGT TCAAGCAGGC GAGCAAGACA GTATACTTAA 3000 

GGGGTTGCAA AGCTTGGGAC TGGAAATTGT TTTGTTCTTG AAACAAAATA CTTCTTTAAG 3060 

GTTGCTTTTG CTGTTTQACT GCTGTCTACA TTCGTAAAAT TCTATTTTGT GAATTGGTAG 3120 

CTAAATCCCT TACTACCCTG ACACCGTGGT ATCTACTGTA TTTCTTTTCA AGGTGCAATT 31 80 

TGCTTCAGAG TTCCAATCAG CTAGATTAAG CAAGAGGCTC CAGAAGAAAT GTTTACTTGA 3240 

ATTTTGCGCT TCCTTTCTTO AIAGTTTCCT ATATAAAATT TGTCATTGAA CAAGAGCAAA 3300 

TGCTGAAGTA TTAATGAGGC ACAAATGACT GTGCCCCATT AGCAAGAATT CAGGAATCAA 3360 

TACAGACAGT ATTAAATTAA TAGCTTAAGT GAAGAAAAAA AAAAACTTAG TGAAAATGTA 3420 

TTAGCACGAT TAAATGGCAA AAGGACTTAT AAAAGGCAAG GGCATTAACT TTCAGTCCTG 3480 

CACAAAATAA AAAATTCCTC AOGACTCTCC ACTTTTACCA GTGGAGTTTG TCTTAGCTGA 3540 

CCTGTCGTCT TTCTCTTGAA GGAGGATTGC TGTAGACTTC TCTAGCTTGA ATATTGCAAC 3600 

ATAGCATCTT AGGTCTAGAT AGGGATGCTA ATGCCAGTTG TAGAAGTGTG AAAAAAGCAC 3660 

CTTGTATGTA GTAATGTATT TTATATCTTT GTTTTTTCTT TTACTGACTG TTTATAACAC 3720 

TCAATTGACA ATAGATATGA ACTGTATTTT AAATCATACT GTTAAATATT TTCCCTCTTT 3780 

TGTTGGGAAG CTCATTTTAG TTTAACCATG TTTGTTTTGT TGGTAGCTTA CCTGGAAGGC 3840 

AGTGACCACT TTTTTATATT CTCTTAATGA AACCATTCAG CAGGTATATG CTGTTGAGGC 3900 

TGGTTATAGA GGTTTTCTAT AATAAATGTT CAAGTATTTT TGTATATAAC TGGJTAATTT 3960 

TAATAAGAGA TACCATTATG TGTAAAAAAA AGTAAAAATA AACGCAAACA GTTGTTGATG 4020 

CAGTATGATT GTTATAATTA TGCCAAATAC TTTACGTATG GAAAAAGAAT ATTTGTACAT 4080 

ATGTGCTTTT AACAATTCTG CCATATTGAC TTTACAATTT TGAATGTCGG AAAAATTAAT 4140 

ATATGTTAAA TATTTATGTT TAGTGAAAGT GTTCATAATT GAGAAAAGGA ACATATGCAT 4200 

TTTAGCTTTG TATCTTGCAA GTTTTGCAGT CAGAAATTTT TTGAACTAGC TTTTGCTTTT 4260 

GATAACACTT CGTOTTTGTA ACCACATTCA TATATATATA CATATATATG TGAAGCTCCA 4320 

TATTTCTOTT GCTTTAAAGA AOTAAAACCT TCCATTTAAA TAAGATGACA TGCATAAGAT 4380 

AACAAAGCTT CCTTGATTTC CTTTTCCTGT GTAATTTAAT AGATTTGTTG ACTAGTGCTT 4440 

GGGCACATTA TAAATCAGTG TTATTTGCTC TTGGAGCCAT TTTTTAAAAA AAATTTTGGC 4500 

AGTGAGCAGT TGAATTTATC TTGAATTTAT CATGTGTGTG TATTTCTGAA GCAGCTACAT 4S60 

AGCAGAACAT TTTAAGAGAT TCTGTTAGCC CACATGTTCA TGTTGGTTGC TGCTGAATGG 4620 
TAAATATTAA ATAAAATTAC CAGATTAATC TT 



Seq ID NO: 61 Protein sequence ; 
Protein Accession #: NP_055520 



1 11 21 31 41 51 

1 I I I I I 

MAGKWRFINC YCNSSNGEW RLQNFYKTEL NKEEMYIRYI HKLYDLHLKA QNFTEAAYTL 60 

LLYDELLEWS DRPLREFLTY PMQTEWQRKE HLHLTXIQNF DRGKCWBNGI ILCRKIAEQY 120 

BSYYDYRNLS KMRMKKASLY DKIKDQQRLE PBFFRVGFYG KKFPFPLRKK BPVCRGHDYB 180 

RLEAFQQRML NEFPHAIAMQ HANQPDETIF QAEAQYLQIY AVTPIPBSQE VLQRBGVPDN 240 

IKSFYKVNHI WKFRYDRPFH KGTKDKENEP KSLWVERTSL YLVQSLPGIS RWFBVEKREV 300 

VEMSPLBNAI BVLENKNQQL KTLISQCQTR QMQNINPLTM CLNGVIDAAV NGGVSRYQEA 360 

PFVKEYILSH PEDGEKIARL RELMLEQAQI LEFGLAVHEK FVPQDMRPLH KKLVDQFFVM 420 

KSSLGIQEFS ACMQASPVHF PNGSPRVCRN SAPASVSPDG TRVIPRRSPL SYPAVNRYSS 480 

SSLSSQASAE VSNITGQSES SDEVFNMQPS PSTSSLSSTH SASPNVTSSA PSSARASPLlL 540 

SDKHKHSREN 8CLSPRERPC SAIYPTPVBP SQRMLFNH1G DGALPRSDPN LSAPEKASPA 600 

RHTTSVSPSP AGRSPliKGSV QSFTPSPVEY HSPGLISNSP VLSGSYSSGI SSLSRC8TSE 660 

TSGFENQVNB QSAPLPVPVP VPVPSYGGEB PVRKESKTPP PYSVYERTLR RPVPLPHSLS 720 
IPVTSBPPAL PPKPLAARSS HLENGARRTD PGPRPRPLPR KVSQL 



Seq ID NO: 62 Nucleotide sequence: 

Nucleic Acid Accession 8i fgenegh prediction 

Coding sequence: 1..2561 (underlined sequences correspond to start and stop ccdons) 



1 11 21 31 41 51 

I I I I I I 

ATGGACCGAG GCCAGGOTAA GAGGGGCCGC GACGCCCGCA CTTGTTGCGG CGOCGGGCGG 60 

GAAAGGGAGA CTGGACGATC TGAAGCCGGA GAGGAGGAGG GAGAGAGGOG GGCGGTGGGG 120 

GGGGGGCTGA GGAACGCTCG GAGGGGACTG GGAGACGCGG CGCTTATGCA AAGGTGCCTT 180 

CGGCTGCCGG GACAACCCGC CAGCAACCAG GTACAGCTCT CAGAGGTTCC ACAGAGGAAG 240 

CTCAGGGTCC CTGAATCTCC CAGTGTGGCA GAGAAAGTGA AACTTGGTCA CCGATGCCTG 300 
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GAACTGCTGG AGCAOCTGCT CCCAGAGCTC ACOOOGCTGC TCAGCCTCCT GGACCACGAa 360 

TACCTCAGCQ ATACCACCCT GGAAAAGAAG ATGGCCGTGG CCTCCATCCT GCAGAGCCTG 420 

CAGCCCCTTC CAGCAAAGGA GGTCTCCTAC CTGTATGTGA ACACAGCAGA CCTOCACTCG 460 

GGGCCCAGCT TOGTGGAATC CCTCTTTGAA GAATTTGACT GTGACCTGAG T6A0CTT0G6 540 

5 GACATGCCAO AGGATGATGG GGAGCCCAGC AAAGGAGCCA GCCCTGAGCT AGCCAAGAGC 600 

CCAOGCCTGA GAAACGCGGC CGACCTGCCT CCACCGCTCC CCAACAAGCC TCCCCCTOAG 660 

GACTACTATO AAGAGGCCCT TCCTCTGGGA CCCGGCAAGT CGCCTQAGTA CATCAOCTCC 720 

CACAATGOCT GCAGCCCCTC ACACTCGATT GTGGATGGCT ACTATGAGGA OGCAGACAGC 780 

AGCTACCCTQ CAACCAGGGT GAAOGGCGAG CTTAAGAGCT CCTATAATGA CTCTGAOGCA 840 

10 ATGAGCAGCT CCTATGAGTC CTAOGATGAA GAGGAGGAGG AAGGGAAGAG CCCGCAGCCC 900 

CGACACCAGT. GGCCCTCAGA GGAGGCCTCC ATGCACCTGG TOAGGGAATO CAGGATATGT 960 

GCCTTCCTGC TGOGGAAAAA GCGTTTCGGG CAGTGGGCCA AGCAGCTGAC GGTCATCAGG 1020 

GAGGACCAGC TCCTGTGTTA CAAAAGCTCC AAGGATCGGC AGCCACATCT GAGGTTGGCA 1080 

CTGGATACCT GCAGCATCAT CTACGTGCCC AAGGACAGCC GOCACAAGAG GCAOGAGCTG 1140 

15 CGTTTCACCC AGGGGGCTAC OGAGGTCTTG GTGCTGGCAC TGCAGAGCCO AGAGCAGGCC 1200 

GAGGAGTGGC TGAAGGTCAT CCGAOAAGTG AGCAAGOCAG TTGGGGGAGC TGAGGGAGTG 1260 

GAGGTCCCCA GATCCCCAGT CCTCCTGTGC AAGTTGGACC TGGACAAGAG GCTGTCCCAA 1320 

GAGAAGCAGA CCTCAGATTC TGACAGOGTG GGTGTGGGTG ACAACTGTTC TACCCTTGGC 1380 

CGCOGGGAGA CCTGTGATCA OGGCAAAGGG AAGAAGAGCA GCCTQGCAGA ACTGAAGGGC 1440 

20 TCAATGAGCA GGGCTGOGGG CCGCAAGATC ACCCGTATCA TTGGCTTCTC CAAGAAGAAG 1500 

ACACTGGCCG ATGACCTGCA GACGTCCTCC ACCGAGGAGG AGGTTOCCTQ CTGTGGCTAC 1560 

CTGAACGTGC TGGTOAACCA GGGCTGGAAG GAACGCTGGT GCOGCCTGAA GTGCAACACT 1620 

CTGTATTTCC ACAAGGATCA CATGGACCTG CGAACCCATG TGAACGCCAT CGCCCTGCAA 1660 

GGCTGTGAGG 7GGCCCCGGG CTTTGGGCCC CGACACCCAT TTGCCTTCAG GATCCTGGGC 1740 

25 AACCGGCAGG AGGTGGCCAT CTTGGAGGCA AGCTGTTCAG AGGACATGGG T03CTOGCTC 1800 

GGGCTGCTGC TGGTGGAGAT GGGCTCCAGA GTCACTCOGG AGGCGCTGCA CTATGACTAC 1860 

GTGGATGTGG AGACCTTAAC CAGCATCGTC AGTGCTGGGC GCAACTGCTT CCTATATGCA 1920 

AGATCCTGCC AGAATCAGTG GCCTGAGCCC CGAGTCTATG ATGATGTTCC TTATGAAAAG 1980 

ATGCAGGACG AGGAGCCCGA GOGCCCCACA GGGGCCCAGG TGAAGCGTCA CGCCTCCTCC 2040 

30 TGCAGTGAGA AGTCCCATCG TGTGGACCCG CAGGTCAAAG TCAAACGCCA CGCCTCCAGT 2100 

GCCAATCAA7 ACAAGTATGG GAAGAACCGA GCOGAGGAGG A1GCCCGGAG GTACTTGGTA 2160 

GAAAAAGAGA AGCTGGAGAA AGAGAAAGAG ACGATTGGGA CAGAGCTQAT AGCACTGAGA 2220 

CAGGAGAAGA GGGAACTGAA GGAAGCCATT CGGAGCAGCC CAGGAGCAAA ATTAAAGGCT 2280 

CTGGAAGAAG CCGTGGCCAC CCTGGAAGCT CAGTGTCGGG CAAAGGAGGA GCGCCGGATT 2340 

35 GACCTOGAGC TGAAGCTGGT GGCTGTGAAG GAGCGCTTGC AGCAGTCCCT GGCAGGAGGG 2400 

CCAGCCCTGG GGCTCTCCGT GAGCAGCAAG CCCAAGAGTG GGCAACTCTC TGAGGAAGAT 2460 

ACGCTCACCT CCAATGGTGC TCTCTCAGAG AGAACTTCTC TGACCTCATC TACACCAGGG 2520 
CTTCTCAACC CCAACACTAC TGACATTTTG GACCA GTAA 

40 

Seq ID NO: 63 Protein geauence: 

Protein Accession fh fgenesh prediction 



45 1 11 21 31 • 41 51 

I I I I I I 

MDRGQGKRGR DARTCCGAGR ERETGRSEAG EEBGERRAVG RGLRNARRGL GDAALMQRCL 60 

RLPGQPASNQ VQIiSEVPQRK LRVPESPSVA EKVKLGHRCL ELLEQLLPEI, TGLLSLLDHE 120 

YLSDTTLBKK MAVASILQSL QPLPAKBVSY LYVNTADLHS GPSFVESLPE EPDCDLSDLR 180 

50 DMPBDDGEPS KGASPELAKS PRLRNAADLP PPLPNKPPPE DYYEEALPLG PGKSPEYISS 240 

HNGCSPSHSI VDGYYEDADS SYPATRVNGE LKfiSYNDSDA MSSSYSSYDE EEEEGKSPQP 300 

RHQWPSEEAS MHLVRECRIC AFLLRKKRFG QWAKQLTVIR EOQLLCYKSS KDRQPHLRLA 360 

LDTCSIIYVP KDSRHKRHBL RFTQGATEVL VLALQSREQA EEHLKVIREV SKPVGGAEGV 420 

EVPR3PVLLC KLDLDKRLSQ BKQTSDSD3V GVGDKCSTLG RRETCDHGKG KKSSLABLKG 480 

55 SMSRAAGRKX TRIIGFSKKK TLADDLQTSS TEEEVPCCGY UiVLVNQGWK BRWCRLKCNT 540 

LYFHKDHMDL RTHVNAIALQ GCEVAPGFGP RHPFAFRILR NRQEVAILEA SCSBDM6RHL 600 

GLLLVEMGSR VTPEALHYDY VDVBTLTSIV SAGRN8PLYA RSCQNQWPEP RVYDDVPYEK 660 

MQDEEPERPT GAQVKRHASS CSEKSHRVDP QVKVKRHASS ANQYKYGKNR ABEDARRYLV 720 

EKEKLEKEKE TIRTELIALR QEKRELKEAI RS8PGAKLKA LEEAVATLEA QCRAKBERRI 780 

60 DLELKLVAVK BRLQQSLAGG PALGLSVSSK PK6GQLSEED TLTSNGALSB RTSLTSSTPG 840 
LLNPNTTDIL DQ 

Seq IO NO: 64 fflicleotide sequences 
Nucleic Acid Accession #: NM_004126.1 

65 Coding sequence j 108-129 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

70 GGCACGAGCT COTGCCGGCC TTCAGTTGTT TCGGGACGOG CCGAOCTTCG CCGCTCTTCC 60 

AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGOGAAAAT9 CCTGCCCTTC 120 

ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 

AAGTGAAGTT gcagagacaa caagtgtcta AATGTTCTGA AGAAATAAAG AACTATATTG 240 

AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACXXCTTTA 300 

75 AAGAAAAAGG CAGCTGTGTT ATTTCATAAA TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 

AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 
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TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 480 

GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAQA GAGCAGAGAG TATCAGATGT 540 

ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTQAT €00 
GCTTCAAATA AAGTITTGTC TT 

Seq ID NO: 65 Protein, sequence j 
Protein Accession ft: NP_004117 



1 11 * 21 31 41 51 

I I I I I I 

MPAIiHIEDLiP EKBKhKHBVB QLRXKVKLQR QQVSKCSKRI KNYIBERSGB DPLVKGIPED 60 
KMPPKBKGSC VIS 



Seq ID NO: 66 Nucleotide Beauence: 
Nucleic Acid Accession ft: NMJJ03 842.1 

Coding sequence: 1-1236 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGGAACAAC GGGGACAGAA 06CCCCGGCC GCTTCGGGGG CCCGGAAAAG GCACGGCCCA 60 

GGACCCAGGG AGGCGOGGGG AGCCAGGCCT GGGQCCCGGG TCCCCAAGAC CCTTOTGCTC 120 

GTTGTCGCCG CGGTCCTGCT GTTGGTCTCA GCTGAGTCTG CTCTGATCAC CCAACAAGAC 180 

CTAGCTCCCC AGCAGAGAGC GGCCCCACAA CAAAAGAGGT CCAGCCCCTC AGAGGGATTG 240 

TGTCCACCTG GACACCATAT CTCAGAAGAC GGTAGAGATT GCATCTCCTG CAAATATGGA 300 

CAGGACTATA GCACTCACTO GAATGACCTC CTTTTCTGCT TGCGCTGCAC CAGGTGTGAT 360 

TCAGGTGAAG TGGAGCTAAG TCCCTGCACC ACGACCAGAA ACACAQTGTG TCAGTGCGAA 420 

GAAGGCACCT TCCGGGAAGA AGATTCTCCT GAGATGTGCC GGAAGTGCCG CACAGGGTGT 480 

CCCAGAGGGA TGGTCAAGGT CGGTGATTGT ACACCCTGGA GTGACATCGA ATGTGTCCAC 540 

AAAGAATCAG GCATCATCAT AGGAGTCACA GTTGCAGCCG TAGTCTTGAT TGTGGCTGTG 600 

TTTGTTTGCA AGTCTTTACT GTGGAAGAAA GTCCTTCCTT ACCTGAAAGG CATCTGCTCA 660 

GGTGGTGGTG GGGACCCTGA GCGTGTGGAC AGAAGCTCAC AACGACCTGG GGCTGAGGAC 720 

AATGTCCTCA ATGAOATCGT GAOTATCTTG CAGCCCACCC AGGTCCCTGA GCAGGAAATG 780 

GAAGTCCAGG AGCCAGCAGA GCCAACAGGT GTCAACATGT TGTCCCCCGG GGAGTCAGAG 840 

CATCTGCTGG AACCGGCAGA AGCTGAAAGG TCTCAGAGGA GGAGOCTGCT GGTTCCAGCA 900 

AATGAAGGTQ ATCCCACTGA GACTCTGAGA CAGTGCTTCG ATGACTTTOC AGACTTGGTG 960 

CCCTTTOACT CCTGGGAGCC GCTCATGAGG AAGTTGGGCC TCATGGACAA TGAGATAAAG 1020 

GTGGCTAAAG CTCAGGCAGC GGGCCACAGG GACACCTTGT ACACGATGCT GATAAAGTGG 1080 

GTCAACAAAA CCGGGCGAGA TGCCTCTGTC CACACCCTGC TGGATGCCTT GGAGACGCTG 1140 

GGAGAOAGAC TTGCCAAGCA GAAGATTGAG GACCACTTGT TGAGCTCTGG AAAGTTCATG 1200 
TATCTAGAAG GTAATGCAGA CTCTGCCATG TCCTAA 

Seq ID NOi 67 protein sequence: 
Protein Accession ft: NP_003 833.1 

1 11 21 31 41 51 

I I I I I I 

MEQRGQNAPA ASGARKRHGP GPREARGARP GPRVPKTLVL WAAVLLLVS AESALITQQD 60 

IiAPQQRAAPQ QKRSSPSEGL CPPGKKISBD GRDCISCKYG ODYSTHWNDL LFCLRCTRCD 120 

SGBVEIiSPCT TTRNTVCQCE EOTFRBBDSP EMCRKCRTGC PRGMVKVGDC TPWSDIECVH 180 

KESGIIIGVT VAAWLIVAV FVCKSLLWKK VLPYLKGICS GGGGDPERVD RSSQRPGAED 240 

NVLNBIVSIL QPTQVPEQEM KVQBPAEPTG VNKLSPGRS8 HLLEPAEAER SQRRRLLVPA 300 

NEGDPTETLR QCFDDPADLV PFDSWEPLMR KLGLMDNEIK VAKAEAAGHR DTLYTMLIKW 360 
VNKTGRDASV HTLLDALETL GERLAKQKIE DHLLSSGKFH YLBGNADSAM S 

Seq ID NO: 68 Nucleotide sequence: 

Nucleic Acid Accession ft: PGENBSH predicted ORP 

Coding sequence: 361- 2220 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 SI 

I I I 1 I I 

GGCACCATCT GCTCCCTGCC CTGCCCAGAG GGCTTTCACG GACCCAACTG CTCCCAGGAA 60 

TGTCGCTGCC ACAACGGCGG CCTCTGTGAC CGATTCACTG GGCAGTGCCG CTGCGCTCCG 120 

GGTTACACTG GGGATCGGTG CCGGOAGGAG TGCCCGGTGG GCCGCTTTGG GCAGGACTGT 180 

GCTGAGACGT GCGACTGCGC CCCQGACGCC CGTTGCTTCC CGGCCAACGG CGCATGTCTG 240 

TGCGAACACG GCTTCACTGG GGACCGCTGC ACGGATCGCC TCTGCCCCGA CGGCTTCTAC 300 

GGTCTCAGCT GCCAGGCCCC CTGCACCTGC GACCGGGAGC ACAGCCTCAG CTGCCACCCG 360 

ATGAACGGGG AGTGCTCCTG CCTGCCGGGC TGGGCGGGCC TCCACTGCAA CGAGAGCTGC 420 

CCGCAGGACA CGCATGGGCC AGGGTGCCAG GAGCACTGTC TCTGCCTGCA CGGTGGCGTC 480 

TGCCAGGCTA CCAGCGGCCT CTGTCAGTGC GCGCCGGGTT ACACGGGCCC TCACTGTGCT 540 

AGTCTTTGTC CTCCTGACAC CTACGGTGTC AACTGTTCTG CACGCTGCTC ATGTGAAAAT 600 

GCCATCGCCT GCTCACCCAT CGACGGCGAO TGCOTCTGCA AGGAAGCTTG GCAGCGTGGT 660 

AACTGCTCTG TGCCCTGCCC ACCCGGAACC TGGGGCTTCA GTTGCAATGC CAGCTGCCAG 720 

TGTGCCCATG AGOCAGTCTG CAGCCCCCAA ACTGGAGCCT GTACCTGCAC CCCTGGGTGG 780 
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CATGGGGCCC ACTGCCAGCT GCCCTGTCCG AAGOGGCAGT TTGGAGAAOQ TTGTGCCAGT 840 

CGCTGTGACT GTOACCACTC TGATGGCTGT QACCCTOTTC ATGGACGCTG TCAGTGCCAG 900 

GCTGGCTGGA TGGGTGCCCG CTGCCACCTG TCCT0CCCTO AGGGCTTATG GGGAGTCAAC 960 

TGTAGCAACA CCTGCACCTG CAAGAATGGG GGCACCTGTC TCCCTGAGAA TGGCAACTGC 1020 

5 GTGTGTGCAC CCGGATTCCG GGGCCCCTCC TGCCAGAGAT CCTGTCAGCC TGGCCGCTAT 1080 

GGCAAACGCT GTGTGCCCTQ CAAGTGCGCT AACCACTCCT TCTGCCACCC CTCOAACGGG 1140 

ACCTGCTACT GCCTGGCTGG CTGGACAGGC CCOGACTGCT COCAGOGCTG CCCTCTGCGG 1200 

ACATTTGGTG CTAACTGCTC CCAGCCATGC CAGTGTGOTC CTGGAGAAAA GTGCCACCCA 1260 

GAGACTGGGG CCTGTGTATG TCCCCCAGGG CACAQTGOTG CACCTTGCAG GATTGOAATC 1320 

10 CAGQAGCCCT TTACTGTGAT GCCGACCACT CCAGTAGCGT ATAACTCGCT GGGTGCAGTG 1380 

ATTGGCATTG CAOTGCTGGG GTCCCTTGTG OTAGCCCTOG TGGCACTGTT CATTGGCTAT 1440 

CGGCACTGGC AAAAAGGCAA GGAGCACCAC CACCTGGCTG TGGCTTACAG CAGOGOGOGC 1S00 

CTGGACGGCT COGAGTATGT CATGCCAGAT GTCOCTCCGA GCTACAGTCA CTACTACTCC 1560 

AACCCCAGCT ACCACACCCT GTOQCAGTGC TCCOCAAACC CCCCACCCCC TAACAAGGTT 1620 

15 CCAGGCCOGC TCTTTGCCAG CCTGCAGAAC CCTGAGGGGC CAGGTGGGGC CCAAGGGCAT 1680 

GATAACCACA CCACCCTGCC TGCTGACTGG AAGCACCGCC GGGAGCCCCC TCCAGGGCCT 1740 

CTGGACAGGG GGAGCAGCOG CCTGGACCGA AGCTACAGCT ATAGCTACAG CAATGGCCCA 1800 

GGCCCATTCT ACAATAAAGG GCTCATCTCT GAAGAGGAGC TCGGGGCCAG TGTGGCTTCC 1860 

CTGAGCAGTG AGAACCCATA TGCCACCATC CGGGACCTGC CCAGCTTGCC AGGGGGCCCC 1920 

20 CGGGAGAGCA GCTACATGGA GATGAAAGGC CCTCCCTCAG GATCTCCCCC CAGGCAGCCT 1980 

CCTCAGTTCT GGGACAGCCA GAGGCGGCGG CAACCCCAGC CACAGAGAGA CAGTGGCACC 2040 

TACGAGCAGC CCAGCCCCCT GATCCATGAC OGAGACTCTG TGGGCTCCCA GCCCCCTCTG 2100 

CCTCCGGGCC TACCCCCCGG CCACTATGAC TCACCCAAGA ACAGCCACAT CCCTGGACAT 2160 
TATGACTTGC CTCGAGTACG GCATCCCCCA TCACCTCCAC TTCGACGCCA GGACCGT TGA 



Seq ID NO: 69 Proton sequence; 

Protein Accession #: FGENRSH prediction 

30 1 11 21 31 41 51 

I I I I I I 

GTICSLPCPB GPHGPNCSQE CRCHNGGLCD RFTGOCRCAP GYTGDRCREE CPVGRPGQDC 60 

AETCDCAPDA RCFPANGACL CBHGFTGDRC TDRLCPDGFY GLSCQAPCTC DREHSLSCHP 120 

KNGBCSCliPG WAGLKCNESC PQDTKGPGCQ EHCLCLHGGV CQATSGLCQC APGYTGPHCA 180 

35 SLCPPDTYGV NCSARCSCEN AIACSPIDGE CVCKEGWQRG NCSVPCPPGT WGPSCNASCQ 240 

CAHEAVCSPQ TGACTCTPGW HGAHCQLPCP KGQPGEGCAS RCDCDHSDGC DPVHGRCQCQ 300 

AGWKGARCHL SCPEGLWGVN CSNTCTCKNG GTCLPENGNC VCAPGPRGPS CQRSCQPGRY 360 

GKRCVPCKCA NHSFCHPSNG TCYCLAGWTG PDCSQRCPLG TFGANCSQPC QOGPGEKCHP 420 

ETGACVCPPG HSGAPCRIGI QEPPTVMPTT PVAYNSLGAV IGIAVLGSLV VALVAL FIGY 480 

40 RHWQKGKEKH HLAVAYSSGR LDGSEYVMPD VPPSYSHYYS NPSYHTLSQC SPNPPPPNKV 540 

PGPLPASLQN PERPGGAQGH DNHTTLPADW KHRREPPPGP LDRGSSRLDR SYSYSYSNGP 600 

GPPYNKGLIS EEELGASVAS LSSENPYATI RDLPSLPGGP RBSSYMEMKG PPSGSPPRQP 660 

PQPWDSQRRR QPQPQRDSGT YEQPSPLIHD RDSVGSQPPL PPGLPPGHYD SPKNSHIPGH 720 
YDLPPVRHPP SPPLRRQDR 



Seq ID NO: 70 Nucleotide Beq^eRcq? 
Nucleic Acid Accession U; NM_005458 

Coding sequence: 1..2826 (underlined sequences correspond to start and stop codona) 

50 

1 11 21 31 41 51 

I I I I i I 

ATGGCTTCCC CGCGGAGGTC CGGGCAGCCA GGGOGGCCGC CGCCGCCGCC ACCGCCGCCC 60 

GCGCGCCTGC TACTGCTACT GCTGCTGCCG CTGCTGCTGC CTCTGGCGCC CGGGGCCTGG 120 

55 GGCTGGGCGC GGGGCGCCCC CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC 180 

CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCOGCC 240 

GTGGAACTGG CCATCGAGCA GATCCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC 300 

CTGCGGCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCA 360 

ATAAAATACG GGCCGAACCA CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC 420 

60 ATCATTGCAG AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG 480 

CCTGTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT 540 

GCGGTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG 600 

CTGACGCAAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT 660 

GGCGAGGACA TTGAGATTTC AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC 720 

65 AAAAAGCTGA AGGGOAATGA TGTGCGGATC ATCCTTGGCC AGTTTGACCA GAATATGGCA 760 

GCAAAAGTGT TCTGTTGTGC ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC 840 

ATTOCGGGCT GGTACGAGCC TTCTTGGTGG GAGCAQGTGC ACACGGAAGC CAACTCATCC 900 

CGCTGCCTCC GGAAGAATCT GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG 960 

CCCCTGAGCT CCAAGCAGAT CAAGACCATC TCAGGAAAGA CTCCACAGCA GTATGAGAGA 1020 

70 GAGTACAACA ACAAGCGGTC AGGCGTGGGG CCCAGCAAGT TCCACGGGTA CGCCTACGAT 1080 

GGCATCTGGG TCATCGCCAA GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC 1140 

CGGCACCAGC GGATCCAGGA CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC 1200 

AATGCCATGA ACGAGACCAA CTTCTTCGGG GTCAOGGGTC AAGTTGTATT CCGGAATGGG 1260 

GAGAGAATGG GGACCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA 1320 

75 GAGTACAACG CTGTGGCCGA CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA 1380 

TCCGAACCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGCGGAAGA.T CTCCCTACCT 1440 
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CTCTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGQCCAG TGCTTTTCTC 1500 

TTCTTCAACA TCAAGAACCG GAATCAGAAG CTCATAAAGA TOTOGAGTCC ATACATGAAC 1560 

AACCTTATCA TCCTTGGAGQ GATOCTCTCC TATOCTTCCA TATTTCTCTT TGOCCTTQAT 1620 

GGATCCTTTG TCTCTGAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 1680 

5 ACCGTGGGCT ACACGACCGC TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC 1740 

ATCTTCAAAA ATGTGAAAAT GAASAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 1600 

GTGGGGGGCA TGCTGCTGAT CGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC I860 

CTGCGAAGGA CAGTGGAGAA GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC 1920 

ATCCGCCCTC TCCTGGAGCA CTGTQAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC 1980 

10 TATGCCTACA AGGGACTTCT CATOTTGTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC 2040 

GTCAGCATCC CCSCACTCAA OGACAGCAAG TACATCGGGA TGAGTGTCTA CAACQTGGGG 2100 

ATCATGTGCA TCATOGGGGC CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG 2160 

TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC 2220 

GTGCOGAAGC TCATCACCCT GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC 2280 

15 CAGTTCACTC AGAATCAGAA GAAAGAAGAT TCTAAAACGT CCACCTCGGT CACCAGTGTG 2340 

AACCAAGCCA GCACATCCCG CCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCOAATO 2400 

AAGATCACAG AGCTGGATAA AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA 2460 

GAAAAGACCA CCTACATTAA ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG 2520 

GGAAACTTCA CTGAGAGCAC AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA 2580 

20 AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAGAA 2640 

GATATAAACT CTCCAGAACA CATCCAQCGT CGGCTGTCCC TCCAGCTCCC CATCCTCCAC 2700 

CACGCCTACC TCCCATCCAT CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC 2760 

CCCACCGCCA GCCCCOGCCA CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC 2820 
CTGTAA 



Seq ID NO: 71 Protein sequence; 
Protein Accession ft: NP__005449 

30 1 11 21 31 41 51 

I I I I I I 

MASPRRSGQP GRPPPPPPPP ARLLLLLLU LLLPLAPGAW GWARGAPRPP PSSPPLSIHG 60 

LMPLTKEVAK GSIGRGVLPA VELAIEQIRN ESLLRPYFLD LRLYDTECDN AKGLKAFYDA 120 

IKYGPNHLMV FGGVCPSVTS IIABSLQGWN LVQLSFAATT PVLADKKKYP YFPRTVPSDN 180 

35 AVNPAILKLL KHYQWKRVGT LTQDVQRF8E VRNDLTGVLY GBDIEISDTE SPSMDPCTSV 240 

KKLKGNDVRI ILGQPDQNMA AKVFCCAYEE NMYGSKYQWI IPG WYE PS WW EQVHTEANSS 300 

RCLRKKLLAA MEGYIGVDFE PhSSKQIKTl SGKTPQQYER EYNNKRSGVG PSKPHGYAYD 360 

GIWVTAKTLQ RAMETItHASS RHQRIQDFNY TDHTLGRIIL NAMNETNFFG VTGQWFRNG 420 

ERHGTIKPTQ FQDSREVKVG EYHAVADTLB IINDTIRFQG SEPPKDKTII LEQLRKISLP 480 

40 LYSILSALTI LGMIMASAFIi FFNIKNRNQK LIKKSSPYMN NLIILGGMLS YASIPLFGLD S40 

GSFVSEKTFE TLCTVRTWIL TVGYTTAFGA MPAKTWRVHA IPKNVKMKKK IIKDQKLLVI 600 

VGGMLLIOLC ILICWQAVDP LRRTVEKYSM EPDPAGRDI6 IRPLLEHCEN THMTIWLGIV 660 

YAYKGLLMLP GCPIAHETRN VSIPALNDSK YIGMSVYNVG IMCIIGAAVS PLTRDQPNVQ 720 

FCIVALVIIP CSTITLCLVP VPKLITLRTN PDAATQNRRP QPTQNQKKED SKTSTSVTSV 780 

45 NQASTSRLEG LQSENHRLRM KITELDKDLB EVTMQLQDTP EKTTYIKQNH YQELNDILNL 640 

GNFTESTDGG KAILKNHLDQ KPQLQWNTTE PSRTCKDPIE DINSPEHIQR RLSLQLPILH 900 
HAYLPSIGGV DASCVSPCVS PTASPRHRHV PPSFRVMVSG L 

Seq ID NO: 72 Nucleotide aeouence: 
50 Nucleic Acid Accession #: NM_00579S 

Coding sequence! 522-1940 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

« I I I I I I 

JJ GCACGAGGGA ACAACCTCTC TCTCTSCAGC AGAGAGTGTC ACCTCCTGCT TTAGGACCAT 60 

CAAGCTCTGC TAACTGAATC TCATCCTAAT TGCAGGATCA CATTGCAAAG CTTTCACTCT 120 

TTCCCACCTT GCTTGTGGGT AAATCTCTTC TGCGGAATCT CAGAAAGTAA AGTTCCATCC 180 

TGAGAATATT TCACAAAGAA TTTCCTTAAG AGCTGGACTG GGTCTTGACC CCTGGAATTT 240 

AAGAAATTCT TAAAGACAAT GTCAAATATG ATCCAAGAGA AAATGTGATT TGAGTCTGGA 300 

60 GACAATTGTG CATATCGTCT AATAATAAAA ACCCATACTA GCCTATAGAA AACAATATTT 360 

GAATAATAAA AACCCATACT AGCCTATAGA AAACAATATT TGAAAGATTG CTACCACTAA 420 

AAAGAAAACT ACTACAACTT GACAAGACTG CTGCAAACTT CAATTGGTCA CCACAACTTG 480 

ACAAGGTTGC TATAAAACAA GATTGCTACA ACTTCTAGTT TATGTTATAC AGCATATTTC 540 

ATTTGGGCTT AATGATGGAG AAAAAGTGTA CCCTGTATTT TCTGGTTCTC TTGCCTTTTT 600 

65 TTATGATTCT TGTTACAGCA GAATTAGAAG AGAGTCCTGA GGACTCAATT CAGTTGGGAG 660 

TT ACT AG AAA TAAAATCATQ ACAGCTCAAT ATGAATGTTA CCAAAAGATT ATGCAAGACC 720 

CCATTCAACA AGCAGAAGGC GTTTACTGCA ACAGAACCTG GGATGGATGG CTCTGCTGGA 780 

ACGATGTTGC AGCAGGAACT GAATCAATGC AGCTCTGCCC TGATTACTTT CAGGACTTTG 840 

ATCCATCAGA AAAAGTTACA AAGATCTGTG ACCAAGATGG AAACTGGTTT AGACATCCAG 900 

70 CAAGCAACAG AACATGGACA AATTATACCC AGTGTAATGT TAACACCCAC GAGAAAGTGA 960 

AGACTGCACT AAATTTGTTT TACCTGACCA TAATTGGACA CGGATTGTCT ATTGCATCAC 1020 

TGCTTATCTC GCTTGGCATA TTCTTTTATT TCAAGAGCCT AAGTTGCCAA AGGATTACCT 1080 

TACACAAAAA TCTGTTCTTC TCATTTGTTT GTAACTCTGT TGTAACAATC ATTCACCTCA 1140 

CTGCAGTGGC CAACAACCAG GCCTTAGTAG CCACAAATCC TGTTAGTTGC AAAGTGTCCC 1200 

75 AGTTCATTCA TCTTTACCTG ATGGGCTGTA ATTACTTTTG GATGCTCTGT GAAQGCATTT 1260 

ACCTACACAC ACTCATTGTG GTGGCOGTGT TTGCAGAGAA GCAACATTTA ATGTGGTATT 1320 
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ATTTTCTTGG CTGGGGATTT CCACTGATTC 
TATATTACAA TGACAATTGC TGGATCAGTT 
GCCCAATTTG TGCTGCTTTA CTGGTGAATC 
TCATCACCAA GTTAAAAGTT ACACACCAAG 
GAGCTACTCT TATCTTGGTG CCATTGCTTG 
CTGAAGGAAA GATTGCAGAG GAGGTATATG 
AGGOTCTTTT GOTCTCTACC ATTTTCTGCT 
GAAGAAACTG GAATCAATAC AAAATCCAAT 
TTCGTAQTGC GTCTTACACA GTGTCAACAA 
GTCCTAGTQA ACACTTAAAT GGAAAAAGCA 
CAOAAAATTT ATATAATTQA AAATAGAAGG 
AACTCAAGGA CTTGGACCCA TQACTCTGTA 
GGGAATGTCA TAAAGAAGA6 CCTTCACATQ 
ATCCAGCTCT ATGTGGGAAA AAAGAAATCC 
CACTATGCCT GATGTGACGC TACTAACCTG 
ACAATCAACT TTTCTGAGCT GGTGTAAGCC 
AAATGGCTGT AAAACTAAAC ATACATGTTG 
GACCTAGCTA AGGTCTATAA ACATGAAGGG 
TCCCATCTTG ATTGGGGCAG TTGACTTTTT 
TAACTACCCT CTCAAATGGA CAATACCAGA 
CTATGAAAAG CAACTGAGTA CAATTGTTAT 
ATCTTGTGGC ATATCCATTG TGGAAACTGG 
TTCTATATCA TTAGGAAAAC ATCTTAGTTG 
TGTCTTACCA AACAGTGGGA GGGAATTCCT 
TTCTACTGTA TAAACAAATT AGCAATCATT 
TATTTTCTTG GAATTTTGTA AAAAGAAATT 
TTTTATTTTA TAGTCTCAAA TCAAATACAT 
TAATGCAACA ATGTGTGTAT GTTAATATCT 
AAATAGAGTC TGGAATGCTA TATTTGGTAA 
AGAAGTCTGT TTGAGAACTA AGAGAACAGA 
AAACACAAG6 TCACTATTTT ACTGAATATA 
GGTGTGTTTO ACATATTTCT TTTTTCATTT 
TTTTAAACAA CTACTGTGAT AAATACCAAT 
AATATTACTT TACTGACTTT TACTATGTGA 
ATTCAAGAAA TATAAAAAAC TAGAAGGATA 
TTTAATAGAG CTACTGTATA TAATACAAAT 
AAAATTATTG TCAGATCTTA CTGAATTATT 
AACCTTGCTA ATGAATTAAA GTGAAATTTG 
CCGCTGAAAT CTCTAAAGAA CAAGAATGAC 
GTCATGGGTA TCTGTTTTTT AAGTGTGTCA 
CATCATAAGT TGTTTCTTAA GCTQTCAATA 
TCAAATTGCT AAGACAAATT ATCTAAATTC 
AGTACATTTA TAATTTATCT ATGCATGAAA 
ATAGCAAGCT GCCATAGAAA GQA 



CTGCTTGTAT ACATGCCATT GCTAGAAGCT 1380 

CTGATACCCA TCTCCTCTAC ATTATCCATQ 1440 

TTTTTTTCTT GTTAAATATT GTACGCGTTC 1500 

CGGAATCCAA TCTGTACATG AAAGCTGTGA 1560 

GCATTGAATT TGTGCTGATT CCATGGCGAC 1620 

ACTACATCAT GCACATCCTT ATGCACTTCC 1680 

TCTTTAATGG AGAGGTTCAA GCAATTCTGA 1740 

TTGGAAACAG CTTTTCCAAC TCAGAAGCTC 1600 

TCAGTGATGG TCCAGGTTAT AGTCATGACT 1860 

TCCATGATAT TGAAAATGTT CTCTTAAAAC 1920 

ATGGTTQTCT CACTOTTTGG TGCTTCTCCT 1980 

GCCAGAAGAC TTCAATATTA AATGACTTTG 2040 

AAATTAGTAG TGTGTTGATA AGAGTGTAAC 2100 

TGGTTTGTAA TGTTTGTCAG TAAATACTCC 2160 

ACATCACCAA GTGTGGAATT GGAGAAAAGC 2220 

AGTTCCAGCA CACCATTGAT GAATTCAAAC 2280 

GGCATGATTC TACCCTTATT CSCCCCAAGA 2340 

AAAATTAGCT TTTAGTTTTA AAACTCTTTA 2400 

TTTTTTCCCA GAGTGCCGTA GTCCTTTTTG 2460 

AGTGAATTAT CCCTGCTGGC TTTCTTTTCT 2520 

GATCTACTCA TTTGCTGACA CATCAGTTAT 2580 

ATGAACAGGA TGTATAATAT GCAATCTTAC 2640 

ATGCTACAAA ACACCTTGTC AACCTCTTCC 2700 

AGCTGTAAAT ATAAATTTTG TCCCTTCCAT 2760 

TTATATAAAG AAAATCAATG AAGGATTTCT 2820 

GTGAAAAATG AGCTTGTAAA TACTCCATTA 2880 

ACAACCTATG TAATTTTTAA AGCAAATATA 2940 

GATACTGTAT CTGGGCTGAT TTTTTAAATA 3000 

ATATTTTAAA GACAACCAGA TGCCAGCATC 3060 

AACATCTATC ATAAGATATA TTTATTTTAA 3120 

TTTGTTTTGA TAACTCATAC CTTAATAATA 31 B0 

TGACAATGAA CTCACATTCT AATCCAGAAA 3240 

CTGCTACTTT TATAGATTTT ACCCCATTAA 3300 

AGATATATAG CTTTGGAAAT GTCCCAGGCT 3360 

CTATATATAC CATATACAAT GCTTTAATAT 3420 

TAGGGAAATA CTTGAATATA TCATTGAGAA 3480 

GTCAGACTTT ATTAAATAAA GATAGAAGAA 3540 

CATGGGATTC AGTTTCTCTA ATGTTATTTT 3600 

TTCAATTAGT AAAAGTCAAT TTTGGGAAAA 3660 

ATCTGATTAA AATGGATGAA ACAAATTACT 3720 

TGTCAATAGA TGGTGAGTTC AGAACTTATT 3780 

GTAAGAATTA ACATATAGAA TGGTCTGGTC 3840 

AAGTATTGTT TTGTTTGAAA CATGAATTTC 3900 



Seq ID NO: 73 Protein sequence ,^ 
Protein Accession ft: NM_005795 



1 11 21 31 41 51 

I I I I I I 

MLYSIFHLGL MMEKKCTLYP LVLLPFPMIIi VTAELEESPE DSIQLGVTRN KIMTAQYECY ■ 60 

QKIMQDPIQQ AEGVYCNRTW DGWLCWNDVA AGTESMQLCP DYPQDFDPSE KVTKICDQDG 120 

NWFRHPASNR TWTNYTQCNV NTHEKVKTAL NLFYLTIIGH GLSIASLLIS LGIFFYFKSL 180 

SCQRITLHKN LFFSFVCNSV VTIIHLTAVA NNQALVATNP VSCKVSQPIH LYLMGCKYPW 240 

MLCEGIYLHT LIWAVFAEK QHLMWYYFLG WGFPLIPACI HAIARSLYYN DNCWISSDTH 300 

LLYIIHGPIC AALLVNLFFL LNXVRVLITK LKVTHOAESN LYMKAVRATL ILVPLLGIBF 360 

VLIPWRPEGK IABEVYDYIM HILMHFQGLL VSTIPCFFNG EVQAILRRNW MQYKIQFGNS 420 
FSNSEALRSA SYTVSTISDG PGYSHDCPSB HLNGKSIHDI ENVLLKPENL YN 

Seq ID NO: 74 Nucleotide sequence : 
Nucleic Acid Accession ft: NM_0004S0.1 

Coding sequence: 117.. 1949 {underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CCTGAGACAG AGGCAGCAGT GATACCCACC TGAQAGATCC TGTGTTTGAA CAACTGCTTC 60 

CCAAAACGGA AAGTATTTCA AGCCTAAACC TTTGGGTGAA AAGAACTCTT GAAGTCATGA 120 

TTGCTTCACA GTTTCTCTCA GCTCTCACTT TGGTGCTTCT CATTAAAGAG AGTGGAGCCT 180 

GGTCTTACAA CACCTCCACG GAAGCTATGA CTTATGATGA GGCCAGTGCT TATTGTCAGC 240 

AAAGGTACAC ACACCTGGTT GCAATTCAAA ACAAAGAAGA GATTGAGTAC CTAAACTCCA 300 

TATTGAGCTA TTCACCAAGT TATTACTGGA TTGGAATCAG AAAAGTCAAC AATGTGTGGG 360 

TCTGGGTAGG AACCCAGAAA CCTCTGACAG AAGAAGCCAA GAACTGGGCT CCAGGTGAAC 420 

CCAACAATAG GCAAAAAGAT GAGGACTGCG TGGAGATCTA CATCAAGAGA GAAAAAGATG 480 

TGGGCATGTG GAATGATGAG AGGTGCAGCA AGAAGAAGCT TGCCCTATGC TACACAGCTG 540 

CCTGTACCAA TACATCCTGC AGTGGCCACG GTGAATGTGT AGAGACCATC AATAATTACA 600 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



CTTGCAAGTG 
CCCTGGAATC 
ACAATTCTTC 
TGCAGTGTAT 
GTGATGCTGT 
TCCCATGGAA 
AGAGCCTTCA 
TGACATGCAG 
CTGGAGAGTT 
AGGGACCAGC 
GTGAAGCTTT 
CTAGTGCTTC 
TTGTGTTGAA 
AGCCCACATG 
GGTGTGCTCA 
AGGAGGGATT 
CAGAAQAGGT 
TCAACATGAG 
AAGGATGGAC 
GCCTQCTACC 
CTGCTGCTGO 
TACGGAAAGC 
GCTACCAAAA 
GGAACTAGAG 
CTTCTTGCCT 
AATAGATCAA 
TCCTACTCTC 
CAAAGGTGAA 
CGCTGTAAAA 
CAGTGTTTCG 
GTTTAGAGGA 
CCCACAGGCA 
AGGGTTACTA 
GTTCTTAAGG 
TCATTCAATA 
GGTAAAAGTT 
TGTTTTAAAT 
AGAATAAAOT 
AGTAAAAACT 
CCACGATGAA 
CACACAAAGG 
GCTTTGCATT 
TATGGAAGAT 
TTTAACGAAT 
GCTCTGGAAG 
AACAATTCCA 
AGTAATTGCC 
CCATTAACTT 
AACGACAAAG 
TTAAAGGGGC 
ATGGAATACA 
GCATTAGAAA 
TTTAAATTAT 
TCAGACCTAT 



TGACCCTGGC 
CCCTGAGCAT 
CTGCTCTATC 
GTCCTCTGGA 
GACAAATCCA 
CACAACCTGT 
GTGTACCTCA 
GGCCGTCOGC 
CACCTTCAAA 
CCAGGTTGAA 
CCAGTGCACA 
TGGCAGTTTC 
GOGATCCAAA 
TGAAGCTGTG 
TTCCCCTATT 
TGAATTATAT 
TCCTTCCTGC 
CTGCAGTGGG 
GCTCAATGGC 
TACCTGTGAA 
ACTCTCCCTC 
AAAGAAATTT 
GCCTTCTTAC 
GGATACACTG 
ACTATGCCAG 
AGTCCAGCAG 
AGGATCAAGA 
GAGACCAAGA 
TCTTGGCACA 
ACAGCTGATT 
AAAAAATGAC 
AATGCATGGA 
TGCACAATTT 
ATTGTGATAT 
CAAGTGTGGT 
ACTTATTCTA 
TTATTTCAAA 
TCTOAGTGTT 
GAATGGAAGG 
AAACTTCCAT 
TATAATTTTA 
CCTACAAAGA 
TTTAAATTCA 
GAAGATGTCT 
AGAGGAATGC 
AAGGAATCTC 
AAAGCTGCTC 
AGCATGTGTT 
CCAACAGTCA 
AGAAAAACTC 
GTGTTATTTT 
TTAGCTGTGT 
AACTTAAAAT 
TTGACATAAC 



TTCAGTGGAC 
GGAAGCCTGG 
AGCTGTGATA 



GCCAATGGGT 
ACATTTGACT 
TCTGGGAATT 
CAGCCTCAGA 
TCATCCTGCA 
TGCACCACTC 
GCCTTGTCCA 
CGTTATGGGT 
AGGCTCCAAT 
AGATGCGATG 
GGAGAATTCA 
GGATCAACTC 
CAAGTGGTAA 
GAGCCCGTGT 
TCTGCAGCTC 
GCTCCCACTG 
CTGACATTAG 
GTTCCTGCCA 
ATCCTTTAAG 
AAGTTAACAG 
ATGCCTTTAT 
GCAAGGACGG 
AAGTGTTGGC 
CTCTGAAATC 
GAAACACAAT 
ACACAGTTGC 
TAAAAATATT 
GGGTTGTTAA 
AATCACTTTC 
TTTTACTTGC 
AGGGACTTAA 
GATTACCCCC 
AAGGGAAACT 
TTTACTACAG 
TTTGTATATT 
GAGGCCAAAC 
TGAATGTCTT 
TGTTTGTCAG 
CAATAGAAAC 
AATAGTTATT 
CTGTGTGAGC 
CAGTTTTCAG 
TAGCCTTGAG 
GAAAAAAAAA 
AAACAGAGAT 
TGGGAAATAA 
CTTTGAAATT 
GAAATACCAG 
ATTTTATAAT 
ACTATAAAGG 



TCAAGTGTGA 
TTTGCAGTCA 
GGGGTTACCT 
CTCCTATTCC 
TOGTGGAATG 
GTGAAGAAGG 
GGGACAACQA 
ATGGCTCTGT 
ACTTCACCTG 
AAGGGCAGTG 
ACCCOGAGCG 
CCAGCTGTGA 
GTGGCCCCAC 
CTGTCCACCA 
CCTACAAGTC 
AACTTGAGTG 
AATGTTCAAG 
TTGGCACTGT 
GGACATGTGG 
AGTCCAACAT 
CACCATTTCT 
GCAGCTGCCA 
TTCAAAAGAA 
AGACAGATAA 
GGCTGAAACC 
CCTTCAACTG 
TAATGAAGGG 
TCAGAATTCC 
ATTTTGTGGC 
TGTCATAAGA 
ATAACTTAAA 
TGGTGCAAAT 
ATCCCTATGG 
ATTGAATATA 
AAAACTTGTA 
TCATTGTTTA 
ATTGTCCCCT 
TTGTTTTTTG 
GTCAGATATT 
GTTTTGAACT 
TGTTGGAAAA 
ATGTGATATG 
TCACCATGTA 
CCCTATTTGT 
AAGCATTTAT 
TTGATCACTG 
GAGTGTGAGA 
GTTTCAGAGA 
GTGATAAGGA 
GAGAGAACAA 
GTTTAAGTGT 
TGTGGTTTGT 
TTTTAAAGTA 
TTGACAATAA 



GCAAATTGTG 



GCCAAGCAGC 
AGCCTGCAAT 
TTTCCAAAAC 
ATTTGAACTA 
GAAOCCAACG 
GAGGTGCAGC 
TGAGGAAGGC 
GACACAGCAA 
AGGCTACATG 
GTTCTCCTGT 
AGGGGAGTGG 
GCCCCCGAAG 
CTCTTGTGCC 
CACATCTCAG 
CCTGGCAGTT 
GTGCAAGTTC 
AGCCACAGGA 
TCCCTTGGTA 
CCTCTGGCTT 
AAGCCTTGAA 
TCAGAAACAG 
CTCTCCTCGG 
GCAACACCCA 
AAAAGACTCA 
AAAGGATATT 
TTTTCTAACT 



ATGAATAATA 
AAAATGACAG 
CCTACTGAAT 
GATTCAGTGC 
TTATAATCTT 
AATGCTGTCA 
TTAACAAATT 
AGCAAGGCAT 
AAAACATGGT 
TTTTCAGAAA 
AATAAAAGCA 
GAATACAGAA 
TAAACATAAT 
AAAGAGTCAT 
TTTCTTCTGT 
GTTTATTTAT 
GCAATGAAAA 
ATCAAAACTC 
AGTTCTGGCT 
TCAGAACAGC 
CTACTGTGAT 
TGTAAATATT 
GTTTGAGTTT 
TATATTTATT 
ATGTGCTTAT 



AACTGTACAG 
AACTTCAGCT 
ATGGAGACCA 
OTGGTTGAGT 
CCTGGAAGCT 
ATGGGAGCCC 
TGTAAAGCTG 
CATTCCCCTG 
TTCATGTTGC 
ATCCCAGTTT 
AATTGTCTTC 
GAGCAGGGTT 
GACAACGAGA 
GGTTTGGTGA 
TTCAGCTGTG 
GGACAATGGA 
CCGGGAAAGA 
QCCTGTCCTG 
CACTGGTCTG 
GCTGGACTTT 
OGGAAATGCT 
TCAGACGGAA 
GTGCATCTGG 
GTCTCTGGCC 
TCACCACTTC 
GTGTTCCCTT 
TTCTTCCAAG 
CTCCCTTGCT 
TTGCCCTTCA 
ATTATCCAGA 
ATGTTGAATG 
GCTCTGTGCG 
TTCTTAAAGA 
CCATACTTCT 
ACTATGATAT 
ATGTTACATC 
GATGTTAACC 
AGAATTGGAG 
TATGTGGTTT 
TAAATGCAAA 
AGATGGATGT 
TCTTGTATAT 
CTGGTAGATT 
ATGTTAGGGT 
AAGCAGATTT 
ATTCTCAGTC 
TCCTACACTT 
GAACACTGGC 
AGAGGTTCTT 
CAGGCTATGT 
TATGTAAACT 
TATTGAGAAT 
TAAGCTTATG 
GTTT 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
34B0 
3540 
3600 
3660 
3720 
3780 



fieq ID NO: 75 grptjgin sequence t, 
Protein Accession #: NPJ)00441 



60 



65 



70 



i 
I 

MIASQFLSAL 
SILSYSPSYY 
DVGMWNDSRC 
TALES PEHGS 
ECDAVTNPAN 
AVTCRAVRQP 
VCEAFQCTAL 
EKPTCEAVRC 
WTEBVPSCQV 
SGIiIiPTCBAP 
GCYQKPSYIL 



11 
I 

TLVLLIKESG 
WIGIRKVNNV 
SKKKLALCYT 
LVCSHPLGNF 
GFVECFQNPG 
QNGSVRCSHS 
SNPERGYMNC 
DAVHQPPKGL 
VKCSSLAVPG 
TESNIPLVAG 



21 
I 

AWSYNTSTEA 
WVWVGTQKPL 
AACTNTSCSG 
SYNSSCSISC 
SPPWMTTCTF 
PAGEFTFKSS 
IiPSASGSPRY 
VRCAHSPIGE 
KINMSCSGEP 
LSAAGLSLLT 



31 
I 

MTYDEASAYC 
TEKAKNWAPG 
HGECVETINN 
DRGYLPSSME 



GSSCEPSCEQ 
PTYKBSCAPS 
VFGTVCKFAC 
LAPFLLWLRK 



41 

I 

QQRYTHLVAI 
BPNNRQKDED 
YTCKCDPGPS 
TMQCMSSGEW 
AQSLQCTSSG 
LQGPAQVECT 
GFVLKGSKRL 
CEEGFELYGS 
PEGWTLNGSA 
CLRKAKKFVP 



SI 
I 

QNKKEIEYLN 
CVEIYIKREK 
GXiKCBQIVNC 
SAPIPACNW 
NMDKEKPTCK 
TQGQWTQQ1P 
QCGPTGEWDN 
TQLECTSQGQ 
ARTCGATGHW 
ASSCQSLESD 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



75 



Seq ID NO: 76 Nucleotide sequence: 
Nucleic Acid Accession #: NM_031439 

Coding sequence: 69.. 1235 (underlined sequences correspond to start and stop codons) 



215 



WO 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



i 
I 

CCCQACCCGT 
GTGCGGCC&X 

ccctggacgc 
acaagggctc 
acgagaggaa 
tgctgggaaa 
cggagcqgct 
ggaagaagca 
tctccoggga 
agaaggagga 
accacgaggg 

CGTACGGGCT 
CCTTCTTCTC 
CAGGGCACCC 
GCTCCCTGGC 
CCCCATCTCC 
CCCACCTQGG 
TGAGCCAGGT 
CTCCTGGCCA 
CCCAGOTQAC 
CGGCCACGTA 
CCCTCGCGCC 
TTTCCTCCCA 
GCTGGACTCT 
GCCCACATTT 
CTCCCAOTGG 
GACAGACTTO 
ATTAATAAAG 
TGATAATTTT 
TCCAAAQTGA 
GATTTGAGAA 
TATTTTATTT 
ACACACACTT 
TATAAAATTC 
GATGTTTAAA 




41 
I 

GGCGAAGCGA 
CGAGGGTCTC 
CGTCCCCCGQ 
CTTCATGGTT 
CAACGCCGAG 
GAGGCCGTAC 
CTACAAGTAC 
GOOCTTCCTT 
CAGCCGGGGG 
GCCCAGCCTC 
GAGCAGTGTG 
OQTGCTGGAG 
CCGCCGCATC 
CCACTGTAGC 
GTCCCCTGTA 
ACTCCACTCC 
CTTCGACGCC 
ATTCGACCAG 
CAGTGGGCAT 
CTCCGTCCTG 
GAGGCGCCCC 
AGCCGTCCAG 
CCCCAGAGCC 
CCCACGTTCC 
CAGCCCCTGA 
ATCATCOAAA 
TGATTTAGGG 
CTCGCTAACC 
CTGCACTTTC 
TGAGAAAAAA 
AAATGGGATT 
TTTGTTAATT 
AATTTTCATT 
TATATTTCTA 



GACACGTACC 
CCGGAGGAGA 
CCCCACCTGC 
CACCCCCTGG 
CCCGGCTGTC 
AACCTCCAAG 
CTGQATCAAC 
TATTTGAACA 
GTTCCGGTCT 
GCTGATGCCA 
GTCCGGTCAG 



TTTGGCCTAA 
AGCCCCTGCA 
GAGTTGCTGT 
CTAATGGGGG 
TTCTCTCAAG 
TACGATCTGG 
TGCACCCCCT 
CAGTCAACCT 
GAGTTAAAAC 
TGTTTATTAT 
CTCTTTTACC 
AACATTTTAT 



60 
120 

iao 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620. 
1680 
1740 
1600 
1860 
1920 
1980 
2040 



45 



50 



55 



60 



65 



70 



75 



Seq ID NO; 77 Protein sequence: 
Protein Acceaaion #; NP 113627 



MASLLGAYPW 
KRLAVQNPDL 
QAKRLCKRVD 
GPAGGGGGGT 
PYSFBYAPSP 
GQLSPPPEHP 
TPTGPTBTSL 



11 
I 

PEGLECPALD 
HNAELSKMLG 
PGPtiliSSLSR 
PSSVDTYPYG 
LHCSHPLGSL 
GFDALDQLSQ 
I 6 VLADATAT 



21 
J 

AELSDGQSPP 
KSWKALTLSQ 
DQNALPEKRS 
LPTPPEMSPL 
ALGQSPGVSM 
VBLLGDMDRN 
YYNSY8VS 



31 41 51 

I I I 

AVPRPPGDKG SBSRIRRPMN APMVHAKDER 60 

KRPYVDEAER LRLQHMQDYP NYKYRPRRKK 120 

GSRGALGEKE DRGBYSPGTA LPSLRGCYHE 180 

DVLBPEQTPF SSPCQEEHGH PRRIPHLPGH 240 

MSPVPGCPPS PAYYSPATYH PLHSNLQAHL 300 

EFDQYLNTPG HPDSATGAMA LSGHVPVSQV 360 



Seq ID NO: 78 Nucleotide sequence i 
Nucleic Acid Accession #: XM_03S787 

Coding sequence: 329.. 949 (underlined sequences correspond to start and stop codons) 



TGCCCCGCCC 
AGGAGGAGGG 
CCTCCTCCTC 
CCGCTCCCGG 
GCAGCCGCGG 
AATCCTTTGG 
TTTGGCTTTA 
TCTACAGTTC 
TCGGAAGAAG 
GCAGGAGGAC 
TGTTTTCTCT 
TTTAAGAGTA 
AGAAGATAAA 
TAACTACGTG 



11 

I 

CGCTCCCCAG 
TGGATCTCCC 
CTCCTCCAOC 
CGCGGGGCCT 
CTTCCGGAGC 
TGAAAACTGA 
CACAAAGTCA 
ATGTACGATQ 
GTAGTGCTAQ 
TACGCTGCAA 
ATTACAGAAA 
AAAGAAGATG 
AGACAGGTTT 
GAAACATCTG 



21 
1 

CGCCCCGGAA 
CAGAGCAAAG 
CGCCCAGGCT 
TCCAGOCGAC 
CCTCGGGGCG 
GACACAAAai 
TCATGGTGGG 
AGTTTGTGGA 
ATGGGGAGGA 
TTAGAGACAA 
TOGAATCCTT 
AOAATGTTCC 
CTGTAGAAGA 
CTAAAACACG 



31 
1 

GTGATCTGTG 
CGTCGGAGTC 
CCCCCGCCAC 
AAGGACCGAG 
GCGGACTGGC 
SGCTGCAAAT 
CAGTGGTGGC 
GGACTATGAG 
AGTCCAGATC 
CTACTTCOGA 
TGCAGCTACA 
ATTTCTACTG 
GGCAAAAAAC 
AGCTAATGTT 



41 
I 

GCGGCTGCTG 
CTCCTCCTCC 
CCGTCAGACT 
TACCCTCCGG 
TCGCGGTGCA 
AAGCCCAAGG 
GTGGGCAAGT 
CCTACCAAAG 
GATATCTTAG 
AGTGGGGAGG 
GCTGACTTCA 
GTTGGTAACA 
AGAGCTGAGC 
GACAAGGTAT 



51 
I 

CAGAGCCGCC 
TTCTCCTCCT 
CCTCCTTCGA 
CCGGAGCCAC 
GATTCTTCTT 
GTCAGAATTC 
CAGCTCTGAC 
CAGACAGCTA 
ATACAGCTGG 
GGTTCCTCTG 
GGGAGCAGAT 
AATCAGATTT 
AGTGGAATGT 
TTTTTGATTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



216 



WO 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



AATGAGAGAA 
GAGGAAAAGT 
ACTCCTTTCT 
TTAATTQACT 
TTGGAACTGC 
AGCAAAGTTC 
AAGCTTGTGT 
TGGGAGGAAA 
ARATGTCrrO 
CCACTTTTTT 
CATATAGTTA 
TTAGAAAAGT 
GAAAAATATC 
CCATGTGTCC 
AAGTAACTGT 
AAAATAAGAT 
AGGGCCAQCA 
TTTGAATGAA 
GTGAGCATGT 
AGTGCTAATG 
ATTTCTAAAT 
TTTCTGTTTC 
CCATCAAACC 
TCACCATTGT 
CAGTGGGAAA 
AGTTGATTCA 
TGTGTTCAGG 
ACTCACTCTT 
TGAGATGTAC 
TACCAATGTA 
TTTAACCAGG 
GGTACTGAGG 
ACTACCCGAT 



ATTCGAGCGA 
TTAGCCAAGA 
TATCTTGACC 
GAAATTACTT 
AATGAAAGTC 
AACTTATTTC 
TTCTTGGGCA 
GGTGACTTCC 
GTCTTCTACT 
TAACCATTAT 
AACTGAGAGT 
GGTGTAAACT 
TGGTTATATC 
TGGTGAGAAA 
CCGCTAGAAG 
TCCAGAGCTC 
TATATACTTG 
CCCTCCTTTT 
AAGTGTTAAG 
CATTTTGCAC 
TTATATTCAT 
TGTTTATAAT 
TGGGTATAGT 
GTGGTGTACC 
TATGCCACTG 
ACAAAGTATT 
CATTCCAGGT 
TAGATATTTA 
ATCTTTCATT 
ACACTGGCCA 
GOTCCTAACC 
TGCTATGAAG 
TTGTTTATTT 



GAAAGATGGA 
GAATCAGAGA 
ATACTAATAA 
TAACATTTTG 
AAATTTACTT 
ATAATTGCCT 
GTCTTTCTTQ 
TCTGGTGTTT 
GCCTTGAAAA 
TATGCAAAAT 
AATTCATCTG 
TGTACATGGA 
ATTCTGGGTG 
ATATATGCCT 
TCTGTCCAAA 
TTTGATCGCT 
CAAGATAATT 
CTCTGAGATT 
TTTTTAATCT 
TAGAACGCTT 
AAAGTTACAG 
GAAGAACACT 
GCAGAAAACG 
TGCTGGAAGA 
ACCGATTTTT 
TTTTTCTTTT 
AACAGGTGTG 
CATCCAGCTT 
TCGTATTTCT 
GOGGGCCCAG 
ACTAACATTG 
CCAACTGACA 
GCAATTTGAG 



AGACAGCAAA 
AAGATGCTGC 
ATATAATTTA 
GAAATTGTTG 
TAAAAAGAAA 
ACATTTATCA 
AAATTGAAGA 
ATTATAAAGC 
ATGACAATTG 
TTAGAAGAAA 
TGAATCTGCT 
ATTTTTTGAA 
TGTTCTTACT 
GGCACAGCTT 
TTTAAAATGT 
TTTAATAAAC 
TTCAGCTGCA 
CTGGTCCCTG 
GGGAGCAGGG 
CGGGAAAATA 
TTTGATACAG 
GTAGCTACAT 
TGGCACACAC 
ATTCTAGCAT 
TTTTTTTCCT 
TTCTCAGTCC 
TATGTAAAGT 
CTCATGTTAA 
CATAGGCTAT 
CAATCTCCAT 
TGACTTTGCT 
AAGATGCATC 
CCATTTAAAG 



GAAAAGAATG 
ATTTTATAAT 
TAAGCATTGC 
TATATCACTA 
TTAATATGGC 
TGGTCCTGAA 
GGTGAAATGG 
TTAAATTTTA 
TGAACATGAT 
AGTTATTGGC 
TTAATTACCT 
TATGCCTTAA 
GACACCAGGG 
TTGTATAGAA 
GTGCCATATT 



AGGATTCAGC 
GAAATCCCTT 
CATAGOAAGA 
TTCATGCTTG 
GAATTATTAG 
TTTCAGAAGT 
TGACCACACA 
GCTACTTGGG 
CTTTGCAGTO 
TAATTTGAAC 
TAAAAATAGG 
ATATTTCTCC 
GCCATGTGCG 
GTGTACTTAT 
TTGAGACCTT 
ACGTGTCTTA 
ACCAATAAAC 



GAAAAAAGAA 
CAAAGCCCAA 
CATTGAAGGC 
AAAGCATGAA 
TTCACCAAGA 
TGTAGCGTGT 
GGGTGGGGAG 
TATCATTTTA 
AGTTAAACTA 
ATGGTTGTTG 
GGTGAGTAAC 
TTTAGAAACT 
GTCCGCTGCC 
AATTCTTGAG 
CTGGTTCTTG 
TTTTAAATGA 
ACCAGTTATG 
TCTGCTAGTG 
AAATGTCAGT 
CCATCTGTTC 
GAGTAATTCT 
TAACATCAAG 
TTAGGCTGTG 
GACATAATTT 
GGGCTAGGAC 
AGGTCAAAGA 
CTTTTTAGGA 
TTAAAGGGTT 
GAATTCAAGT 
TACAGTCTTA 
TCCTCTCCTG 
GGCTGATGCC 
TTCCTTTTTT 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1S00 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2S20 
2580 
2640 
2700 
2760 



Seq ID NO: 79 P rgtein sequence; 
Protein Accession ft: XP 035787 



40 



45 



50 



55 



60 



65 



70 



75 



1 11 21 31 41 51 

I I I 1 1 I 

MAANKPKGQN SLALHKVIMV GSGGVGKSAL TLQFMYDSFV EDYEPTKADS YRKKWLDGE 60 

EVQIDILDTA GQEDYAAIRD NYFRSGEGFL CVFSITEMES PAATADFREQ ILRVKEDENV 120 

PFLLVGNKSD LEDKRQVSVE EAKNRAEQWN VNYVETSAKT RANVDKVFFD LMREIRARKW 180 
EDSKEKKGKK KRKSLAKRIR ERCCIL 

Seq ID NO; 80 Nucleotide, Sgquenc^i 
Nucleic Acid Accession ft; NM_003467 

Coding sequence: 89.. 1147 (underlined sequences correspond to start and stop codons) 



1 
I 

GTTTGTTGGC 
CACCGCATCT 
CTACACCGAG 
AGAAAATGCT 
TGGCATTGTG 
CATGACGGAC 
TCCCTTCTGG 
AGTCCATGTC 
TCTGGACCGC 
GGCTGAAAAG 
CTTCATCTTT 
CAATGACTTG 
TGGTATTGTC 
CCACCAGAAG 
TTGGCTGCCT 
GCAAGGGTGT 
TTTCTTCCAC 
CTCTGCCCAG 
AGGAAAGCGA 
CAGCTAACAC 
ACATTTTTCA 

TGTTTCATAT 
CTCGTGGTAG 
AAGCTAGAAA 



11 
i 

TOCGGCAGCA 
GGAGAACCAG 
GAAATGGGCT 
AATTTCAATA 
GGCAATGGAT 
AAGTACAGGC 
GCAGTTGATG 
ATCTACACAG 
TACCTGGCCA 
GTGGTCTATG 
GCCAAOGTCA 
TGGGTGGTTG 
ATCCTGTCCT 
CGCAAGGCCC 
TACTACATTG 
GAGTTTGAGA 
TGTTGTCTGA 
CACGCACTCA 
GGTGGACATT 
AGATGTAAAA 
GATATAAAAG 
TTTCTTTAGT 
TGATGTGTGT 
GACTGTAGAA 
TGATCCCCAG 



21 
I 

GGTAGCAAAG 
CGGTTACCAT 
CAGGGGACTA 
AAATCTTCCT 
TGGTCATCCT 
TGCACCTGTC 
CCGTGGCAAA 
TCAACCTCTA 
TCGTCCACGC 
TTGGCGTCTG 
GTGAGGCAGA 
TGTTCCAGTT 
GCTATTGCAT 
TCAAGACCAC 
GGATCAGCAT 
ACACTGTGCA 
ACCCCATCCT 
CCTCTGTGAG 
CATCTGTTTC 
GACTTTTTTT 
ACTGACCAAT 
TTTTGTGAAG 
CTAGGCAGGA 
AAGGGAACTG 
CTGTTTATGC 



31 
I 

TGACGCCGAG 
gGAGGGGATC 
TGACTCCATG 
GCCCACCATC 
GGTCATGGGT 
AGTGGCCGAC 
CTGGTACTTT 
CAOCAGTGTC 
CACCAACAGT 
GATCCCTGCC 
TGACAGATAT 
TCAGCACATC 
TATCATCTCC 
AGTCATCCTC 
CGACTCCTTC 
CAAGTGGATT 
CTATGCTTTC 
CAGAGGGTCC 
CACTGAGTCT 
TATACGATAA 
ATTGTACAGT 
TTTAATTGAC 
CCTGTGGCCA 
AACATTCCAG 
ATAGATAATC 



41 
I 

GGCCTGAGTG 
AGTATATACA 
AAGGAACCCT 
TACTCCATCA 
TACCAGAAGA 
CTCCTCTTTG 
GGGAACTTCC 
CTCATCCTGG 
CAGAGGCCAA 
CTCCTGCTGA 
ATCTGTGACC 
ATGGTTGGCC 
AAGCTGTCAC 
ATCCTGGCTT 
ATCCTCCTGG 
TCCATCACCG 
CTTGGAGCCA 
AGCCTCAAGA 
GAGTCTTCAA 
ATAACTTTTT 
TTTTATTGCT 
TTATTTATAT 
AGTTCTTAGT 
AGOGTGTAGT 
TCTCCATTCC 



51 
I 

CTCCAGTAGC 
CTTCAGATAA 
GTTTCCX3TGA 
TCTTCTTAAC 
AACTGAGAAG 
TCATCACGCT 
TATGCAAGGC 
CCTrCATCAO 
GGAAGCTGTT 
CTATTCCCGA 
GCTTCTACCC 
TTATCCTGCC 
ACTCCAAGGG 
TCTTCGCCTG 
AAATCATCAA 
AGGCCCTAGC 
AATTTAAAAC 
TCCTCTCCAA 
GTTTTCACTC 
TTTAAGTTAC 
TGTTGGATTT 
AAATTTTTTT 
TGCTGTATGT 
GAATCACGTA 
CGTGGAACGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



217 



WO 02/079492 



PCT/US02/04915 



TTTTCCTGTT CTTAAGACGT GATTTTGCTG TAGAAGATGG CACTTATAAC CAAAGCCCAA 1560 
AGTGGTATAG AAATGCTGGT TTTTCAGTTT TCAGGAGTGG QTTGATTTCA GCACCTACAG 1620 
TGTACAGTCT TOTATTAAOT TGTTAATAAA AGTACATGfTT AAACTTACTT AGTGTTATG 



Seq ID NO: 81 . 

Protein Accession #< 



1 11 21 31 41 SI 

I I I I I I 

MEGISIYTSD NYTBEMGSGD YDSMKEPCFR EKNANFNKIF LPTIYSIIFL TGIVGNGLVI 60 
LVMGYQKKLR SMTDKYRLHT* SVADLLFVIT LPPWAVDAVA NWYFGNFLCK AVHVIYTVNL 120 
YSSVLILAPI SLDRYIiAIVH ATNSQRPRKL LAEKWYVGV WIPALLUTIP DFIFANVSEA 180 
DDRYICDRPY PNDLWVWFQ FQHIMVGLIL PGIVILSCYC IIISKLSHSK GHQKRKALKT 240 
TVILILAPFA CWLPYYIGIS IDSFILLEII KQGCBFENTV HKWISITEAL AFPHCCLNFI 300 
LYAFLOAKPK TSAQHALTSV SRGSSLKILS KGKRGGH8SV STESBSSSFH SS 



Seq ID HO: 82 Nucleotide sequence: 
Nucleic Acid Accession #: NM_0149S9 

Coding sequence: 314.. 1609 (underlined sequences correspond to start and stop codons) 



CTGGTTCTCA 
GGGAAACAAG 
GTGCAAGCAT 
ATTTGACAAA 
CCGAGACGGG 
TTTTTCCAAG 
TGTCAACTAT 
TTCCTCTTAT 
TCTGGGGCCT 
CGTTTGGTTC 
AAGGGATGAG 
CCTGCAGCAC 
AGAGGAGGCT 
CGTCTCCTGG 
CCGGGTGGAG 
GCTGCGGATC 
TTATCACCCC 
GCTAACAAAG 
GCCCCCAATG 
GAAAGTAATG 
CTCAAAATTC 
ACATGGGACT 
ATCAGCCCCT 
AGCCAGGATG 
TGAGAATGAG 
GCTGAGCATG 
TGAAAGGGAC 
TAGGTAGTCT 
TCATTGATTT 
AGGACAGACT 
AATGTCTGAA 
CACTATTTTA 
• GTGTTTATGA 
CCTATGGACA 
AAGCTTCTCA 
GTTAGGACTT 
TGAAAGAGAG 
GTGTGTAGCC 
CTGAGATGAA 
CAGCTTATTC 
ACATTTGGAT 
ATGTTTTCGT 
TAGTAGACAC 
AGTCTCTTTT 
ATTTATCCTC 
CCACTTAAAG 
TAGGATATAG 
ATATATTTTC 
TTATGTTTTT 
GTTCTGCAAA 
TAGGTACAGA 
TCAGTGATGT 



11 
I 

ACTTCTTTTG 
CTCTCAGGAC 
CTGGGCCATC 
TGGAAAAAAA 
TATACAGGGA 
AAGATGATGA 
CTGGGGGGGA 
GCTTCTAAAG 
GAAGGAAATG 
CCCACTGCTG 
GTCACAGTGA 
CATGAACAGT 
GTCGCCGAAA 
TTTCTCGTTG 
CCTTTCTATG 
GCCAGTGGGA 
CACCCCGAAG 
GCGATAGATG 
GAACCCCTGA 
CCCAAGGAGT 
TATGCTGGGC 
TTGGTGTGGG 
CCTCCTTTCT 
GGGGACCTGA 
AAGGAGCTGG 
GTGGAGAAGA 
CCTTACCTCG 
GGAAGAGAGA 
CAGTGTTCAA 
TGTAACCTGG 
GAAGGTAGTA 
TCCATTGACA 
AGGATGGGGC 
CTCCGTTTGA 
GCTGAGGACA 
TAACACTTTA 
CACAAAAATG 
TGCTACAAGG 
ATGTGGTAAA 
TGAGACAGAC 
TCTACCATAQ 
GGCACACATA 
CATCTGGTTG 
AACTGGAAGA 
AGTCAGCCAG 
TGGAATCTAG 
TTATCTTCTA 
TACTTTTAAT 
CCAGATATTT 
TTTCCACTTC 
TTCTCTTATT 
TTTTTGCTTG 



21 
I 

AAATAATGTT 
TTCCGGTCGC 
TTCAATGGTA 
GGAGTGTCCA 
GCTACCCTGT 
GACAGAGGCA 
CATTCCCAGG 
TCTGTTTTGA 
TGGATGTTGA 
GCTGGTATCT 
CGATTGCGTT 
GGCTGGTGGG 
TCCACCTCCC 
CCCATTTTAA 
CTGTCCTGGA 
CTCGCCTCTC 
ATATTAAGTT 
ATGAGGAAGA 
ACTTTGGTTC 
TGAAATTGTC 
AGATGAAGGA 
ATACTGAGGT 
CAGGTGCAGC 
AAGGGGTGCT 
TGGAGCAGGA 
AAGGGGACCT 
TGTCCTATCT 
ATCCAGCGTT 
GACAGAAGAA 
CATGTACCTA 
ATATTCCTTT 
TGATTCTTGA 
CTGGAAAGGC 
AGTATCACCT 
CTCAAGGCAT 
TCTATGGCTA 
GGAGAAAATG 
AGTTGTTGGG 
TCAACTCCAC 
ATTCCTGGCA 
ACTCTGTCAT 
TTTCCATCCT 
AGTCAGTTTT 
TTTCAATTAC 
TTTGTTATGT 
GCACTTTATC 
CATAATCTTT 
CACTCAGAAG 
ACCATTTCTG 
TTCTGATAGA 
TTTTGCTTCC 
TAGTATTTTT 



31 
I 

CATAGAGAAG 
CATGATGGCT 
AAAAAGATAC 
GAAAAGAGTA 
GTTTCTGAGA 
GAGCCATTAT 
AGACATTTGC 
GATCGAAGAA 
GTTGATTGAT 
GTGGTCAGCC 
TGGTTCCTGG 
CGGCCCCTTG 
CCACTTCATC 
GAATGAAGGG 
AAGCCCCAGC 
CATCCCCATC 
CCACTTGTAC 
TCGCTTCCAT 
CAGTTATATT 
CTACAGGAGC 
ACCCATTCAA 
GAAGCCAGTG 
CTTTGTGAAG 
CGATGATCTC 
AAAGACACGG 
GGCCCTGGAC 
TAGACAGCAG 
CTCATTGGAA 
GACPGGGTAA 
TTGACTGTAT 
TAAATTTTTT 
AGACCCAGGA 
AACTTTTCCT 
TCTCATAACT 
ACATGATGAC 
CTGTTATTAG 
CAAACATGAG 
TTAAATGTTC 
AGAACCACCA 
ATGTACCATA 
TTTGTAGCCA 
TTTATGTTTA 
TTTTATGGTG 
TTAOGTTAAT 
CTTTTCTATT 
ACCATTTAGA 
CTGTATCTTA 
ATTTAAAAAA 
TTGCTCTTCC 
CGTTTTTTAG 
TCTGAGGACA 
AGTTGACATT 



41 
I 

GAGGGCTGTC 
GTGGGCGGTA 
AGTAAAGACA 
GCAGCAGTGA 
CCCTTTGTGA 
TGTTCCGTGC 
TCAGAAGAGA 
GATTATAAAA 
AAGAGCACAA 
ACAGGCCTCG 
AGTCAGCACC 
TTTGATGTCA 
TCCCTCCAAG 
ATGGTCCTGG 
rrCTCTCTGA 
ACTTCCAACA 
CTTGTCCCCA 
GGTGTGCGCC 
GTGTCTAATT 
CCTGGAGAAA 
CTTGAGATTA 
GATCTCCAGC 
GAGAACCACC 
CAGGACAATG 
CAGAGCAAGA 
GTGCTCTTCA 
AATTTGTAAA 
ATGGATAAAC 
CATCTATCAC 
CCTCATGCAT 
CCAACCATTG 
TAAAGGACAT 
GATTAATGTG 
AAAAGCAGAA 
AGTCTTTTTT 
AACAATGTAA 
CAGAAAATAT 
ATGGTCAACT 
AAAAGAAAAT 
CAAAAAATAA 
TTTCAGCTGT 
ATCTGTTTAA 
TATTTTX3AAC 
GTAATTATTA 
CTACTGTTAT 
TCCTATTACC 
AAACCCATCA 
CTCATGAGAA 
TTCATTATTT 
TTCTTTTAGA 
TCTTTTTCTC 
GTTTTCTOTT 



51 
I 

TGAGATTCGA 
AACGCGGTTA 
TAAATACCAC 
GGAAGAGCTG 
CATCTCACAT 
TGTTCCTGAG 
ATCAAATAGT 
ATCGTCAGTT 
ACAGATACAG 
GCTTCCTGGT 
TGGCCCTGGA 
CTGCAGAGCC 
GTGAGGTGGA 
AGCATCCAGC 
TGGGCATCCT 
CATTGATCTA 
GCGACGCCTT 
TGCAGACTTC 
CTGCTAACCT 
TTCAGCACTT 
CTGAAAAAAG 
TTGTAGCTGC 
GGCAACTCCA 
AGGTTCTTAC 
ATGAGGCCTT 
GAAGCATTAG 
ATGAGTCAGT 
AGAAATGTGA 
ACAGGCTTTC 
TTTCCTCAAG 
CTTGATATAT 
CCGGATAGGT 
AAAAATAATT 
AAGCTAACAA 
TTTTTTGTAT 
ATGTATTTGC 
TTTCCCACTG 
CCAAGGAATA 
GAGGGTAATT 
GCCAACTCTG 
CTTTTGATTA 
AACAAGTTCC 
CCATTCTGAT 
ATATGTTAGG 
CACATTTGTA 
TTTTCTCATC 
ATAAATTATT 
GAGTAATCTG 
TCCAAATTTC 
GTGGTTCTGA 
ACCTTCATTC 
CAGCAGTTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
19B0 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
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10 



15 



20 



25 



30 



35 



CTTTTAGCTT 
TATGTAGTGT 
TCATTATGTT 
AATTTGCAGT 
CTTTTTATTA 
TAGCTTATAT 
TTTTATATCC 
GTTTGAAATC 
AATGGAAACC 
TTGAATCAAT 
AATQTTAGAC 
TTCATTTTTT 
AGTGTTCACC 
TGGTCACTGT 
GGCTTATATT 
TCAAGACAGG 
CTGCAACCTC 
ATTACAGGCA 
TTTGCTATGT 
CCTCCCAAAG 
T T TTTTTTTT 
CTCAGCTCAC 
AGTAGCTAGG 
TAGTAGAGAC 
CCACCCCCCT 
CATTTGAGTA 
GTTATTCAGT 
GCTCATCCTT 
TGAGACTCTG 
AACTGAGGTC 
TGTGTGTGCC 
AATGTGAATT 
AAAGTCTTTA 



CCGTATTTOC 
GTCATTTTTC 
GGGGATGAGT 
TTTATGTCTT 
GCCTGATTTT 
GCCCAGAAGG 
TTCTCATGTC 
AACAGTCTTG 
TCTTACATGT 
CAATATTATA 
TTTTATGTTT 
CCATTATTCT 
TGTTGTTGTC 
ATGTATCAQT 
CTATTTTCCT 
GTCTCAACTC 
TGCCTCCTGG 
TGCACCACCA 
TOGCCAGGCT 
TGCTGGGATT 
TTGAGATGQA 
TGCAGCCTCT 
ACTACAGGTG 
AGGGTTTCAC 
CGGCCTTCCA 
TTTTTATAAT 
GTTTGGTGTC 
GTATTCTCAG 
TTTTATTTGT 
TTAATATCAG 
TATGAGATTG 
AGGACCAGCG 
TATGCTCAG 



TGATGAGAAA 
TGTCAGATTT 
TTCCTTCTTT 
TTACCAAACT 
CATCTTTATA 
CCTTCAAAAT 
TTCTACTGTA 
AGAATAGATQ 
GATTTTCCTT 
TTTTGTTTTT 
TCCTAAATGT 
GATTGGGTAA 
TGTGTCGTCC 
TCTAAAATTT 
GCAAATGTGT 
TGTTACCCAG 
TTCAAGCGAT 
CAGCCCAGCT 
GGTTTTOAAC 
ACAGGCCACT 
GTCTCGCTCT 
GTCTCCCGGG 
CATGCCAACA 
CATTTTGGCC 
AAGTGCTGGG 
GTCTCTTTTA 
CACTGAGTTG 
TAGTTCCGAT 
ATCCAACAGA 
CTCATTTTAA 
GGTGCAGTGT 
CAATGAATGC 



TCTGCAGTCA 
CAAGGTATTT 
TATTCCCTTT 
TAGAGGTTTT 
GGAAATAGTT 
AAGAATTTTG 
AAATTCATAT 
AAAATTTTGA 
GCCATCTAGA 
TTCCTCCTCT 
CCCTGATATT 
TTTTAATTTG 
CACTGAGTGC 
CCATTTTGTT 
CAGCATTTGC 
GCTGQAGTGC 
TATTGTGCCT 
AATTTTTTGT 
TCCTGGCCTC 
ACACCTGGCA 
GTCATCTAGG 
CTCAAGCGAT 
CGCCCGGCTA 
AGGATGGTCT 
ATTACAGGCA 
AAGTCTTTGT 
TCATTTGCCA 
ATGTACCCTC 
AGATGTTTAT 
AAGTCTTTGC 
ATCCTGTTAG 
TCAAGTTGGG 



TTCAAATTGT 
ATCTTTAGTT 
GGAATTTGCT 
CAGCCTAATT 
TAAGTGATGA 
AAAGAATACA 
GCTTTGCTAC 
TGAATAGTGG 
AATAAACCAT 
TCTGAGACTC 
CTACTTATTT 
TCTATTTTCA 
ATTCACCACC 
CTCTATATTT 
TTGTTTGAGC 
AGTGGTGCGA 
CAGCCTCCTO 
ATTTTTAGTA 
AAGTGATCCA 
CATTTGAGTA 
CTGGAGTGCA 
TCTCTTGCCT 
ATTTTTTTAA 
CGATCTCCTG 
TGAGCCACCG 
CAGATAATTC 
GACAAGTOGA 
GACATGTGAA 
TATTTATTTG 
AGTGGTATTC 
CTCCATTCTC 
GTTGGGCGTT 



TGTTTCCCTG 
TTTAGCCATT 
CCAATTCATA 
TCTAAAAATA 
CAAGTTCCAA 
GAAAACAAAC 
TCTAAACCTA 
AATTCTTTTA 
AGTATTTATG 
TTATTGTGGA 
AGAACATCTT 
AATTTGCTGG 
TTTTAAATTT 
TAAATTTCTT 
CTTllTOTf 
TCTCAGCTCA 
AGTAGCTGGG 
GAGACAGAGT 
CCCACCTCAG 
TTTTTTTTTT 
GTGGTGTGAT 
CAGCCTCCTG 
AAAATATTTT 
ACCTCATGAT 
TGCCTGGCCT 
CACTGTACAT 
GATTTTTGCA 
TGTTATCTTA 
GCTTTCTGTG 
GGATCTATCC 
AGGGCOTTTG 
AGAATTCATA 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3640 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 



Seq ID NO: 83 protein sequence t 
Protein Accession th NP__055774 



40 



45 



50 



55 



60 



65 



70 



75 



MMRQRQSHYC 
GNVDVELIDK 
EQWLVGGPLP 
FYAVLESPSF 



AGQMKEPIQL 
DLKGVLDDLQ 
YLVSYLRQQN 



11 
I 

SVLFLSVNYL 
STNRYSVWPP 
DVTAEPEEAV 
SLMGILLRIA 
VRLQTSPPME 
EITEKRHGTL 
DNEVLTENEK 
L 



21 
I 

GGTPPGDICS 
TAGWYLWSAT 
ABIHLPHFIS 
SGTRLSIPIT 
PLNFGSSYIV 
VWDTEVKPVD 
BliVEQEKTRQ 



31 

I 

EENQIVSSYA 
GLGPLVRDEV 
LQGBVDVSWF 
SNTLIYYHPH 
SNSANLKVMP 
LQLVAASAPP 
SKNEALLSMV 



41 
I 

SKVCFEIEED 
TVTIAPGSWS 
LVAHFKNEGM 
PBDIKFKLYL 
KELKLSYRSP 
PFSGAAFVKE 
EKKGDLALDV 



51 
I 

YKKRQFLGPE 60 

QHLALDLQHH 120 

VLEHPARVEP 180 

VFSDALLTKA 240 

GEIQHFSKFY 300 

NHRQLQARMG 360 

LPRSISERDP 420 



Seq ID NO: 84 Nucleotide sequenc e: 
Nucleic Acid Accession #t NM_007036 

Coding sequence: 56-610 (underlined sequences correspond to start and stop codons) 



1 
I 

CTTCCCACCA 
GAGCGTCTTG 
TAATTATGCG 
CTGCAAGAGG 
AGAAACTTGC 
GTGTCAGCCT 
TCCCTACGGC 
TGACAGGGGG 
TTCCAACAGA 
GAGAGAAGAA 
TCCACGCTGA 
CACAGCCAAC 
CCAAATTGTG 
ATCCATATGA 
AAATGTGTGT 
AGACAGGTCA 
TCTTTGACTT 
GATGGGGAGG 
TCTAGAATTT 



11 
I 

GCAAAGACCA 
CTGCTGACCA 
GTGGACTGCC 
ACAGTGCTCG 
TACCGCACAG 
TCTAATGGGG 
ACCTTCGGGA 
ACGGGAAAAT 
TTTGTTTCTC 
GTTGTGAAAG 
TCCCGGCTGT 
ATTTTAGGAA 
ATGCATGGTG 
CTGAACACTT 
GTATAGTAAC 
ACCAAAGAGG 
TGATGTACAT 
GGGTGGGAGT 
AATTGTGCTT 



21 
I 

CGACTGGAGA 
CGCTCCTCGT 
CTCAACACTG 
ACGACTGTGG 
TCTCAGGCAT 
AGGATCCTTT 
TGGATTGCAG 
OCCTGAAATT 
TCACGGAGCA 
AGAATGCTGC 
GATTTCTGAG 
CTTTCTAGAT 
GATCCAGAAA 
GTATGTGTTT 
ACTGAAGAAC 
GAGCTAGGCA 
TAATGTTGGG 
GGGAAATAAA 
TTTTTTTTTT 



31 
I 

GCCGAGCCGG 
GCCTGCACAC 
TGACAGCAGT 
CTGCTGCCGA 
GGATGGCATG 
TGGTGAAGAG 
AGAGACCTGC 
CCCCTTCTTC 
TGACATGGCA 
OKSGTCTCCC 
AGAAGGCTCT 
ATAGCATAAG 
ACAAAAAGTA 
GTTAAATATT 
TAAAAATGCA 
AAGCTGAAGA 
ATATGGAATG 
ATATTTAGCC 
TTTGGCTTTG 



41 

I 

AGGCAGCTGG 
CTGGTGGCCG 
GAGTGCAAAA 
GTGTGCGCTG 
AAGTGTGGCC 
TTTGGTATCT 
AACTGCCAGT 
CAATATTCAG 
TCTGGAGATG 
GTAATGAGGA 
ATTTTCGTGA 
TACATGTAAT 
GGATACTTAC 
CGAATGCATG 
ATTTAGGTAA 
CCGCAGTGAG 
AAGACTTAAG 
CTTCCTTGOT 
GGAAAAGTCA 



51 
I 

GAAACATSAA 
CCTGGAGCAA 
GCAGCCCGCG 
CAGGGCGGGG 
GGGGGCTGAG 
GCAAAGACTG 
CAGGCATCTG 
TAACCAAGTC 
GCAATATTGT 
AATGGTTAAA 
TTGTTCAACA 
TTTTGAAGAT 
AATCCATAAC 
TAGATTTGTT 
TCTTACATGG 
TCAAATTAGT 
AGCAGGAGAA 
AGGTAGCTTC 
AAATAAAACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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ACCAGAAAAC CCCTQAAGGA AGTAAGATGT T7GAAGCTTA TOGAAATTTG AGTAACAAAC 1200 

AOCTTTGAAC TGAGAGCAAT TTCAAAAGGC TGCTOATGTA GTTCCCGGQT TACCTGTATC 1260 

TGAAGGACGG TTCTGQGGCA TAGOAAACAC ATACACTTCC ATAAATAGCT TTAACGTATG 1320 

CCACCTCAGA GATAAATCTA AGAAGTATTT TACCCACTGG TGGTTTGTGT GT3TATGAAG 1380 

GTAAATATTT ATATATTTTT ATAAATAAAT GTGTTAGTGC AAGTCATCTT CCCTACCCAT 1440 

ATTTATCATC CTCTTGAGGA AAGAAATCTA OTATTATTTG TTGAAAATGG TTAGAATAAA 1500 

AACCTATGAC TCTATAAGGT TTTCAAACAT CTGAGGCATG ATAAATTTAT TATCCATAAT 1560 

TATAGGAGTC ACTCTGGATT TCAAAAAATQ TCAAAAAATG AGCAACAGAG GGACCTTATT 1620 

TAAACATAAG TGCTGTGACT TCGGTGAATT TTCAATTTAA GGTATGAAAA TAAGTTTTTA 1680 

GGAGGTTTGT AAAAGAAGAA TCAATTTTCA GCAGAAAACA TGTCAACTTT AAAATATAGG 1740 

TGGAATTAGG AGTATATTTG AAAGAATCTT AGCACAAACA GGACTGTTGT ACTAGATGTT 1800 

CTTAGGAAAT ATCTCAGAAG TATTTTATTT GAAGTGAAGA ACTTATTTAA GAATTATTTC 1860 

AGTATTTACC TGTATTTTAT TCTTGAAGTT GGCCAACAGA GTTGTGAATG TGTOTGGAAG 1920 

GCCTTTGAAT GTAAAGCTGC ATAAGCTGTT AGGTTTTGTT TTAAAAGGAC ATGTTTATTA 1980 
TTGTTCAATA AAAAAGAACA AGATAC 



Seq ID KO: 8S Pro^n, sequence; 
Protein Accession #: NP_008967.i 



1 11 21 31 41 SI 

I I I I I I 

MKSVLLLTTL LVPAHLVAAW SNNYAVDCPQ HCDSSECKSS PRCKRTVLDD CGCCRVCAAG 60 
RGETCYRTVS GMDGMKCGPG LRCQPSNGED PFGEEFGICK DCPYGTFGMD CRETCNCQSG 120 
ICDRQTQKCL KFPFFQYSVT KSSNRFVSI/T KHDMASGDGN IVREEWKBN AAQSPVMRKW 180 
LNPR 



Seq ID NO: 86 Nucleotide sequence; 
Nucleic Acid Accession #: DB6963 

Coding sequence: 52-4491 (underlined sequences correspond to start and stop codons) 



AGCCGGCCGT 
CGCTCCAGGG 
AOGCTGGCCG 
CGCACCACCG 
ACCTCCATCC 
CGGCTGAQGA 
GGAGCATTTO 
TCAATTGACA 
AATCAGATAG 
TTTTTGCATA 
ATGAAGAGAT 
GCGGATTTGC 
TATCCCAGAC 
GAAAGGCCCC 
TACTTCACCT 
AATGAGCTGA 
ATCCAGAACA 
GGAGAGGTGA 
TTTGTAATCC 
AGCGCCACAG 
CCAGTTGACC 
CAGGGGGACA 
ACCGCTTTCA 
GTTATTGAGG 
' ATCGCCTGGA 
TCGGGAACAC 
OCTGTCAACA 
ACCCCAGTGT 
CTCCCGTGCA 
CAGGTGACAG 
GTTGGCCCTG 
TOGGTGAGCA 
GTAGCTACCT 
ACACATTTGT 
CCGAGGGATC 
CAGCTCATTC 
CACTACAACG 
ACCGCCCACC 
CACGACGGCA 
GAGCGCCTGC 
CACCGACTGT 
GGGACGGAGA 



11 
I 

GGTGGCTCCG 
GCCCCGGGCG 
TGGTGGCCCA 
TGCGCTGCAT 
TAGATCTTOG 
ACTTGAACAC 
AAGACTTGGA 
GGCAAGCATT 
AAACTTTGGA 
ACAACCGGAT 
TGCGACTGGA 
TGAAAACCTA 
GCATCCAGGG 
GGATCACCTC 
GCAGAGCCGA 
GCATGAAGAC 
CACAGGAGAC 
AGACGCAAGA 
AGCCACAGAA 
GCCACCCCCC 
CGCGGGTGAA 
GCGGAGAGTA 
TCATCGTCCA 
GCCAGACOGT 
CCAAGGGAGG 
TTAGAATCTC 
TCATCGGCTC 
TTGCCAGCAT 
GCTCCCAGGG 
AAAGTGGAAA 
CAGACGCAGG 
TGGTGCTCAG 
CCATCGTGGA 



CTTACACAGT 
AGGAGCATGT 
ACCTGGTGTC 
GGCGCGTGAA 
CCTGTAACAA 
TGAAATCCGT 
ACAACGGGCA 
CCGTCACACC 



21 
I 

TGCGTCCGAG 
CCGCTGCCTG 
GAAGCCGGGC 
GCATCTGCTG 
CTTTAACAGA 
ATTGCTTCTC 
AAATTTAAAA 
TAAGGGACTT 
CCCAGATTCG 
TACACATTTA 
CTCAAACACA 
CGCGGAGTCG 
ACGCTCAGTG 
CGAGCCCCAG 
AGGCAACCCC 
AGATTCCCGC 
AGACCAGGGT 
GGTGACCCTC 
TACAGAGGTG 
GCCGCGGATC 
CATCACGCCT 
TGCGTGCTCT 
GGCTCTTCCT 
GGATTTCCAG 
GAGCCAGCTC 
TGGTGTTGCC 
CCAGAAGGTC 
TCCCAGCGAC 
CGAGCCOGAG 
ATTTCACATC 
TCGCTATGAG 
TOTGAACGTT 
AGCGATTGCG 
TCCTCGTTCT 
TGAACAGGCA 
ACAGCATGGC 
TCCACAGTAC 
CAACTGCTCG 
CCTGCAGCAC 
GTACGAGAAT 
CGCCCTTCCC 
CGACGAGCAG 



31 
I 

CGTCCGTCCG 
TTGGCGCTCG 
GCAGGGTGTC 
CTGGAGGCCG 
ATCAGAGAGA 
AATAATAATC 
TATCTCTATC 
GCCTCTCTAG 
TTCCAGCATC 
GTTCCAGGGA 
CTTCACTGGG 
GGGAACGCGC 
GCAACCATCA 
GACGCAGATG 
AAGCCTGAGA 
CTAAACTTGC 
ATCTACCAGT 
AGGTACTTCG 
CTGGTTGGGG 
TCCTGGACGA 
TCTGGCGGGC 
GCGACCAACA 
CAGTTCACTG 
TGTGAAGCCA 
TCOGTGGACC 
CTCCACGACC 
GTGGCCCACC 
ACAACAGTGG 
CCAGCCATCA 
AGCCCTGAAG 
TGTGTGGCCC 
CCTGACGTCA 
ACTGTTGACA 
CCAAATGATT 
CGGGOGGGAG 
TTGATGGTCG 
CTGAACCTCA 
GACATGTGCT 
CCCATOTGGG 
GGCTTCAACA 
ATGCCGCGCC 
TTCACCCACA 



41 

I 

CGCCGTCGGC 
TGCTGTTCTG 
CGAGCOGCIG 
TGCCCGCCGT 
TCCAACCTGG 
AGATCAAGAG 
TGTACAAGAA 
AGCAACTATA 
TCCCGAAGCT 
CATTTAATCA 
ACTGTGAAAT 
AGGCAGCGGC 
CCCCGGAAGA 
TGACCTCGGG 
TCATCTGGCT 
TGGACGATGG 
GCATGGCAAA 
GGTCTCCAGC 
AGAGCGTCAC 
GAGGTGACCG 
TTTACATACA 
ACATTGACAG 
TOACGCCTCA 
AGGGCAACCC 
GGCGGCACCT 
AGGGCCAGTA 
TGACTGTGCA 
AGGTGGGCGC 
CCTGGAACAA 
GATTCTTGAC 
GGAACACCAT 
GTCGAAATGG 
GAGCTATAAA 
TGCTGGCCTT 
AAATCTTTGA 
ACCTCAACGG 
TCGCAAACCT 
TCCACCAGAA 
GCGCCTCGCT 
CCCCTCGGGG 
TGGTGTCCAC 
TGCTGATGCA 



51 
I 

CATGGCCAAG 
CGCCTGGGGG 
CCTGTGCTTC 
GGCGCCGCAG 
GGCATTCAGG 
GATACCTAGT 
TGAGATCCAG 
CCTGCACTTT 
CGAGAGGCTA 
CTTGGAATCT 
CCTGTGGTTG 
CATCTGTGAA 
GCTGAACTGT 
GAACACCGTG 
GCGAAACAAT 
GACCCTGATG 
GAACGTGGCC 
TCGACCCACT 
GCTGGAGTGC 
CACACCCTTG 
GAACGTCGTA 
CGTCCATGCC 
GGACAGAGTC 
GCCGCCCGTC 
GGTCCTGTCA 
CGAATGCCAG 
GCCCAGAGTC 
CAATGTGCAQ 
GGATGGGGTT 
CATCAATGAC 
TGGGTCGGCC 
AGATCOGTTT 
CTCAACCCGA 
GTTCCGGTAT 
ACGGACATTG 
AACAAGTTAC 
GTCGGGCTGT 
GTACCGGACG 
GACCGCCTTC 
CATCAACCCC 
CACCCTGATC 
GTGGGGCCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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TTCCTGOACC ACGACCTCGA CTOCACGGTG GTGGCCCTGA GCCA6GCA06 CTTCTCCGAC 2580 

GOACAGCACT GCAGCAAOGT GTGCAGCAAC GACCCCCCCT GCTTCTCTGT CATGATCCCC 2640 

CCCAATGACT CCCGGGCCAG GAGOGGGGCC CGCTGCATGT TCTTOGTGOG CTCCAGCCCT 2700 

GTGTGCGGCA GCGGCATGAC TTOGCTGCTC ATGAACTCCG TGTACCCGCG GGAGCAGATC 2760 

5 AACCAGCTCA CCTCCTACAT CGAOGCATCC AACGTGTACG GGAGCAOGGA GCATGAGGCC 2820 

CX3CAGCATCC GCGACCTGGC CAGCCACCGC GGCCTGCTGC GGCAGGGCAT CGTGCAGGGG 2880 

TCCGGGAAGC CGCTGCTCCC CTTCGCCACC GGGCCGCCCA CGGAGTGCAT GOGGGACGAG 2940 

AACGAGAGCC CCATCCCCTG CTTCCTGGCC GGGGACCACC GCGCCAAOGA GCAGCTGGGC 3000 

CTOACCAGCA TGCACACGCT GTGGTTCCGC GAGCACAACC GCATTGCCAC GGAGCTGCTC 3060 

10 AAGCTGAACC CGCACTGGGA OGGCGACACC ATCTACTATG AGACCAGGAA GATCOTGGGT 3120 

GCGGAGATCC AGCACATCAC CTACCAGCAC TGGCTCCGGA AGATCCTGGG GGAGGTGGGC 3180 

ATGAGGACGC TGGGAGAGTA CCACGGCTAC GACCCOGGCA TCAATGCTGG CATCTTCAAC 3240 

GCCTTCGCCA CCGOGGCCTT CAGGTTTGGC CACACGCTTG TCAAOCCACT GCTTTACCGG 3300 

CTGGACGAGA ACTTCCAGCC CATTGCACAA GATCACCTCC CCCTTCACAA AGCTTTCTTC 3360 

15 TCTCCCTTCC GGATTGTGAA TGAGGGGGGC ATCGATCCGC TTCTCAGGGG GCTGTTCGGG 3420 

GTGGOGGGGA AAATGCGTOT GCCCTCGCAG CTGCTGAACA CGGAGCTCAC GGAGCGGCTG 3480 

TTCTCCATGG CACACACGGT GGCTCTGGAC CTGGCGGCCA TCAACATCCA GCGGGGCCGG 3540 

GACCACGGGA TCCCACCCTA CCACGACTAC AGGGTCTACT GCAATCTATC GGCGGCACAC 3600 

ACGTTCGAGG ACCTGAAAAA TGAGATTAAA AACCCTGAGA TCCGGGAGAA ACTGAAAAGG 3660 

20 TTGTATGGCT OGACACTCAA CATCGACCTG TTTCCGGCGC TCGTGGTGGA GGACCTGGTG 3720 

CCTGGCAGCC GGCTGGGCCC CACCCTGATG TGTCTTCTCA GCACACAGTT CAAGCGCCTG 3780 

CGAGATGGGG ACAGGTTGTG GTATGAGAAC CCTGGGGTGT TCTCCCCGGC CCAGCTGACT 3840 

CAGATCAAGC AGACGTCGCT GGCCAGGATC CTATGCGACA ACGCGGACAA CATCACCCGG 3900 

GTGCAGAGOG ACGTGTTCAG GGTGGCGGAQ TTCCCTCACG GCTACGGCAG CTGTGAOGAG 3960 

25 ATCCCCAGGG TGGACCTCCG GOTGTGGCAG GACTGCTGTG AAGACTGTAG GACCAGGGGG 4020 

CAGTTCAATG OCTTTTCCTA TCATTTCOGA GGCAGAOGGT CTCTTGAGTT CAGCTACCAG 4080 

GAGGACAAGC CGACCAAGAA AACAAGACCA CGGAAAATAC CCAGTGTTGG GAGAGAGGGG 4140 

GAACATCTCA GCAACAGCAC CTCAGCCTTC AGCACACGCT CAGATGCATC TGGGACAAAT 4200 

GACTTCAGAG AGTTTGTTCT GGAAATGCAG AAGACCATCA CAGACCTCAG AACACAGATA 4260 

30 AAGAAACTTG AATCACGGCT CAGTACCACA GAGTGCGTGG ATGCCGGGGG CGAATCTCAC 4320 

GCCAACAACA CCAAGTGGAA AAAAGATGCA TGCACCATTT GTGAATGCAA AGACGGGCAG 4380 

GTCACCTGCT TCGTGGAAGC TTGCCCCCCT GCCACCTX3TG CTGTCCCCGT GAACATCCCA 4440 

GGGGCCTGCT GTCCAGTCTG CTTACAGAAQ AGGGCGGAGG AAAAGCC CTA GQ CTCCTGGG 4500 

AGGCTCCTCA GAGTTTGTCT GCTGTGCCAT CGTGAGATCG GGTGGCCGAT GGCAGGGAGC 4560 

35 TGCGGACTGC AGACCAGGAA ACACCCAGAA CTCGTGACAT TTCATGACAA CGTCCAGCTG 4620 

GTGCTGTTAC AGAAGGCAGT GCAGGAGGCT TCCAACCAGA GCATCTGCGG AOAAGGAGGC 4680 

ACAGCAGGTG CCTGAAGGGA AGCAGGCAGG AGTCCTAGCT TCACGTTAGA CTTCTCAGGT 4740 

TTTTATTTAA TTCTTTTAAA ATGAAAAATT GGTGCTACTA TTAAATTGCA CAGTTGAATC 4800 

ATTTAGGCGC CTAAATTGGT TTTGCCTCCC AACACCATTT CTTTTTAAAT AAAGCAGGAT 4860 

40 ACCTCTATAT GTCAGCCTTG CCTTGTTCAG ATGCCAGGAG CCGGCAGACC TGTCACCCGC 4920 

AGGTGGGGTG AGTCTCGGAG CTGCCAGAGG GGCTCACCGA AATCGGGGTT CCATCACAAG 4980 

CTATGTTTAA AAAGAAAATT GGTGTTTGGC AAACGGAACA GAACCTTTGA TGAGAGCGTT 5040 

CACAGGGACA CTCTCTGGGG GTGCAGTGCA AGCCCCCGGC CTCTTCCCTG GGAACCTCTG 51 00 

AACTCCTCCT TCCTCTGGGC TCTCTGTAAC ATTTCACCAC ACGTCAGCAT CTAATCCCAA 5160 

45 GACAAACATT CCCGCTGCTC GAAGCAGCTG TATAGCCTGT GACTCTCCGT GTGTCAGCTC 5220 

CTTCCACACC TGATTAGAAC ATTCATAAGC CACATTTAGA AACAGATTTG CTTTCAGCTG 5280 

TCACTTGCAC ACATACTGCC TAGTTGTGAA CCAAATGTGA AAAAACCTCC TTCATCCCAT 5340 

TGTGTATCTG ATACCTGCCG AGGGCCAAGG GTGTGTGTTG ACAACGCCGC TCCCAGCCGG 5400 

CCCTGGTTGC GTCCACGTCC TGAACAAGAG CCGCTTCCGG ATGGCTCTTC CCAAGGGAGG S460 

50 AGGAGCTCAA GTGTCGGGAA CTGTCTAACT TCAGGTTGTG TGAGTGCGTT 

Seq ID NO: 87 protein sequence: 
Protein Accession #j BAA13219 

1 11 21 31 41 51 

55 | | | | | | 

SR5WLRASE RPSAPSAMAK RSRGPGRRCL LALVLFCAKG TLAWAQKPG AGCPSRCLCF 60 

RTTVRCMHLL LKAVPAVAPQ TSILDLRFNR IREIQPGAPR RLRNLNTLLL NNNQIKRIPS 120 

GAPEDLENLK YLYLYKNEIQ StDRQAPKGL ASLEQLYLHF NQIBTLDPDS FQHLPKLERL 180 

FTiHNNRITHL VPGTPNHLES MKRLRLDSNT LHCDCEILWL ADLLKTYABS GNAQAAAICE 240 

60 YPRRIQGRSV ATITPEELNC ERPRITSSPQ DADVT8GNTV YPTCRABGNP KPEIIWLRNN 300 

NELSMKTDSR LNLIiDDGTLM IQNTQBTDQG IYQCMAKNVA GEVKTQEVTL RYFGSPARPT 360 

FVIQPQNTEV LVGESVTLEC SATGHPPPRI SWTRGDRTPL PVDPRVNITP SGGLYIQNW 420 

QGDSGEYACS ATNNIDSVHA TAPI ZVQALP QFTVTPQDRV VIEGQTVDFQ CEAKGNPPPV 480 

IAWTKGGSQL SVDRRHLVLS SGTLRISGVA LHDQGQYECQ AVNIIGSQKV VAHLTVQPRV 540 

65 TPVFASIPSD TTVEVGANVQ LPCSSQGEPE PAITWNKDGV QVTESGKPH1 SPEGPLTIND 600 

VGPADAGRYE CVARNTIGSA SVSMVLSVNV PDVSRNGDPF VATSIVEAIA TVDRAINSTR 660 

THLPDSRPRS PNDLUVLFRY PRDPYTVEQA RAGEIPERTL QLIQEHVQHG LMVDLNGTSY 720 

HYNDLVSPQY LNLIANLSGC TAHRRVNNCS DMCPHQKYRT HDGTCNKLQH PMWGASLTAF 780 

RRLLKSVYEN GFNTPRGINP HRLYNGHALP MPRLVSTTLI GTETVTPDEQ PTHMLMQWGQ 840 

70 FLDHDLDSTV VALSQARFSD GQHCSNVCSN DPPCFSVMIP PND5RAR8GA RCMPFVRSSP 900 

VCGSGMT8I1L MNSVYPREQI NQLT5YZDAS NVYGSTEHBA RSIRDLASKR GLLRQGIVQR 960 

SGKPLLPFAT GPPTBCKRDE NBSPIPCFLA GOHRAMEQL0 LTSMHTLWFR BHSRIATBLL 1020 

KLNPHWDGDT IYYETRKIVG AEIQHITYQH WLPKILGEVG MRTLGEYHGY DPGINAGIFN 1080 

AFATAAFRFG HTLVNPLLYR LDENFQPIAQ DHLPLHKAFF SPFRIVNEGG IDPLLRGLFG 1140 

75 VAGKMRVPSQ LLNTELTKRL PSMAHTVALD LAAINIQRGR DHGIPPYHDY RVYCKLSAAH 1200 

TFEDLKNBIK NPEIREKLKR LYGSTLNIDL FPALWEDLV PGSRLGPT1M CLLSTQFKRL 1260 
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RDGDRIiWYEW PGVFSPAQLT QIKQTSLARI LCDNADNITR VQSDVFRVAF. FPHGYGSCDB 1320 

IPRVDLRVWQ DCCBDCRTEG QFNAFSYHFR GRRSLBFSYQ BDXPTKKTRP RKIPSVGRQG 1380 

EHLSNSTSAF STRSDASGTN DPRBPVLEMQ KTTTDLRTQI KKLBSRLSTT ECVUAGGESH 1440 
ANNTKWKKDA CTICBCKDGQ VTCFVEACPP ATCAVPVNTP GACCPVCLQK EABSKP 



Seq ID NO: 86 DNA sequence 

Nucleic Acid Accession #: 834.1 

Coding sequence: 80*3577 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 SI 

1 I I I I I 

AATTCGAGOA TCCOOGTACC ATGGCACAGA GCGACAGAQA CATTTATTOT TATTTQTTTT 60 

TTGGTGGCAA AAAGGGAAAA_XGGCGAACGA CTCCCCTGCA AAAAGTCTGG TGOACATCGA 120 

CCTCTCCTCC CTSCGGGATC CTGCTGGQAT TTTTQAOCTQ GTGGAAGTGG TTGGAAATGG 180 

CACCTATGGA CAAGTCTATA AGGGTCGACA TGTTAAAACG GGTCAGTTGG CAGCCATCAA 240 

AGTTATGGAT QTCACTGAGG ATGAAGAGGA AGAAATCAAA CTGGAGATAA ATATGCTAAA 300 

GAAATACTCT CATCACAGAA ACATTGCAAC ATATTATQGT GCTTTCATCA AAAAGAGCCC 360 

TCCAGGACAT GATGACCAAC TCTGQCTTGT TATGGAGTTC TGTGGGGCTG GGTCCATTAC 420 

AGACCTTGTG AAGAACACCA AAGGGAACAC ACTCAAAGAA GACTGGATCG CTTACATCTC 460 

CAGAGAAATC CTGAGGGGAC TGGCACATCT TCACATTCAT CATGTGATTC ACCGGGATAT 540 

CAAGGGCCAG AATGTGTTGC TGACTGAGAA TGCAGAGGTG AAACTTGTTG ACTTTGGTGT 600 

GAGTGCTCAG CTGGACAGGA CTOTGGGGCG GAGAAATAOG TTCATAGGCA CTCCCTACTG 660 

GATGGCTCCT GAGGTCATCG CCTGTGATGA GAACCCAGAT GCCACCTATG ATTACAGAAG 720 

TGATCTTTGG TCTTGTGGCA TTACAGCCAT TGAGATGGCA GAAGGTGCTC CCCCTCTCTG 780 

TGACATGCAT CCAATGAGAG CACTGTTTCT CATTCCCAGA AACCCTCCTC CCCGGCTGAA 840 

GTCAAAAAAA TGGTCGAAGA AGTTTTTTAG TTTTATAGAA GGGTGCCTGG TGAAGAATTA 900 

CATGCAGCGG CCCTCTACAG AGCAGCTTTT GAAACATCCT TTTATAAGGG ATCAGCCAAA 960 

TGAAAGGCAA GTTAGAATCC AGCTTAAGGA TCATATAGAT CGTACCAGGA AGAAGAGAGG 1020 

CGAGAAAGAT GAAACTGAGT ATGAGTACAG TGGGAGTGAG GAAGAAGAGG AGGAAGTGCC 1080 

TGAACAGGAA GGAGAGCCAA GTTCCATTGT GAACGTGCCT GGTGAGTCTA CTCTTCGCCG 1140 

AGATTTCCTG AGACTGCAGC AGGAGAACAA GGAACGTTCC GAGGCTCTTC GGAGACAACA 1200 

GTTACTACAG GAGCAACAGC TCCGGGAGCA GGAAGAATAT AAAAGGCAAC TGCTGGCAGA 1260 

GAGACAGAAG OGGATTGAGC AGCAGAAAGA ACAGAGGCGA CGGCTAGAAG AGCAACAAAG 1320 

GAGAGAGCGG GAGOCTAGAA GGCAGCAGGA AOGTGAACAG CGAAGGAGAG AACAAGAAGA 1380 

AAAGAGGCGT CTAGAGGAGT TGGAGAGAAG GCGCAAAGAA GAAGAGGAGA GGAGACGGGC 1440 

AGAAGAAGAA AAGAGGAGAG TTGAAAGAGA ACAGGAQTAT ATCAGGCGAC AGCTAGAAGA 1500 

GGAGCAGCGG CACTTGGAAG TCCTTCAGCA GCAGCTGCTC CAGGAGCAGG CCATGTTACT 1560 

GCATGACCAT AGGAGGGCGC ACCCGCAGCA CTCGCAGCAG COGCCACCAC CGCAGCAGGA 1620 

AAGGAGCAAG CCAAGCTTCC ATGCTCCCGA GCCCAAAGCC CACTACGAGC CTGCTGACCG 1680 

AGCGCGAGAG GTTCCTGTGA GAACAACATC TCGCTCCCCT GTTCTGTCCC GTCGAGATTC 1740 

CCCACTGCAG GGCAGTGGGC AGCAGAATAG CCAGGCAGGA CAGAGAAACT CCACCAGTAT 1800 

TGAGCCCAGG CTTCTGTGGG AGAGAGTGGA GAAGCTGGTG CCCAGACCTG GCAGTGGCAG I860 

CTCCTCAGGG TCCAGCAACT CAGGATCCCA GCCCGGGTCT CACCCTGGGT CTCAGAOTGG 1920 

CTCCGGGGAA CGCTTCAGAG TGAGATCATC ATCCAAGTCT GAAGGCTCTC CATCTCAGCG 1980 

CCTGGAAAAT GCAGTGAAAA AACCTGAAGA TAAAAAGGAA GTTTTCAGAC CCCTCAAGCC 2040 

TGCTGGCGAA GTGGATCTGA CCGCACTGGC CAAAGAGCTT OGAGCAGTGG AAGATGTACG 2100 

GCCACCTCAC AAAGTAACGG ACTACTCCTC ATCCAGTGAG GAGTOGGGGA CGACGGATGA 2160 

GGAGGACGAC GATGTGGAGC AGGAAGGGGC TGACGAGTCC ACCTCAGGAC CAGAGGACAC 2220 

CAGAGCAGCG TCATCTCTGA ATTTGAGCAA TGGTGAAACG GAATCTGTGA AAACCATGAT 2280 

TGTCCATGAT GATGTAGAAA GTGAGCCGGC CATGACCCCA TCCAAGGAGG GCACTCTAAT 2340 

CGTCCGCCAG ACTCAGTCCG CTAGTAGCAC ACTCCAGAAA CACAAATCTT CCTCCTCCTT 2400 

TACACCTTTT ATAGACCCCA GATTACTACA GATTTCTCCA TCTAGCGGAA CAACAGTGAC 2460 

ATCTGTGGTG GGATTTTCCT GTGATGGGAT GAGACCAGAA GCCATAAGGC AAGATCCTAC 2520 

CCGGAAAGGC TCAGTGGTCA ATGTGAATCC TACCAACACT AGGCCACAGA GTGACACCCC 2580 

GGAGATTCGT AAATACAAGA AGAGGTTTAA CTCTGAGATT CTGTGTGCTG CCTTATGOGG 2640 

AGTGAATTTG CTAGTGGGTA CAGAGAGTGG CCTGATGCTG CTGGACAGAA GTGGCCAAGG 2700 

GAAGGTCTAT CCTCTTATCA ACCGAAGACG ATTTCAACAA ATGGACGTAC TTGAGGGCTT 2760 

GAATGTCTTG GTGACAATAT CTGGCAAAAA GGATAAGTTA CGTGTCTACT ATTTGTCCTG 2820 

GTTAAGAAAT AAAATACTTC ACAATGATCC AGAAGTTGAG AAGAAGCAGG GATGGACAAC 2880 

CGTAGGGGAT TTGGAAGGAT GTGTACATTA TAAAGTTGTA AAATATGAAA GAATCAAATT 2940 

TCTGGTGATT GCTTTGAAGA GTTCTGTGGA AGTCTATGCG TGGGCACCAA AGCCATATCA 3000 

CAAATTTATG GCCTTTAAGT CATTTGGAGA ATTGGTACAT AAGCCATTAC TGGTGGATCT 3060 

CACTGTTGAG GAAGGCCAGA GGTTGAAAGT GATCTATGGA TCCTGTGCTG GATTCCATGC 3120 

TGTTGATGTG GATTCAGGAT CAGTCTATGA CATTTATCTA CCAACACATG TAAGAAAGAA 3180 

CCCACACTCT ATGATCCAGT GTAGCATCAA ACCCCATGCA ATCATCATCC TCCCCAATAC 3240 

AGATGGAATG GAGCTTCTGG TGTGCTATGA AGATGAGGGG GTTTATQTAA ACACATATGG 3300 

AAGGATCACC AAGGATGTAG TTCTACAGTG GGGAGAGATG CCTACATCAG TAG CAT AT AT 3360 

TCGATCCAAT CAGACAATGG GCTGGGGAGA GAAGGCCATA GAGATCCGAT CTGTGGAAAC 3420 

TGGTCACTTG GATGGTGTGT TCATGCACAA AAGGGCTCAA AGACTAAAAT TCTTGTGTGA 3480 

ACGCAATGAC AAGGTGTTCT TTGCCTCTGT TCGGTCTGGT GGCAGCAGTC AGGTTTATTT 3540 

CATGACCTTA GGCAGGACTT CTCTTCTGAQ CTGGTAGAAG CAGTGTGATC CAGGGATTAC 3600 

TGGCCTCCAG AGTCTTCAAG ATCCTGAGAA CTTGGAATTC CTTGTAACTG GAGCTCGGAG 3660 

CTGCACCGAG GGCAACCAGG ACAGCTGTGT GTGCAGACCT CATGTGTTGG GTTCTCTCCC 3720 

CTCCTTCCTG TTCCTCTTAT ATACCAGTTT ATCCCCATTC TTTTTTTTTT TCTTACTCCA 3780 
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AAATAAATCA AGQCTGCAAT GCAOCTGGTG CTGTTCAGAT TCCAAAAAAA AAAAAAAACC 384 0 
ATGGTACCCG GATCCTCGAA TTCC 



Seq ID No: 89 Protein sequence: 
Protein Accession if: NP_004825.1 



1 11 21 31 41 51 

I I 1 I I I 

MANDSPAK6L VDIDLSSLRD PAQIPRLVBV VGNOTVGQVY XGRHVKTGQL AAIKVMDVTH 
DEEEEIKLBI NMLKKYSHHR NIATYYQAFI KKSPPGHDDQ LWLVMBFCGA GSITDLVKNT 
KjGNTLKEDWI AYISRSILRG IAHLHIHHVI HRDIKGQNVL LTENAEVKLV DFGVSAQLDR 
TVGRRNTPIG TPYWMAPBVI ACDENPDATY DYRSDLWSCQ ITAIBMAEGA PPLCDMHPMR 
AtiFLIPRNPP PKLKSKKHSK KFPSFIBGCL VKNYMQRP8T BQLLKHPPIR DQPNERQVRI 
QLKDHIDRTR KKRGEKDETE YBYSGSEEEB BBVPEQEGEP SSIVNVPGES TLRRDPLRLQ 
QKNKBRSEAL RRQQIdVQEQQ LREQEEYKRQ LLAERQKRIB QQKBQRRRLB EQQRREREAR 
RQQEREQRRR EQBEKRRIiEE LERRRKEBBE RRRAEEBKRR VEREQBYIRR QLEEBQRHLE 
VLQQQLLQEQ AMLLHDHRRP HPQHSQQPPP PQQERSKPSP HAPBPKAHYE PADRARSVPV 
RTT8RSPVLS RRDSPLQGSG QQNSQAGQRN STSIBPRLLW BRVEKLVPRP GSGSS5GSSN 
SGSQPGSHPG SQSGSGERPR VR8SSKSEGS PSQRLEKAVK KPEDKKEVFR PLKPAGEVDL 
TALAKBLRAV EDVRPPHKVT DYSSSSBESG TTDEEDDDVE QEGADESTSG PEDTRAASSL 
NLSNGBTESV KTMIVHDDVE SBPAMTPSKE GTLIVRQTQS ASSTLQKHKS SSSPTPPIDP 
RLLQISPSSG TTVTSWGFS CDGMRPBAIR QDPTRKGSW NVNPTNTRPQ SDTPBIRKYK 
KRFNSEILCA ALWGVNLLVG TESGLMLLDR SGQGKVYPLI NRRRPQQMDV LEGLNVLVTI 
SGKKDKLRVY YLSWLRNKIL HNDPBVEKKQ GWTTVGDLEG CVHYKWKYE RIKPLVIALK 
SSVEVYAMAP KPYHKFMAFK SFGELVHKPL LVDLTVEBGQ RUCVIYGSCA GFHAVDVDSG 
SVYDIYLPTH VRKNPHSMIQ CSIKPHAIII LPNTDGMBU, VCYBDEGVYV NTYGRITKDV 
VLQWGEMPTS VAYIRSNQTM GWGEKAIEIR SVBTGHLDGV PMHXRAQRLK FLCERKDKVF 
FASVRSGGSS QVYFMTLGRT SLLSW 



Seq ID NO: 90 DNA sequence 
Nucleic Acid Accession #< none found 
Coding sequence: 2-71 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 I I I I I 

TTACACTTCA ATTCCTTACA OGOTATTTCA AACAAACAGT TTTGCTGAGA GGAGCTTTTG 60 

TCTCTCCT1A_AGAAAATGTT TATAAAGCTG AAAGGAAATC AAACAGTAAT CTTAAAAATG 120 

AAAACAAAAC AACCCAACAA CCTAGATAAC TACAGTGATC AGGGAGCACA GTTCAACTCC 180 

TTGTTATGTT TTAGTCATAT GGCCTACTCA AACAGCTAAA TAACAACACC AGTGGCAGAT 240 

AAAAATCACC ATTTATCTTT CAGCTATTAA TCTTTTGAAT GAATAAACTG TGACAAACAA 300 

ATTAACATTT TTGAACATGA AAGGCAACTT CTGCACAATC CTGTATCCAA GCAAACTTTA 380 

AATTATCCAC TTAATTATTA CTTAATCTTA AAAAAAATTA GAACCCAGAA CTTTTCAATG 420 

AAGCATTTGA AAGTTGAAGT GGAATTTAGG AAAGCCATAA AAATATAAAT ACTGTTATCA 480 

CAGCACCAGC AAGCCATAAT CTTTATACCT ATCAGTTCTA TTTCTATTAA CAGTAAAAAC 540 

ATTAAGCAAG ATATAAGACT ACCTGCCCAA GAATTCAGTC TTTTTTCATT TTTGTTTTTC 600 

TCAGTTCTGA GGATGTTAAT OGTCAAATTT TCTTTGGACT GCATTCCTCA CTACTTTTTG 660 

CACAATGGTC TCACGTTCTC AC AT TT GTT C TCGCGAATAA ATTGATAAAA GGTGTTAAGT 720 

TCTGTGAATG TCTTTTTAAT TATGGGCATA ATTGTGCTTG ACTGGATAAA AACTTAAGTC 780 

CACCCTTATG TTTATAATAA TTTCTTGAGA ACAGCAAACT GCATTTACCA TCGTAAAACA 840 

ACATCTGACT TACGGGAGCT GCAGGGAAGT GGTGAGACAG TTOGAACGGC TCCTCAGAAA 900 

TCCAGTGACC CAATTCTAAA GACCATAGCA CCTGCAAGTG ACACAACAAG CAGATTTATT 960 

ATACATTTAT TAGCCTTAGC AGGCAATAAA CCAAGAATCA CTTTGAAGAC ACAGCAAAAA 1020 

GTGATACACT CCGCAGATCT GAAATAGATG TGTTCTCAQA CAACAAAGTC CCTTCAGAAT 1080 
CTTCATGTTG CATAAATGTT ATGAATATTA ATAAAAAGTT GATTGAGA 



Seq ID No: 91 Protein sequence? 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

YTSIPYTVPQ TNSFAERSFC LSL 



Seq ID NO: 92 DNA sequence 
Nucleic Acid Accession #: NM_003 706.1 
Coding sequence i 310-1935 (underlined sequences correspond to start and atop codons) 



1 11 21 31 41 51 

I I I I I I 

CACGAGGCAG GGGCCATTTT ACCTCCAGGT TGGCCCTGCT CAGGACCAGG AGGAAACACC 60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
640 
900 
960 
1020 
1080 
1140 
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TCCAGCCCGC GACCTCCTCC CACAGGGGGA AAAGGAAAGC AGGAGGACCA CAGAAGCTTT 120 

GGCACCGAGG ATCCCCGCAO TCTTCACCCG CGGAGATTCC GGCTGAAGGA GCTGTCCAjOC 180 

GACTACACCG CTAAGCGCAG GGAGCCCAAG CCTCCGCACC GGATTCCGGA GCACAAGCTC 240 

CACCOCGCAT GCGCACACGC CCCAGACCCA GGCTCAGGAG GACTGAGAAT TTTCTGACCG 300 

CAGTGCACCA TGQGAAGCTC TGAAGTTTCC ATAATTCCTG GGCTCCAGAA AGAAGAAAAG 360 

GOGGCOGTGG AGAGACGAAG ACTTCATGTG CTGAAAGCTC TGAAGAAGCT AAGGATTGAG 420 

GCTQATGAGG CCCCAGTTGT TGCTGTGCTG GGCTCAGGCG GAGGACTGCG GGCTCACATT 480 

GCCTGCCTTG GGGTCCTGAQ TGAGATGAAA GAACAGGGCC TGTTGGATGC CGTCACGTAC 540 

CTCGCAGGGG TCTCTGGATC CACTTGGGCA ATATCTTCTC TCTACACCAA TGATGGTGAC 600 

ATGGAAGCTC TCGAGGCTGA CCTGAAACAT CGATTTACCC GACAGGAGTG GGACTTGGCT 660 

AAGAGCCTAC AGAAAACCAT CCAAGCAGCG AGGTCTGAGA ATTACTCTCT GACCGACTTC 720 

TGGGCCTACA TGGTTATCTC TAAGCAAACC AGAGAACTGC CGGAGTCTCA TTTGTCCAAT 780 

ATGAAGAAGC CCGTGGAAGA AGGGACACTA CCCTACCCAA TATTTGCAGC CATTGACAAT 840 

GACCTGCAAC CTTCCTGGCA GGAGGCAAGA GCACCAGAGA CCTGGTTCGA OTTCACCCCT 900 

CACCACGCTG GCTTCTCTGC ACTGGGGGCC TTTGTTTCCA TAACCCACTT CGGAAGCAAA 960 

TTCAAGAAGG GAAGACTGGT CAGAACTCAC CCTGAGAGAG ACCTGACTTT CCTGAGAGGT 1020 

TTATGGGGAA GTGCTCTTGG TAACACTGAA GTCATTAGGG AATACATTTT TGACCAGTTA 1080 

AGGAATCTGA CCCTGAAAGG TTTATGGAGA AGGGCTGTTG CTAATGCTAA AAGCATTGGA 1140 

CACCTTATTT TTGCCCGATT ACTGAGGCTG CAAGAAAGTT CACAAGGGGA ACATCCTCCC 1200 

CCAGAAGATG AAGGCGGTGA GCCTGAACAC ACCTGGCTGA CTGAGATGCT CGAGAATTGG 1260 

ACCAGGACCT CCCTGGAAAA GCAGGAGCAG CCCCATGAGG ACCCCGAAAG GAAAGGCTCA 1320 

CTCAGTAACT TGATGGATTT TGTGAAGAAA ACAGGCATTT GCGCTTCAAA GTGGGAATGG 1380 

GGGACCACTC ACAACTTCCT GTACAAACAC GGTGGCATCC GGGACAAGAT AATGAGCAGC 1440 

CGGAAGCACC TCCACCTGGT GGATGCTGGT TTAGCCATCA ACACTCCCTT CCCACTCGTG 1500 

CTGCCCCCGA CGCGGGAGGT TCACCTCATC CTCTCCTTOG ACTTCAGTGC CGGAGATCCT 1560 

TTCGAGACCA TCCGGGCTAC CACTGACTAC TGCCGCOQCC ACAAGATCCC CTTTCCCCAA 1620 

GTAGAAGAGG CTGAGCTGGA TTTGTGGTCC AAGGCCCOCQ CCAGCTGCTA CATCCTGAAA 1680 

GGAGAAACTG GACCAGTGGT GATACATTTT CCCCTGTTCA ACATAGATGC CTGTGGAGGT 1740 

OATATTGAGG CATGGAGTGA CACATACGAC ACATTCAAGC TTGCTGACAC CTACACTCTA 1800 

GATGTGGTGG TGCTACTCTT GGCATTAGCC AAGAAGAATG TCAGGGAAAA CAAOAAGAAG 1860 

ATCCTTAGAG AGTTGATGAA CGTGGCCGGG CTCTACTACC CGAAGGATAG TGCCCGAAGT 1920 

TGCTGCTTGG CATAGATQAQ CCTCAGCTTC CAGGGCACTG TGGGCCTGTT GGTCTACTAG 1980 

GGCCCTGAAG TCCACCTGGC CTTCCTGTTC TTCACTCCCT TCAGCCACAC GCTTCATGGC 2040 

CTTGAGTTCA CCTTGGCTGT CCTAACAGGG CCAATCACCA GTGACCAGCT AGACTGTQAT 2100 

TTTGATAGCG TCATTCAGAA GAAGGTGTCC AAGGAGCTGA AGGTGGTGAA ATTTGTCCTG 2160 

CAGGTCCCTC GGGAGATCCT GGAGCTGGAG CATGAGTGTC TGACAATCAG AAGCATCATG 2220 

TCCAATGTCC AGATGGCCAG AATGAATGTG ATAGTTCAGA CCAATGCCTT CCACTGCTCC 2280 

TTTATGACTG CACTTCTAGC CAGTAGCTCT GCACAAGTTA GCTCTGTAOA AGTAAGAACT 2340 

TGGGCTTAAA TCATGGGCTA TCTCTCCACA GCCAAGTGGA GCTCTGAGAA TACAACAAGT 2400 

GCTCAATAAA TGCTTGCTGA TTGACTGATG AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 



Seq ID No: 93 Protein sequence i 
Protein Accession #: NP_003697.1 



1 11 21 31 41 51 

I I I i I I 

MGSSEVSIIP GLQKEEKAAV ERRRLHVLKA LKKLRIEADE APWAVLGSG GGLRAHIACL 60 

GVLSEKKEQG LLDAVTYIAG VSGSTWAISS LYTNDGDMKA LEADLKHRPT RQEWDLAKSL 120 

QKTIQAARSE NYSLTDFWAY MVISKQTREL PBSHLSNNKK PVBBGTLPYP IPAAIDNDLQ 180 

PSWQEARAPE TWPEFTPHHA GFSALGAFVS ITHPGSKFKK GRLVRTHPER DLTFLRGLWG 240 

SALGNTEVIR EYIFDQLRNL TLKGLWRRAV AKAKSIGHLI FARLLRLQBS SQGEHPPPSD 300 

EGGBPEHTWL TEMIiENWTRT SLEKQEQPHE DPERKGSLSN LMDFVKKTGI CASKWEWGTT 360 

HNFLYKHGGI RDKIMfiSRKH LHLVDAGLA1 NTPFPLVLPP TREVHLIL9F DPSAGDPFET 420 

IRATTDYCRR HKIPFPQVEB AELDLWSKAP ASCYILKGET GPWIHFPLF NIDACGGDIE 480 

AWSDTYDTFK LADTYTLDW VLLLALAKKN VRENKKKILR BLMNVAGLYY PKDSARSCCL .540 
A 



Seq ID NOi 94 DNA sequence 
Nucleic Acid Accession #: AK027351 

Coding sequence: 1-642 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I I 

AGGGAAAAAA ACTCCATTAA AAAGCCCAGC 
TGAGAAAGAT TTAGCAAAAT TCCACCGTAT 
AGTAAAACCC TCAGGCTGCT GAAATTTCTA 
AAATGAGGGT TTCTTAACTC ACACTQAGAO 
CTCAAGTGTG TGTTACCTTT CTTTACCAGC 
GGACATGTCT GTATGATCCA AGGTACCCAA 
TGGCTTTCTO CCGCTTCATG TGCTTTGGAA 
CCCAGCAGCT GGAGCCGCAA GAATGAACTG 
CAGGGGCAAC GACGAGAAGA AGATGTTGAA 



31 41 . 51 

I I I 

TTTCCTCCAT GTTAGATGTG ACTTGGAAAA 60 

CTTTTGCCAG GCTAGAGACA GGGAGAGCAG 120 

GGCTGTTAGG AAGCCCCTCG AATTCTGTGA .180 

CGGAAAGGGG CAGACCCTTT TCATAACTCC 240 

ATGGTAAGCA ACAGGACATA TCCCAGCCTC 300 

AGTCAGACAG AGTAAACTCA AGCCTGGCAC 360 

AAAGCAGGAG AAGCAATAGC AGCAGGAGTC 420 

CAAAGAGGGA ACTGACAGCA GCTGCGGCTG 480 

GTGTGTGGTG GTGGGGGAOG GTGCOGTGGG 540 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



GAAAACCTGC 
TGTGTTTGAC 
GTATGACACC 
GGATQTGTTT 
GGAATGGGTC 
CCAGATTGAT 
ACCTCTCACT 
GGAATGTTCA 
CATTTTCCAC 
TATCTGAGGT 
TCTGTGGCCA 
GAGGCTTGCC 
ACOACCTCCC 
GATTGCAGAC 
TTACAAATCC 
AAGAAACTTG 
CAATACTAAC 
TTAOTTCAAG 
AGCCTAAAGA 
TTAQAACCAA 
GTTTTGGAAA 
GTTGCAGGTC 
TTTTTCCATG 
TGTTTTTCAT 
TGTATTCAGT 
' CCTTQATTTC 
ATAAAAGAAA 
TQCCTTAATT 
CTCATTCCAT 
TTCCTTACTC 
TATCCATTTG 
AAAGCAAAAG 
CAGGAATTTT 
CTTCTAATTA 
TTTTTTTCAG 
ACTTTCCAOA 
CTGGAATAAG 
CAACACTTAG 
GGGTAATATA 
ACACATACCT 
AAGGTAAGGC 
AGAACCTGTC 
CCATCTAATT 
AGTTTTCAGT 
AATCAACTGT 
AATGAACTTT 
ATAGTTATAT 
CAGTATGTTT 
TAATGGGCCA 
ACTGGATAGA 
CTAATTACTG 



CTGCTGATGA 
CACTATGCAG 
GCGGGACAGG 
TTGATCTGCT 
CCCGAGCTCA 
CTCCGTGATG 
TACGAGCATG 
GCTCTGACTC 
COCAAGAAAA 
TGTCTGGGAC 
AGCTCCAGCC 
CCATCACCCT 
TGCCAOCCAG 
AGTGCOGCTO 
CCAGCTCATG 
ATTCCTCTAT 
CTTTTTTTCT 
AAATGTACTA 
CAAGGCATTT 
CACGTACCTC 
TTTACAGAGC 
CACATTTTTG 
TATTGCCACG 
CCAAGTTGAT 
AAAGTGGACA 
ACTTTCTCTC 
GATCTGAGCA 
TTTCTATTTG 
GTAAATGACA 
ATCCTCCCAA 
TCCCCTGCCC 
CTGCCAGTAA 
TGTATCATAG 
GTCTTAGTTG 
TTGCATGAGC 
TTCAAOCTCC 
CACTTAAGAA 
GGAGGCTGAG 
GTGAGATCCT 
GTAGTCCCAG 
TGCAGTGAGC 
AAAAAAAAAA 
GCTAAAGATT 
CAAAGCTTGA 
GTATGAACTA 
TAGTCCTGTA 
CTTGCTAAOG 
CCAGGTGTGT 
CAGGAAGTAA 
GACTGTTGTT 
AATAAAACAA 



GCTACGCCAA 
TTACTGTGAC 
AGGACTACAA 
TCTCTGTCGT 
AGGACTGCAT 
ACCCAAAAAC 
GTGTGAAOCT 
AGAAAOGTCT 
AGAAGAAAOG 
CTGCCTCCAC 
AAAAAGGAGG 
CTGAGCCCTC 
AAGCATCCGT 
CTGATCOCAT 
AACGTGAAGC 
TGCTGGCCTT 
GAATCTGCTG 
ATTTOCAGTT 
TATATTCATT 
TGAATGCCCG 
CATGATTTTT 
CCAAAGATAC 
ACAAACTAAA 
TGGGGGAAGA 
TTCCTGCTCC 
ATGCCOGGAT 
TAAAGATACG 
CTTCAACTGA 
TTTTCCAGTT 
ATGTCTTTGT 
TCCACAATGT 
GGAATCCTGT 
AGCGAATTAC 
CTTATAAGTG 
AAAGTGCTTC 
CACTGTTGGA 
TTGCGTGATA 
GTGGGTGGGC 
GTGTCTCTAT- 
CTACTCAGGA 
TGTGACTGTG 
AAAAAACAAC 
TTCTTTCATA 
CATTTAQAGA 
TAACTCTGCA 
ATAAATGAAA 
GGCCACTCAT 
GCACTCAACA 
GTTGATCTTG 
AGTGTCTGGG 
TCTAGAACAA 



CGACGCCTTC 
TGTGGGAGGC 
CCAGCTGAGG 
AAACCCTGCC 
GCCTCACGTG 
CTTGGCCCGT 
CGCAAAAGOG 
CAAAGOGGTT 
CTGTTCTGAG 
CCCATCCAGG 
GCACGACCAG 
CCAACACAGC 
ACTGCACQCT 
CAAAAACAAA 
TGATAGGAAA 
ACTTGATGTC 
TTCTACCCAT 
CACTCAGGCC 
TCTATTTTCA 
ATTATAAOAA 
GAACCTAATT 
ACTCTATAGA 
AATGAACTGT 
ATATGGCAGG 
TCCCTTCCCC 
CCTTTTATTC 
TGTTTAAAAA 
AAGTGCTTCT 
ACAACTGGTA 
GGGAGCCATA 
GTGACATAGA 
GCCCAATGAT 
TTCCTATCTT 
CCCTGGAATC 
TTAGTAGTGT 
AAAAGCCAGC 
GCCAGGCACC 
CGCTTGAGCT 
AAAAAAATTA 
OGCTGAGGTG 
CCACTACACT 
CTACATTTCA 
CGCACACACT 
AAACAAGGAC 
GAGGTTATGA 
TGTTATTAGG 
TTCTCACTGA 
GGCAAATAGC 
ATGGGGAGAT 
TAGAGCACAG 
AGCAA 



CCAGAGGAAT 
AAGCAACACT 
CCACTCTCCT 
TCTTACCACA 
CCTTATGTCC 
TTGCTGTATA 
ATCGGAGCAC 
TTTGATOAAG 
GGTGACAGCT 
GATGAGAATG 
AAAGGAACTC 
ACACTAGTCA 
GTCTGAGAAT 
OTCAAAOGCC 
TCACCCCAGG 
TTTTATAAAA 
GTGTCTCACA 
TTACTAATOC 
GCATGTTTCT 
GACATGAGAA 
GAAAGAAAAC 
TGCTTAGTAQ 
GTTTAAGAAT 
ATCCATCTTT 
CATTGCATGC 
TCCCCAGTTA 
TAACTAAAAG 
CAGCTCGCCC 
CTGAGATTTT 
TCAGTGGATA 
ACAGGGACTT 
GTAAAACAAT 
TTCATTAGAG 
ACCCAGGTAG 
GAAATTACAA 
CTTTCTAATC 
GTGGCTCATG 
CAGGAGTTCA 
AAAATTAGTC 
GAAGGATCAC 
CCAGCCTGAG 
AGTACTATTT 
CCAGTGACTG 
TTTCTGCCTT 
ATTCATCCTT 
CAGCTTTGTT 
TGTGGATGAA 
TCCCGAGGTC 
CACGTCACCC 
GCTCCCAGGG 



ACGTGCCCAC 
TGCTCGGACT 
ACCCCAACAC 
ATGTCCAGOA 
TCATAGGGAC 
TGAAAGAGAA 
AGTGCTACTT 
CAATCCTCAC 
GCTGTTCAAT 
GCAGCCAATC 
CCTTTGCACG 
GCCCACTGCC 
GCTGGGCCTG 
ATCTCACATT 
GAACCOGAAA 
CTTGGGACTA 
TTCATTTGTA 
ATACCAAATT 
ACCAAAGCTA 
GACTTTAAAA 
CATCTGAATT 
TGGCCTQATT 
GTAGTATTTC 
TACAGTATTT 
CCTCTTCCTC 
TAACCCAGTT 
TAAAGGAAAG 
CATGTAAGTT 
GCCTCTCTCT 
CCAAGCTCTG 
TGGCCCTGGG 
TCCAAACATC 
GCTATGAGQA 
GCACTTAATT 
CAACTTTAAG 
TCTTCTGCTA 
CCTGTAATCC 
AGACCAGCCT 
AGTTGTAGTG 
TTGAGCCCAG 
TGACAGAGAA 
CCCTTCTCTC 
GAAAAACGGG 
TATAAATGGA 
TACAAACAAT 
GCATGATTGC 
AAAATGAGAG 
ACCACTTCCC 
AGAACCAGCA 
GTCTTAAGAG 



600 
660 
720 
700 
840 
900 
960 
1020 

ioeo 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1S60 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 



55 



Seq ID No: 9S Protein sequence: 
Protein Accession ft: CAC066U.1 



1 11 21 31 41 51 

I I I I I I 

MNCKEGTDSS CGCRGNDKKK MLKCVWGDG AVGKTCLLMS YANDAPPEEY VPTVPDHYAV 
60 TVTVGGKQHL LGLYDTAGQE DYNQLRPLSY PNTDVFLICP SWNPASYHN VQEEWVPBLK 
DCMPHVPYVL IGTQIDLRDD PKTIARIiLYM KEKPLTYBHG VKLAKAIGAQ CYLECSALTQ 
KGLKAVPDEA ILTIPHPKKK KKRCSBGHSC CSII 



60 
120 
180 



65 .Seq ID NO: 96 pNA sequence 

Nucleic Acid Accession #: NM_003654.1 

Coding sequence: 367-1602 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

70 | | | | | | 

GGGGAGGGCG CGGGAGGCGG AGGATGCCGC CGCGGCTGCT GCOGCCGCCO CCACCCGCGG 60 

GTCCCCGGCG ACCCTACTCC AGACCCGAGG ATGGAGCCGG CGCTGGGCGC TGCAGCTGCT 120 

CCCGGCGCGT CCCCGACCAG GTAGCTGGTG TCACTTOGGT GTGGTTGGAA GAAGACTTTC 180 

TCCCCAGCTG CATTCCCGGA GGCQCCCTTT CGACCTGGAG GCCGGOTCTG CTGGCCACAG 240 

75 GGCTGCCGCA CTGGCTGGGA CTGCCAGCTG GGCCTGGAGA CGCTGGTGGC TGTGGACTCC 300 

CCAGCTTGGA GCAGTCCCTC TTTGACCTCA CCCCTTGGAG AAGCAGCCCC ATGAAOGTGC 360 



225 



WO 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



CCAGCCATGC 
CAGTACACGG 
GAGGCCGGGC 
CGCAAGACCC 
CTCTTCAACC 
ACGCTCATCC 
GCCAGCOGCG 
ATCAAGCCGC 
OTCCTCTGCT 
GGQGACTGTQ 
QAGCGCAGCC 
CTGGTGGAAG 
ATTCTGGCTT 
OGCACGGGGA 
TTCTCCAACT 
TTGGTGCGCT 
TTCCTGGGCA 
GACCCCACCC 
AAGTGGCGCT 
OTGCTGGCCC 
GTCftGCCTGG 
GGGCGGGAGG 
TAACCCCTCC 
CTTCTGCCCC 
AGAGGGCGCC 
TCTCTGTGCG 
CGGGCACTCG 
TTTGCACTGT 
TTTATGAATG 
CCCTTAGAGC 
TTACTGTGAG 
GTCTGAGTCT 
TTGATCTCGG 
CAGTTTGCTA 
AGTGCAATAA 



AATGTTCCTG 
CCATCCGCAC 
TQGCCGAGCG 
ACATCCTCAT 
AGCACCTGGA 
CCCGCTTCAC 
ACCTCCTGCG 
OGCCGOTCAA 
CCCOGCCTGT 
T006CAAGTG 
ACOTOOCCAT 
ACCCGCQATT 
CGCGCAGCGA 
GGAAACCCTA 
CCGTGTCCAC 
ACGAGGACCT 
TCCCGCTOQA 
TOGGCAAOCA 
TCCGCCTCTC 
AGCTGOGCTA 
TGGAGGAGCQ 
CGCAAGGTGT 
CTCTCCCACC 
TTTTTTGTCT 
TGAAOTAGGQ 
GACGGTGACA 
CQAGGCGACT 
CTTACTATTC 
GTOTCCATCC 
A6C6AAACTG 
GTGAACGTGG 
CGTGGCCGCC 
GGTCCATCTG 
AATGAACATT 
TCACC 



GAAGGCCGTC 
CTTCACOQCC 
ACTGTGCGAG 
CCTGGCCACC 
OGTCTTCTAC 
CCAGGGCAAG 
GAGCCTCTAC 
CCACACCAOC 
GTGCGACCCT 
CGOOCTACTC 
CAAGACGGTG 
AAACCTCAAG 
GACCTTCCOC 
CAACCTGGAC 
CQGCCTCATG 
GGCTCGGAAC 
CAGCCACGTG 
CAAATACGGC 
CTACGACATC 
CAAGATCGCC 
GGACTTCCGC 
CGGTTTTGAT 
TCATCTTCGT 
CTGAAATTTG 
TCCCGCCCCC 
ATGTTTACAA 
TCTCAAGCTT 
AAGGTAAGAG 
TTTCCCCATC 
CCCCCTCCTG 
AOCTGTTTCT 
CCTGGACCAG 
TGATATTTCT 
GAAATTGAAA 



CTCCTCCTTG CCCTGGCCTC 
AAGTCCTTTC ACACCTGCCC 
GAGAGCCCCA CCTTCGCCTA 
ACGCGCAGCG GCTCCTCCTT 
CTGTTTGAGC CCCTCTACCA 
AGCCCGGCCG ACCGGCGGGT 
GACTGOGACC TCTACTTCCT 
GACAGGATCT TCC6C0GCG0 
CCGGGGCCAG CCGACCTGGT 
AACCTGACCG TGGCGGCOGA 
CGCGTGCCCG AGGTGAACGA 
OTCATCCACC TOGTCCGAGA 
OACACGTACC GGCTCTGGCG 
GTGACGCAGC TGACCACGGT 
OGGCCCCCGT GGCTCAAGGG 
CCTATGAAGA AGACCGAGGA 
GCCCGCTGGA TCCAGAACAA 
AOCGTGCGAA ACTCGGCGGC 
GTGGCCTTTG CCCAGAACGC 
GCCTCGGAGG AGGAGCTGAA 
CCCTTCTO OT GA CCOGGGOG 
AAAATGGACC GTTTTTAACT 
GTCCTTCCTG CCCCCAGCTC 
CACTACGTCT TGGAOGGGAA 
CCCACCCCAT TCAGACACAT 
GCACCACATT TACACATCCA 
TTGAATGGGT GAGTGGTCGG 
GATACAAACA AGAGGACCAC 
CCTGCCTCCT GCCCCTGACG 
CCCGCCCTTG CCTGTCGGTG 
GTTTCCAGTC TGTGGTGATG 
TGATGACTGA TGAATCTTAT 
TTGTGCCAAA AAGAAAAAAA 
TGCTTTATCT GTGTTTTCTG 



CATTGCCATC 
OGGOCTGGCA 
CAACCTCTCC 
CGTGGGCCAG 
CGTCCAGAAC 
CATGCTAGGC 
GGAGAACTAC 
GGCCAGCCGG 
CCTGGAGGAG 
GGOGTGCOGC 
CCTGCGCGCC 
CCCCCGOGGC 
GCTCTGGTAC 
GTGCGAGGAC 
CAAGTACATG 
GATCTACGGG 
CAOGCGGGGC 
CACGGCCGAG 
CTGCCAGCAG 
GAACCCCTCG 
GTGCGGGTGG 
GTTGCCTTAT 
ACCCCACTCC 
TCACTGGGGC 
GGATGTTGGG 
CACACGCACA 
GTATCTAGTT 
TTGTCTCTAA 
CCCATTTCCC 
AGGCAGGTTT 
CTGTCTGTCT 
GAGCTTCTGA 
AAGAGTGGAT 
TAAATAAAAG 



420 
460 
540 
€00 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



Seq ID Not 97 PfrfttQ&n gWHEnftOi 
Protein Accession ft: NP_003645.1 



1 11 21 31 41 51 

1 I I I I I 

MQCSWKAVLL LALASIAIQY TAIRTPTAKS FHTCPGLAEA GLAKRLCEES PTFAYNLSRK 
THILILATTR SGSSFVGQLP NQHLDVFYLP BPLYHVQNTL IPRPTQGKSP ADRRVMLGAS 
RDLLRSLYDC DLYFLENYIK PPPVNHTTDR IFRRGASRVL C8RPVCDPPG PADLVLESGD 
CVRKCGLUfL TVAAEACRER SHVAIKTVRV PBVNDLRALV EDPRLNLKVI QLVRDPRGIL 
ASRSETFRDT YRLWRLWYGT GRKPYNLDVT QLTTVCEDPS NSVSTGLMRP PWLKGKYMLV 
RYEDLARNPM KKTEEIYGFL GIPliDSHVAR WIQNNTRGDP TLGKHKYGTV RNSAATAEKW 
RFRLSYDIVA FAQNACQQVL AQDGYKIAAS EEELKNPSVS LVEERDPRPF S 



60 
120 
180 
240 
300 
360 



55 



60 



65 



70 



75 



8eq ID NO: 98 DNA sequence 
Nucleic Acid Accession #: NM_002852.1 
Coding sequence: 68-1213 (underlined 



sequences correspond to start and stop codons) 



1 
I 

CTCAAACTCA 
TCCAGCAATG 
GAACTCGGAT 
CCATCCCACT 
GCTCTTCATC 
CGACGTCCTG 
CCTGGCGAGG 
CGAGCTGCTG 
GGCGCAGCGC 
GACGCGAGCC 
TTGTGAAACA 
AGTOAGACCA 
ATTAAACAAA 
GTATCTCAGC 
TGAAGCCATG 
AGGGCTCACA 
AGGTCACATT 
TGTGGGTGGT 
CTGGGATAGT 



11 
I 

GCTCACTTGA 
CATCTCCTTG 
GATTATGATC 
GAGGACCCCA 
ATGCTGGAGA 
CGGGGCGAGC 
COGTGCGCGC 
CAGGOGACCC 
CCAGAGGAGG 
GACCTGCACG 
GCTATTTTAT 
ATGAGGCTTG 
ACCATCCTGT 
TACCAATCCA 
GTTTCCCTGG 
TCCTTGTGGG 
GTTCCTGAGG 
GGCTTTGATG 
GTTCTTAGCA 



21 
I 

GAGTCTCCTC 
CGATTCTGTT 
TCATGTATGT 
CGCCGTGCGA 
ACTCGCAGAT 
TGCAGAGGCT 
CGGGGGCTCC 
GCGACGCGGG 
CGGGGCGCGC 
CGGTGCAGGG 
TCCCAATGCG 
AGTCTTTTAG 
TTTCCTATGG 
TAGTGTTTGT 
GAAGGTGGAC 
TAAATGGTGA 
GAGGAATCCT 
AAACATTAGC 
ATGAAGAGAT 



31 
I 

CCGCCAGCTG 
TTGTGCTCTC 
GAATTTGGAC 
CTGCGGTCAG 
GAGAGAGCGC 
GCGGGAGGAG 
CGCAGAGGCC 
CCGCAGGCTG 
CCTQGCCGCG 
CTGGGCTGCC 



TGCCTGCATT 
CACAAAGAGG 
GGTGGGTGGA 
CCACCTGTGC 
ACTGGCGGCT 
GCAGATTGGC 
CTTCTCTGGG 
AAGAGAGACC 



41 

I 

TGGAAAGAAC 
TGGTCTGCAG 
AACGAAATAG 
GAGCACTCGG 
ATGCTGCTGC 
CTGGGCCGGC 
AGGCTGACCA 
GCGCGTATGG 
GTGCTAGAGG 
CGGAGCTGGC 
ATTTTTGGAA 
TGGGTCAAAG 
AATCCATATG 
GAGGAGAACA 
GGCACCTGGA 
ACCACTGTTG 
CAAGAAAAGA 
AGACTCACAG 
GGAGGAGCAG 



51 
I 

TTTGCGTCTC 
TGTTGGCCGA 
ACAATGGACT 
AATGGGACAA 
AAGCCACGGA 
TCGCGGAAAG 
GTQCTCTGGA 
AGGGCGCGGA 
AGCTGCGGCA 
TGCCGGCAGG 
GCGTGCATCC 
CCACAGATGT 
AAATCCAGCT 
AACTGGTTGC 
ATTCAGAGGA 
AGATGGCCAC 
ATGGCTGCTG 
GCTTCAATAT 
AGTCTTGTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



226 



WO 02/079492 



PCT/US02/04915 



10 



CATCOGGGGG 
GTATGTTTCA 
AACACATOCC 
TGAAAGAGAG 
AAGGAAAGAC 
TTTCAGTTTA 
TGTTGAACAG 
GAATTTTACA 
TATGTACCTT 
ACTATAAATO 
AAQTTATATT 
AAATAAAATA 



AATATTGTTO 
7AAATGTTQT 
AGTTGGGAAG 
AOTTGAOACC 
ATTGGAAAAA 
ATGCTGTGTC 
AGGGACAATT 
TTGGAAGAAT 
ATTACAAAAA 
TAGTTTATGT 
GCAAAAGGGA 
TTTTATAAAA 



GGTGGGGAGT 
GAAACTCCAC 
GTCTGAAAAC 
AATCTTTATT 
GCTTTTGAGG 
TCTGTCAGAT 
GTTTTACTTT 
AACAAAATAA 
AAATGATGAA 
GTTATAATCG 
TTTGTATTAA 
CTAAAAAAAA 



CACAGAGATC 
TTGAAGCCAA 
TCAGTGCATA 
TGTACTGGCC 
ATAATGTTAC 
AAACTCTCAA 
TCTTTGGTTA 
GATTTGTTOT 
AACATATTTA 
AATGTCACGT 
TTTAAGACTA 
AAAAAAA 



CAGCCACATG 
AGAAAGAAAC 
ATAGGAACAC 
AAATACTGAA 
TAGACTTTAT 
ATAATTAAAA 
ATTTTGTTTT 
CCATTGTTCA 
TACTACAAGG 
TTTTGAGAAG 
TTTTTGTAAA 



GAGGAGCTCA 
TCACACTTAA 
TTGAGACTAA 
TAAACAGTTG 
GCCATGGTGC 
AGGACTGTAT 
GGCCAGAGAT 
TTGTTATTGG 
TGACTTAACA 
ATAGTCATAT 
GCTCTACTGT 



1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



15 



Seq ID Ho: 99 protean sequence: 
Protein Accession 8: NP_002843.1 



11 



21 
I 



31 



41 



51 



20 MHUjAILPCA LWSAVLAENS DDYDLMYVNL DNEIDNGLHP TSDPTPCDCG QEHSBWDKLF 60 

IMLENSQMRE RMLLQATDDV LRGBLQRLRE ELGRLAKSLA RPCAPGAPAE ARLTSALDEL 120 

LQATRDAGRR LARMEGAEAQ RPEEAGRAIiA AVLEKLRQTR ADLHAVQGWA ARSWLPAGCE 180 

TAILPPMRSK KITOSVHPVR PMRLESFSAC IWVKATDVLN KTILFSVGTK RNPYEIQLYL 240 

SYQSIVFWG GEENKLVABA MVSLGRWTHL CGTWNSEEGL TSLWVNGELA ATTVEMATGH 300 

25 IVPEGGILQI GQEKNGCCVG GGFDBTLAFS GRLTGFNIWD SVLSNEEIRE TGGAESCHIR 360 
GNIVGWGVTE IQPHGGAQYV 8 



Seq ID HO: 100 PWA ffcqueftfifl 
Nucleic Acid Accession #i NM_007351.1 
30 Coding sequence: 72-3758 (underlined sequences correspond to start and stop codons) 



35 



40 



45 



50 



55 



60 



65 



70 



75 



i 
I 

CTGCTATCAA 
AAACTACTGA 
GCATTGGGCT 
AGACTATGCC 
CCACTCGGGT 
GTCTTCTTAA 
ATCAAACTCT 
TCCCAACCAA 
CTACACTGAA 
ACACAGTTGG 
GAGCCCCACG 
ACCAAAAATC 
GGTTATCTCC 
CTTGTGGCTG 
ATAGGATGCA 
GGCCGAAATG 
AAAGTCATAC 
ACCCAGAAGT 
TTCTGCAGAA 
CCTCCCTAGA 
GTCTAAAATC 
TTTTTCAAAA 
CAGAGGACCT 
TAGCAGCCCA 
TGGAACTAAG 
CTATTAAAGA 
CAAGAAGCAT 
ATGAGCAGCT 
CAGTTAGCAA 
GTTTGATX3AT 
TCACCOTCTC 
CCAAATGCAQ 
TAAATCAAAC 
AGCAACTAAA 
CATCACTCAG 
AGATAGAAAA 
AAAGACACAA 
TCAATGAATA 
ATGCTATTGA 
ATAATAGTGA 
CTCAGTTCCA 



11 
I 

AAAGGCCATA 



TAACAACAGT 
TTCTGCTTCA 
CATGTOGGCG 
ATCAACACTG 
CACATCCACA 
CGCTAGCATC 
ATTTCTTCAG 
AGGCACTGGA 
GGAAACATAC 
AAATTTCGAA 
CACAGTGACA 
GACCGGTGGA 
ACATAAAATT 
TCAACTAAGA 
AGCTGTTGGC 
GATGCAAAAA 
GAAGATTGAC 
AGGAAAAGTC 
CAAAAGCATT 
TGACATGCAA 
CGAAAGCACC 
GCAAAAGTTT 
GAATCACATT 
ACTAGAAGTA 
TCTGTATTAT 
TTTATCAACT 
TAATGTCACT 
GCTGCAAATG 
TTTGGAGATG 
AAATGATTTT 
ATTGGCTGAA 
TGATTTGACT 
ACAGACAATG 
TCTQACTAGT 
CTTACTTAGA 
TGCCTTAGAA 
TTTCATTCAA 
GATCCATCAT 
CCGTCTGAAT 



21 

! 

AGGATTTTGT 
GCAAGATTAT 
AAGCATTCTT 
GTTCCTCCAA 
GAGATAGCTA 
CCTCCCTCAG 
GAGAAAGCAG 
AAGTTCAATC 
AGCTTTGCCA 
GGCATTGGAG 
CTCAGCCGGG 
ACAACTAGAG 
TTGGACAACC 
TCCTGTCCTC 
GTCACCTCAT 
GCCCAGGAAC 
AGAGGAGTAG 
ATGACTGATC 
AATATTTCTT 
AGCGAAGATA 
AATGTACTGA 
GAGACTGTAG 
AGQCAAATAA 
GTTTTGGTGC 
GTGAATGTAA 
AAGCAGACTC 
GAATCCCTCA 
GAACAGGTAT 
GAGTACATGT 
TTTGAAGATT 
GAGAAAGAGT 
AAATTTCAAC 
GTTCTCTTTC 
TATGATATGG 
ACATATQAAC 
GCTGTCAATA 
AATGAAGTAC 
ATGGAAGATG 
GATAACTATG 
AAATGTACCT 
GATTCTATTC 



31 
I 

CCCCAAATTT 
TTGTCCTTCT 
GGACTATACC 
ATAAAATACA 
CAACTCCAGA 
AAACAAGTGC 
AAGGAGTGGT 
CTGGAGCAGA 
GAAAGTCAAA 
GCGTTGGAGG 
GTGACAGCAG 
GAAAGAATTG 
AGGTCACTTA 
AGAGATCTCA 
TGGATTGGAG 
AGCAAAGTTT 
CTGAGCAGCA 
AGGTGAACTA 
TGACTGTGAA 
AAAGCAGAGA 
TAAGAGACAT 
CACAGCTCTT 
TTCAAAAAGT 
AAGAGAATCG 
GGCAAGAAAT 
ATTTAGAAGG 
ATAAAACTCT 
CAGACCAGAA 
CTACTTTACA 
TGCACATTCA 
CTCTCAGAGG 
TTAAGGACAC 
CAATGGACAA 
AGATCCTTCA 
AACCAAAGGA 
GTCTAAATTT 
AGGGTCGTGA 
GCCTCAATAA 
CCCTAAAAGA 
CCGATATGGA 
AGACTTTGGT 



41 

I 

CACATGAGCT 
TTCTAGTTTA 
TGAGGATGGG 
AAGTTTGCAA 
GGCAAGAACT 
ACCTGCTGAG 
CAAGTTACAG 
ATCAGTGGTC 
TGAACAAGCA 
CACTGGAGGC 
TTCCAGCCAA 
GTGTGCTTAT 
TGTCCCAGGT 
GAAGATATCC 
GTGCTGTCCT 
GATACACACC 
GCAGCAGCAA 
CCAGG CAATG 
TGATGTAAGG 
ATTTCAATCT 
AGTAAGAGAA 
CAAGACTGTA 
TAATGAATCT 
GCCCACTTTQ 
GACTCTTACA 
TGCTCTAGAA 
TTCTAAATTG 
GAATGCTCCA 
TGAAAATATA 
AGAAAGCAAG 
TGAATGTGAA 
AGAAGAGAAT 
TAAGATGGAC 
ACCCTTGCTT 
AGCAATAGTG 
TATTATCAAA 
TGATGCCTTA 
GACAATGACT 
GACTTTAAGT 
AACTATTTTG 
CAATGACAAT 



SI 
I 

ACCTTGCTTC 
TGGAGTGGGG 
AACTCTCAGA 
ATACTGCCAA 
TCTGAAGACA 
GGTGTGAGAA 
AATCTTACCC 
CTTTCCAATT 
ACTTCTCTAA 
GTGGGAAATC 
AGAACTGACT 
GTACATACCA 
GGGAAAGGAC 
AATCCTGTCT 
GGATACAGTG 
AACCAGGCTG 
GGCTGTGGTG 
AAACTGACTC 
AACACTTACT 
CTTCTAAAAG 
CAATTTAAAA 
TCAAGTCTAT 
GTGGTTTCAA 
ACTGATATAG 
TGTGAGAAGC 
CAGGAACACT 
AAGGAAGTAC 
GCTGCTGAGT 
AAGAAGCAGA 
ATTAACAATC 
GACATGTTAT 
TTACATGTGT 
AAAATGAGTG 
GAGCAGGGAG 
ATAAGGAAAA 
GAACTTACAA 
GAAAGACGTA 
ATTATAAATA 
ACTATTAAGG 
ACATTTATTC 
CAGAGATATA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



227 



WO 02/079492 



PCT/US02/04915 



ACTTTOTTTT 6CAAGTOQCC AAGACCCTTG CAGGTATTCC CAGAGATGAG AAACTAAATC 2520 

AGTCCAACTT CCAAAAGATG TATCAAATOT TCAATGAAAC CACTTCCCAA GTGAGAAAAT 2580 

ACCAGCAAAA TATGAGTCAT TTGGAAGAAA AACTACTCTT AACTACCAAG ATTTCCAAAA 2640 

ATTTTGAGAC TCGGTTGCAA GACATTGAGT CTAAAGTTAC CCAGACGCTC ATACCTTATT 2700 

ATATTTCAGT TAAAAAAGGC AGTGTAGTTA CAAATGAGAG AGATCAGGCT CTTCAACTGC 2760 

A AGTAT TAAA TTCCAGATTT AAGGCGTTGG AAGCAAAATC TATCCATCTT TCAATTAACT 2820 

TCTTTTCGCT TAACAAAACT CTCCACGAAG TTTTAACAAT GTGTCACAAT GCTTCTACAA 2880 

GTGTGTCAGA ACTGAATGCT ACCATCCCTA AGTGOATAAA ACATTCCCTG CCAGATATTC 2940 

AACTTCTTCA GAAAGGTCTA ACAGAATTTG TGGAACCAAT AATTCAAATA AAAACTCAAG 3000 

CTGCCCTATC TAATTCAACT TGTTGTATAG ATOGATCGTT GCCTGGTAGT CTGGCAAATG 3060 

TTGTCAAGTC TCAGAAGCAA GTAAAATCAT TGCCAAAGAA AATTAACGCA CTTAAGAAAC 3120 

CAACGGTAAA TCTTACCACA GTCCTOATAG GCCGGACTCA AAGAAACACG GACAACATAA 3180 

TATATCCTGA GGAGTATTCA AGCTQTAGTC GOCATCCGTG CCAAAATGGG GGCAOGTGCA 3240 

TAAATGGAAG AACTAGCTTT ACCTGTGCCT GCAGACATCC TTTTACTGGT GACAACTGCA 3300 

CTATCAAGCT TGTGGAAGAA AATGCTTTAG CTCCAGATTT TTCCAAAGGA TCTTACAGAT 3360 

ATGCACCCAT GGTGGCATTT TTTGCATCTC ATACGTATGG AATGACTATA CCTGOTCCTA 3420 

TCCTGTTTAA TAACTTGGAT GTCAATTATG QAGCTTCATA TACCCCAAGA ACTGGAAAAT 3480 

TTAGAATTCC GTATCTTGGA GTATATGTTT TCAAGTACAC CATCGAGTCA TTTAGTGCTC 3540 

ATATTTCTGG ATTTTTAGTG GTTGATGGAA TAGACAAGCT TGCATTTGAG TCTGAAAATA 3600 

TTAACAGTGA AATACACTGT GATAGGGTTT TAACTGGGGA TGCCTTATTA GAATTAAATT 3660 

ATGGGCAGGA AGTCTGGTTA CGACTTGCAA AAGGAACAAT TCCAGCCAAG TTTCCCCCTG 3720 

TTACTACATT TAGTGGCTAT TTATTATATC GTACATAAGT TAGTATGAAA AACAGACTAT 3780 

CACCTTTATT GAGAAACAGC CAGTGTTTTC ATTTATCTTT GCTTGCACAT CTGCTCTGTT 3840 

TTGGTTTTTC TACAGGAAAT GAAAATCAAC TTGTTTTTTT AATATGAGTA AACTTGTATG 3900 

TCTATTTTAT AAAATTATTT GAATATTGTT TAATGTCTGA ATATGAAAGA GTTCTTGATC 3960 

CTAAAGAAAT TTAGTGGCAC AGAAAACAAA GTGAATTTGT TAGCATAATT ATTCCTATTC 4020 

TTATTTCTTC ATTTTAAGTC ATTGCAATGG AAAGTAATAT TATAAAACGG TAATTACAAC 4080 

ATATTATCAG TCACAGTTTT CTTTCCAATT AAACACTTAA CTTTTGTTAT TCCCTGTATA 4140 
TAAATATATA ACACACATTT TCTAGATTCA CAAATTTAAA TAAATTACTC AAAAAATG 



Seq ID No: 101 Protein sequence! 
Protein Accession #: NP_031377.1 



1 11 21 31 41 51 

I I I I I I 

MKGARIiFVLIi SSLWSGGIGL HNSKHSWTIP EDGNSQKTHP SASVPPNKIQ SLQILPTTRV 60 

MSAEIATTPE ARTS EDS LLK STLPPSETSA PAEGYRKQTL TSTEKAEGW KLQNLTLPTN 120 

ASIKFNPGAE SWLSNSTLK PLQSFARKSN EQATSLNTVG GTGGIGGVGG TGGVGNRAPR 180 

ETYLSRGDSS SSQRTDYQKS NFETTRGKNW CAYVHTRLSP TVTLDNQVTY VPGGKGPCGW 240 

TGGSCPQRSQ KISKPVYRMQ HKTVTSLDWR CCPGYSGPKC QLRAQEQQSL IHTNQAESHT 300 

AVGRGVAEQQ QQQGCGOPBV MQKMTDQVNY QAMKLTLLQK KIDNISLTVN DVRNTY8SLE 360 

GKVSEDKSRB FQSLLKGLKS KSINVLIRDI VREQFKIFQN DMQETVAQLF KTVSSLSEDL 420 

ESTRQIIQKV NESWSIAAQ QKFVLVQBNR PTLTDIVELR NHIVNVRQEM TLTCEKPIKE 480 

LEVKQTHLEG ALEQEHSRSI LYYESLNKTL SKLKEVHEQL LSTEQVSDQK NAPAAESVSN 540 

NVTEYMSTLH ENIKKQSLMM LQMFEDLHIQ ESKINNLTVS LEMEKESLRG ECEDML3KCR 600 

NDPKFQLKDT EENLHVLNQT LAEVLFPMDN KMDKMSEQLN OLTYOMEILQ PLLEQGASLR 660 

QTMTYEQPKE AIVIRKKIEN LTSAVNSLNF IIKELTKRHN LLRNEVQGRD DALERRINEY 720 

ALEMEDGLNK TMTIINNAID PIQDNYAUCE TLSTIKDNSE IHHKCTSDME TILTPTPQPH 780 

RLNDSIQTLV NDNQRYNPVL QVAKTLAG1P RDEKUTQSNP QKMYQMFNET TSQVRKYQQN 840 

MSHLEEKLLL TTKISKNPET RLQDIESKVT QTLIPYYISV KKGSWTMER DQALQLQVLN 900 

SRFKALEAKS IHLSINFFSL NKTLHEVLTM CHNAST8VSE LHATIPKWIK HSLPDIQLLQ 960 

KGLTEPVEPI IQIKTQAAIiS NSTCCIDRSL PGSLANWKS QKQVKSLPKK INALKKPTVN 1020 

LTTVLIGRTQ RNTDNIIYPE BYSSCSRHPC QNGGTCINGR TSFTCACRHP FTGDNCTIKL 1080 

VEENAIiAPDF SKGSYRYAPM VAFFASHTYG MTIPGPILFN NLDVNYGASY TPRTGKFRIP 1140 

YLGVYVFKYT IESPSAHISG FLWDGIDKL AFESENINSE IHCDRVLTGD ALLELNYGQE 1200 
VWIiRLAKGTI PAKFPPVTTF SGYLLYRT 



Seq ID NOi 102 DNA sequence 

Nucleic Acid Accession #: NM_000873.2 

Coding sequence: 57-884 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I I 

ATCTCCCTCC AGGCAGCCCT TGGCTGGTCC 
CCTCTTTCGG TTACAGGACC CTGACTGTGQ 
CGGATGAGAA GGTATTCGAG GTACACGTGA 
GGTCCCTCGA GGTCAACTGC AGCACCACCT 
CCTCTCTAAA TAAGATTCTG CTGGACGAAC 
ACATCTCCCA TGACACGGTC CTCCAATGCC 
TGAATTCCAA CGTCAGCGTG TACCAGCCTC 
CTTT8GTGGC TGTGGGCAAG TCCTTCACCA 
TGGACAGOCT CACCCTCTTC CTGTTCCGTG 
GGAAGGCAGC CCCTGCTCCG CAGGAGGCCA 
AGGATGGCCA CCGCAACTTC TCCTGCCTGG 
ACATCTTTCA CAAACACTCA GCCCCGAAGA 



31 41 51 

I I i 

CTGCGAGCCC GTGGAGACTG CCAGAQATGT 60 

CCCTCTTCAC CCTGATCTGC TGTCCAGGAT 120 

GGCCAAAOAA GCTGGCGGTT GAGCCCAAAG 180 

GTAACCAGCC TGAAGTGGGT GGTCTGGAGA 240 

AGGCTCAGTG GAAACATTAC TTGGTCTCAA 300 

ACTTCACCTG CTCCGGGAAG CAGGAGTCAA 360 

CAAGGCAGGT CATCCTGACA CTGCAACCCA 420 

TTGAGTGCAG GGTGCCCACC GTGGAGCCCC 480 

GCAATGAGAC TCTGCACTAT GAGACCTTCG 540 

CAGCCACATT CAACAGCACG GCTGACAGAG 600 

CTGTGCTGGA CTTGATGTCT CGCGGTGGCA 660 

TGTTGGAGAT CTATGAGCCT GTGTCGGACA 720 
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GCCAGATGGT CATCATAGTC ACGGTGGTGT CGGTGTTGCT OTCCCTOTTC GTGACATCTG 780 

TCCTGCTCTG CTTCATCTTC GGCCAGCACT TGCGCCAGCA 60GGATQGGC ACCTACGGGG 840 

TGCGAG06GC TTGGAGGAOQ CTGCCCCAG6 CCTTCCGGCC ATAGCAACCA TGAGTGGCAT 900 

GOCCACCACC ACGQTGGTCA CTGGAACTCA GTGTGACTCC TCAGGGTTGA GGTCCAGCCC 960 

TGGCTGAAGG ACTGTGACAG GCAGCAGAGA CTTGGGACAT TGCCTTTTCT AGCCCGAATA 1020 
CAAACACCTG GACTT 



10 



Seq ID No: 103 Prptq&n, flsquence; 
Protein Accesaion 8: NP 000864.1 



1 11 21 31 41 SI 

I I I I I I 

MSSPOYRTLT VALFTLICCP GSDEKVFEVH VRPKKIAVEP KGSLBVNC8T TCNQPEVQGL 60 
15 ETSLNKILLD EQAQWKHYLV SNISHDTVLQ CHFTCSGKQB SMNSNVSVYQ PPRQVTLTLQ 120 
PTIjVAVGKSF TIECRVPTVB PLDSIiTLFLF RGNETLHYET PGKAAPAPQE ATATFNSTAD 180 
REDGHRNFSC LAVLDLMSRG GNIFHKHSAP KMLEIYEPVS DSQMVIIVTV VSVLLSLPVT 240 
SVLLCPIFGQ HLRQQRMGTY GVRAAWRRLP QAFRP 

20 Seq ID NOi 104 DNA sequence 

Nucleic Acid Accession #: NM_001795.2 

Coding sequence: 121-2475 (underlined sequences correspond to start and stop codons) 



25 



30 



35 



40 



45 



50 



55 



60 



65 



k 70 



75 



i 

I 

GACGGTCGGC 
AGTCCAACGG 
ATGCAGAGGC 
GCAGCAGTGG 
ACCCACCGGC 
AACACCTCAC 
AAGTACCTOC 
GACGTGTTCG 
GTCATTGTGG 
GTTCATGACG 
CCTGAGTCGT 
CCCACTGTGG 
GCCATCGATA 
GCCAGGTATG 
ACGGCCACCG 
ACCAAGTACA 
TTTGTTGAGG 
GACTACCAGG 
CCCATGAAGC 
GACCCCACCA 
ATCAACATCA 
CTGAAGGAAA 
GCTAGGCATA 
GTCACAAAAA 
TATAACCTGA 
TCCATTGTGC 
AAGCCCTACC 
TCCGCAATAG 
GAGAACAACT 
GGGGAGTTTG 
GGQATGCCAA 
CAGGGCGAGT 
GTGGTAGCCA 
CGGCGGCGGC 
CAGCTGGTCA 
TCGGTGCTCA 
CGGCCTTCCC 
GGGCCCGGGG 
GACOGCCCCC 
GAGTCCCTCA 
AACGACTGGG 
GAGCTGCTGT 
CCAGGCCAGT 
GCACCCCTTC 
CCTGAAATAT 
CT3GTGTTCT' 
ACTCAAAGAC 
TQCAGGGTCT 
TATCTGCCTG 
CATTCCCAAG 



11 
I 

TGACAGGCTC 
AACAGAAACA 
TCATGATGCT 
CAGCAGCAGG 
GCCAAAAGAG 
TTCCCCATCA 
TCAAAGGAGA 
CCATTGAGAG 
ACAAGGACAC 
TGAACGACAA 
CGGCTGTGGG 
GAGACCAOGC 
ATTCTGGACG 
AGATCGTGGT 
TGCTGGTCAC 
CATTTGTCOT 
ACCCAGATGA 
ACGCTTTCAC 
CTCTGGATTA 
TCGACCTCCQ 
CAGATGTGGA 
ACCAGAAGAA 
GCATTGGATA 
AGGGGGACAT 
CTGTGGAGGC 
AAGTCCACAT 
AGCCCAAAGT 
ACAAGGACAT 
TTACCCTCAC 
ACCGGGAGCA 
GTCGCACGGG 
TCACCTTCTG 
TCTTACTCTG 
TCCGGAAGCA 
CCTACGACGA 
ACTCGGTGCQ 
TCTATGCGCA 
AGATGGCAGC 
CCTACGACAC 
GCTCCCTGGG 
GACCCAGGTT 
ATJAGGCGGC 
CAGACGCCAG 
CTCGTGGGTC 
CCAGGAATAT 
QTCTGGGCTC 
TTCCTCTGGC 
TTTTCTAGGG 
GAGGCAAAGG 
GGAGACTGAC 



21 
I 

CACAGAGCTC 
TCCCTCAGCC 
CCTOGCCACA 
TGCTAACCCT 
AGATTGGATT 
TGTAGGCAAG 
ATATGTGGGC 
GCTGGACCGG 
TGOTGAAAAC 
CTGGCCTGTG 
GACCTCAGTC 
CTCTGTCATG 
TATTATCACA 
GGAAGCGCGA 
TCTGCAAGAC 
GCCTGAAGAC 
GCCCCAGAAC 
CATTGAGACA 
TGAATACATC 
ATACATGAGC 
CGAGCCCCCC 
GCCTCTGATT 
CTCCATCCGC 
TTACAATGAG 
CAAAGAACTG 
TGAAGTTTTG 
GTGTGAGAAC 
AACACCACGA 
GGATAATCAC 
TACCAAGGTC 
CACCAGCACG 
CGAGGATATG 
CATCCTCACC 
GGCCCGCGCG 
GGAGGGCGGC 
CCGCGGCGGG 
GGTGCAGAAG 
CATGATOGAG 
GCTGCACATC 
CACCGACTCA 
TAAGATGCTG 
CGAGGTCACT 
GCACCACAGC 
CCAGAGACCT 
ATGTCAGTGA 
AGACATCCAC 
TCCCCAAGGC 
TCCCTGAACG 
CCTGGACAGC 
CATCATGCCC 



31 
I 

CACTCACGCT 
CCACAGGCAC 
TOGGGCGCCT 
GCCCAACGGG 
TGGAACCAGA 
ATCAAGTCAA 
AAGGTCTTCC 
GAGAATATCT 
CTGGAGACTC 
TTCACGCATC 
ATCTCTGTGA 
TACCAAATCC 
ATAACGAAAA 
GATGCCCAGG 
ATCAATGACA 
ACCCGTGTGG 
CGGATGACCA 
AACCCCGCCC 
CAGCAATACA 
CCTCCCGCGG 
ATTTTCCAGC 
GGCACAGTGC 
AGGACCAGTG 
AAAGAACTGG 
GATTCCACTG 
GATGAGAATG 
GCTGTCCATG 
AACGTGAAGT 
GATAACACGG 
CACTTCCTAC 



GCCGCCCAGG 
ATCACAGTGA 
CACGGCAAGA 
GGOGAGATGG 
GCCAAGCCCC 
CCACCGAGGC 
GTGAAGAAGG 
TACGGCTACG 
TCCQACTCTG 
GCTGAGCTGT 
CTGGGCCTGG 

ctccaaaaat' 
catcagcctt 

TGACTATTCT 
ATAACCCTGT 
TGCAAAGCAA 
CCCTGGTAAG 
TTGACTTGTG 
TCTCTCGGGA 



41 
I 

CAGGCCCTGG 
GATCTGTTCC 
GCCTGGGCCT 
ACACCCACAG 
TGCACATTGA 
GCGTGAGTCG 
GGGTCGATGC 
CAGAGTACCA 
CTTCCAGCTr 
GGTTGTTCAA 
CAGCAGTGGA 
TGAAGGGGAA 
GCTTGGACCG 
GCCTCCGGGG 
ACTTCCCCTT 
GCACCTCTGT 
AGTACAGCAT 
ACAACGAGGG 
GCTTCATCGT 
GAAACAGAGC 
AGCCTTTCTA 
TGGCCATGGA 
ACAAGGGCCA 
ACAGAGAAGT 
GAACCCCCAC 
ACAATGCCCC 
GCCAGCTGGT 
TCAAATTCAC 
CCAACATCAC 
CCGTGGTCAT 
CCGTGTGCAA 
TGGGCGTGAG 
TCACCCTGCT 
GCGTGCCGGA 
ACACCACCAG 
CGCGGCCCGC 
ACGCGCCTGG 
ACGAGGCGGA 
AGGGCTCCGA 
ACGTGGATTA 
ACGGCTCGGA 
GGACCCAAAC 
GGCAGTGACT 
GGGATAGCAA 
CAAATGCTGG 
CACCCACAGA 
AACAGACTGT 
GCTGGTGAGG 
GGGCAGGATT 
GCCCTAGCCC 



51 
I 

ACGGACAGGC 
TCCTGGGAAG 
GCTGGCAGTG 
CCTGCTGCCC 
TGAAGAGAAA 
CAAGAATGCC 
AGAGACAGGA 
CCTCACTGCT 
CACCATCAAA 
TGCGTCCGTG 
TGCAGAOGAC 
AGAGTATTTT 
AGAGAAGCAG 
GGACTCGGGC 
CTTCACCCAG 
GGGCTCTCTG 
CTTGCGGGGC 
CATCATCAAG 
CGAGGCCACA 
CCAGGTCATT 
CCACTTCCAG 
CCCTGATGCG 
GTTCTTCCGA 
CTACCCCTGG 
AGGAAAAGAA 
GGAGTTTGCC 
CCTGCAGATC 
CTTGAATACT 
AGTCAAGTAT 
CTCAGACAAT 
GTGCAACGAG 
CATCCAGGCA 
CATCTTCCTG 
GATCCACGAG 
CTACGATGTG 
GCTGGACGCC 
GGCACACGGA 
CCACGACGGC 
GTCCATAGCC 
CGACTTCCTT 
CCCCCGGGAG 
CCCCTGCAGC 
CCCCAGCCCA 
ACTCCAGGTT 
CAAATCCAGG 
CCGCCGTCTA 
GTTTAACTGC 
TCCTGGTGCC 
CTCTGCAGCC 
TGCTCCAACT 



60 
120 
180 
240 
300 
360 
420 

• 480 
540 
600 
660 
720 
780 
840 
900 
960 

1020 • 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1660 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160' 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 
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CCATACTCCA CTCCAAGTGC CCCACCACTC CCCAACCCCT CTCCAGGCCT GTCAAGAGGG 3060 

AGGAAGGGGC CCCATGGCAG CTCCTGACCT TGGQTCCTGA AGTGACCTCA CTGGCCTGCC 3120 

ATGCCAGTAA CTGTGCTGTA CTGAGCACTG AACCACATTC AGGGAAATGG CTTATTAAAC 3180 

TTTQAAGCAA CTGTGAATTC ATTCTGGAGG GGCAGTGGAG ATCAOGAGTG ACAGATCACA 3240 

GGGTGAG6GC CACCTCCACA CCCACCCCCT CTGGAGAAGG CCTGGAAGAG CTOAQACCTT 3300 

GCTTTGAGAC TCCTCAGCAC CCCTCCAGTT TTGCCTGAGA AGGGOCAGAT GTTCCCGGAG 3360 

CAGAAGACGT CTCCCCTTCT CTGCCTCACC TGGTCGCCAA TCCATOCTCT CTTTCTTTTC 3420 

TCTQTCTACT CCTTATCCCT TOGTTTAGAG GAACCCAAGA TGTGGOCTTT AGCAAAACTG 3480 

GACAATGTCC AAACCCACTC ATGACTGCAT GACGGAGCCG AGCCATGTGT CTTTACACCT 3540 

CGCTGTTCTC ACATCTCAGG GAACTGACCC TCAGGCACAC CTTGCAGAAG GCAAGGCCCT 3600 

GCCCTGCCCA ACCTCTGTGG TCACCCATGC ATCTTCCACT GGAACGTTTC ACTGCAAACA 3660 

CACCTTGGAG AAGTGGCATC AGTCAACAGA GAGGGGCAGG GAAGGAGACA CCAAGCTCAC 3720 

CCTTCGTCAT GGACOGAGGT TCCCACTCTG GGCAAAGCCC CTCACACTGC AAGGGATTGT 3780 

AGATAACACT GACTTGTTTG TTTTAACCAA TAACTAGCTT CTTATAATGA TTTTTTTACT 3840 

AATGATACTT ACAAGTTTCT AGCTCTCACA GACATATAGA ATAAGGGTTT TTGCATAATA 3900 

AGCAGGTTGT TATTTAGGTT AACAATATTA ATTCAGGTTT TTTAGTTGGA AAAACAATTC 3960 

CTGTAACCTT CTATTTTCTA TAATTGTAGT AATTGCTCTA CAGATAATGT CTATATATTG 4020 

GCCAAACTGG TGCATGACAA GTACTGTATT TTTTTATACC TAAATAAAGA AAAATCTTTA 4080 
. GCCTGGGCAA GAAAAAAA 

Seq ID No: 105 Protein, aegyence: 
Protein Accession #: NP_001786.1 



1 11 21 31 41 51 

I 1 I I I I 

MQRLMMLLAT SGACLGLLAV AAVAAAGAHP AQRDTH5LLP THRRQKRDWI WNQMHIDBEK 60 

NTSLPHHVGK IKSSVSRKHA KYLLKGBYVG KVFRVDAETG DVPAIBRLDR ENISEYHLTA 120 

VIVDKDTGEN LETPSSFTIK VHDVNDNWPV FTHRLFNASV PESSAVGTSV ISVTAVDADD 180 

PTVGDHASVM YQILKQKEYF AIDNSGRIIT ITKSLDREKQ ARYEIWEAR DAQGLRGDSG 240 

TATVLVTLQD INDNFPFFTQ TKYTFWPED TRVGTSVGSL FVEDPDEPQN RMTKYSILRG 300 

DYQDAFTIET NPAHNEGIIK PMKPLDYEYI QQYSFIVSAT DPTIDLRYMS PPAGNRAQVI 360 

INITDVDEPP IFQQPFYHFQ LKENQKKPLI GTVLAMDPOA ARHSIGYSIR RTSDKGQFFR 420 

VTKKGDIYNE KELDREVYPW YNLTVEAKEL DSTGTPTGKE SIVQVHIEVL DENDMAPEPA 480 

KPYQPKVCEN AVHGQLVLQI SAIDKDITPR NVKFKFTLNT ENNFTLTDNH DNTANITVKY 540 

GQFDREHTKV HFIiPWISDN GNPSRTGTST LTVAVCKCNE QGEFTPCKDM AAQVGVSIQA 600 

WAIIiLCILT ITVITLLIFL RRRLRKQARA HQK8VPE1HB QLVTYDEEGG GEKDTTSYDV 660 

SVLNSVRRGG AKPPRPALDA RPSLYAQVQK PPRHAPGAHG GPGEMAAMIE VXKDBADHDG 720 

DGPPYDTLHI YGYEGSESIA BSLSSLGTDS SDSDVDYDFL NDWGPRFKML AELYGSDPRB 780 
ELLY 

Seq ID NO: 106 pNA sequence. 

Nucleic Acid Accession #: none found 

Coding sequence i 1-474 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I 1 I I I 

ACAGTACTCT GTGCAAAAAA CCTGGTGAAA AAGGATTTTT TCCGACTTCC TGATCCATTT 60 

GCTAAGGTGG TGGTTGATGG ATCTGGGCAA TGCCATTCTA CAGATACTGT GAAGAATACG 120 

CTTGATCCAA AGTGQAATCA GCATTATGAC CTGTATATTG GAAAGTCTGA TTCAGTTACG 180 

ATCAGTGTAT GGAATCACAA GAAGATCCAT AAGAAACAAG GTGCTGGATT TCTCGGTTGT 240 

GTTCGTCTTC TTTCCAATGC CATCAACCGC CTCAAAGACA CTGGTTATCA GAGGTTGGAT 300 

TTATGCAAAC TCGGGCCAAA TGACAATGAT ACAGTTAGAG GACAGATAGT AGTAAGTCTT 360 

CAGTCCAGAG ACCGAATAGG CACAGGAGGA CAAGTTGTGG ACTGCAGTCG TTTATTTGAT 420 

AACGATTTAC CAGAOGGAGC TCATTATTTG TGGACTTGGA AAGATAGATG TTAATGACTG 480 

GAAGGTAAAC ACCCGGTTAA AACACTGTAC ACCAGACAGC AACATTGTCA AATGGTTCTG 540 

GAAAGCTGTG GAGTTTTTTG ATGAAGAGCG AOGAGCAAGA TTGCTTCAGT TTGTGACAGG 600 

ATCCTCTCGA GTGCCTCTOC AGGGCTTCAA AGCATTGCAA GGTGCTGCAG GCCCGAGACT 660 

CTTTACCATA CACCAGATTG ATGCCTGCAC TAACAACCTG CCGAAAGCCC ACACTTGCTT 720 

CAATCGAATA GACATTCCAC CCTATGAAAG CTATGAAAAG CTATATGAAA AGCTGCTAAC 780 

AGCCATTGAA GAAACATGTG GATTTGCTGT GGAATGACAA GCTTCAAGGA TTTACCCAGG 840 
AC 

Seq ID No: 107 Protean, Sequence; 
Protein Accession ft: none found 



1 11 21 31 

I I I I 

TVLCAKNLVK KDFFRLPDPF AKVWDGSGQ CHSTDTVKNT 
ISVWNHKKIH KKQGAGFLGC VRLLSNAINR LKDTGYQRLD 
QSRDRIGTGG QWDCflRLFD NDLPDGAHYL WTWKDRC 

Seq ID NO: 108 DNA sequence 

Nucleic Acid Accesaion ft: m_0O23ie.l 

Coding sequence: 246-2572 (underlined sequei 



41 51 
I I 

LDPKWNQHYD LYIGKSDSVT 60 
LCKLGPNDND TVRGQIWSL 120 

ices correspond to start and stop codons) 
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1 11 21 31 41 51 

,1 I 1 I I I 

5 ACTCCAGCGC GCGOCTACCT ACGCTTGOTG CTTGCTTTCT CCAGCCATCQ GAGACCAGAG 60 

CCGCCCCCTC TGCTCGAGAA AGGGGCTCAG CGOCQQCGGA AGCGGAGGGG QACCACCGTG 120 

QAGAGCGCGG TCCCAGCCCG GCCACTGCOO ATCCCTGAAA CCAAAAAQCT CCTGCTOCTT 180 

CTGTACCCCG CCTGTCCCTC CCAGCTGOGC AGGGCCCCTT CGTGGGATCA TCAGCCCGAA 240 

QACAGQGATG GAGAGGCCTC TGTGCTCCCA CCTCTGCAGC TGCCTGGCTA TGCTGOCCCT 300 

10 CCTGTCCCCC CTGAGCCTGG CACAGTATGA CAGCTGGCCC CATTACCCCG AGTACTTCCA 360 

GCAACCGOCT CCTGAGTATC ACCAGCCCCA GGCCCCCGCC AACGTGGCCA AGATTCAGCT 420 

GOGCCTGGCT GGGCAGAAGA GOAAGCACAG CGAGG0CCG6 GTGGAGGTGT ACTATGATGG 480 

CCAGTGGGGC ACCGTOTOOQ ATGACGACTT CTCCATCCAC GCTGCCCACG TCGTCTGCCG 540 

GGAGCTGGGC TATGTGGAGG CCAAGTCCTG OACTGCCAGC TCCTCCTACG GCAAGGGAGA 600 

15 AGGGCCCATC TGGTTAGACA ATCTCCACTG TACTGGCAAC GAGGCGACCC TTGCAGCATG 660 

CACCTCCAAT GGCTGGGGOG TCACTOACTG CAAGCACACG OAGGATGTCG GTGTGGTGTG 720 

CAGOGACAAA AGGATTCCTG GGTTCAAATT TGACAATTCG TTGATCAACC AGATAOAGAA 780 

CCTCAATATC CAGGTGGAGG ACATTCGGAT TCGAGCCATC CTCTCAACCT ACCGCAAGCG 840 

CACCCCAGTG ATGGAGGGCT ACGTGGAGGT GAAGGAGGGC AAGACCTGGA AGCAGATCTG 900 

20 TGACAAGCAC TGGACGGCCA AGAATTCCCG CGTGGTCTGC GGCATGTTTG GCTTCCCTGG 960 

GGAGAGGACA TACAATACCA AAQTGTACAA AATGTTTGCC TCACGGAGGA AGCAGCGCTA 1020 

CTGGCCATTC TCCATGGACT GCACCGGCAC AGAGGCCCAC ATCTCCAGCT GCAAGCTGGG 1080 

CCCCCAGGTG TCACTGGACC CCATGAAGAA TGTCACCTGC GAGAATGGGC TGCCGGCCGT 1140 

GGTGAGTTGT GTGCCTGGGC AGGTCTTCAG CCCTGACGGA CCCTCGAGAT TCCGGAAAGC 1200 

25 ATACAAGGCA GAGCAACCCC TGGTGCGACT GAGAGGCGGT GCCTACATCG GGGAGGGCCG 1260 

CGTGGAGGTG CTCAAAAATG GAGAATGGGG GACCGTCTGC GACGACAAGT GGGACCTGGT 1320 

GTCGGCCAGT GTGGTCTGCA GAGAGCTGGG CTTTGGGAGT GCCAAAGAGG CAGTCACTGG 1380 

CTCCCGACTG GGGCAAGGGA TCGGACCCAT CCACCTCAAC GAGATCCAGT GCACAGGCAA 1440 

TGAGAAGTCC ATTATAGACT GCAAGTTCAA TGCOGAGTCT CAGGGCTGCA ACCACGAGGA 1500 

30 GGATGCTGGT GTGAGATGCA ACACCCCTGC CATGGGCTTG CAGAAGAAGC TGOGCCTGAA 1560 

CGGCGGCCGC AATCCCTACG AGGGCCGAGT GGAGGTGCTG GTGGAGAGAA ACGGGTCCCT 1620 

TGTGTGGGGG ATGGTGTGTG GCCAAAACTG GGGCATCQTG GAGGCCATGG TGGTCTGCCG 1680 

CCAGCTGGGC CTGGGATTCG CCAGCAACGC CTTCCAGGAG ACCTGGTATT GGCACGGAGA 1740 

TGTCAACAGC AACAAAGTGG TCATGAGTGG AGTGAAGTGC TCGGGAAOSG AGCTQTCCCT 1800 

35 GGCGCACTGC CGCCACGACG GGGAGGACGT GGCCTQCCCC CAGGGCGGAG TGCAGTACGG 1860 

GGCCGGAGTT GCCTGCTCAG AAACCGCCCC TGACCTGGTC CTCAATGCGG AGATGOTGCA 1920 

GCAGACCACC TACCTGGAGG ACCGGCCCAT QTTCATGCTO CAGTGTGCCA TGGAGGAGAA 1980 

CTGCCTCTCG GCCTCAGCCG CGCAGACCGA CCCCACCAOG GGCTACCGCC GGCTCCTGCG 2040 

CTTCTCCTCC CAGATCCACA ACAATGGCCA GTCCGACTTC CGGCCCAAGA ACGGCCGCCA 2100 

40 CGCGTGGATC TGGCAOGACT GTCACAGGCA CTACCACAGC ATGGAGGTGT TCACCCACTA 2160 

TGACCTGCTG AACCTCAATG GCACCAAGGT GGCAGAGGGC CACAAGGCCA GCTTCTGCTT 2220 

GGAGGACACA GAATGTGAAG GAGACATCCA GAAGAATTAC GAGTGTGCCA ACTTGGGCGA 2280 

tCAGGGCATC ACCATGGGCT GCTGGGACAT GTACOGCCAT GACATCGACT GCCAGTGGGT 2340 

TGACATCACT GACGTGCCCC CTGGAGACTA CCTOTTCCAG GTTGTTATTA ACCCCAACTT 2400 

45 OGAGGTTGCA GAATCCGATT ACTCCAACAA CATCATGAAA TGCAGGAGCC GCTATGACGG 2460 

CCACCGCATC TGGATGTACA ACTGCCACAT AGGTGGTTCC TTCAGCGAAG AGACGGAAAA 2520 

AAAGTTTGAG CACTTCAGOG GGCTCTTAAA CAACCAGCTG TCCCCGCAG T AAA GAAGCCT 2580 

GCGTGGTCAA CTCCTGTCTT CAGGCCACAC CACATCTTCC ATGGGACTTC CCCCCAACAA 2640 

CTGAGTCTGA AOGAATGCCA OGTGCCCTCA CCCAGCCCGQ CCCCCACCCT GTCCAGACCC 2700 

50 CTACAGCTGT GTCTAAGCTC AGGAGGAAAG GGACCCTCCC ATCATTCATG GGGGGCTGCT 2760 

ACCTGACCCT TGGGGCCTGA GAAGGCCTTG GGGGGGTGGG GTTTGTCCAC AGAGCTGCTG 2820 

GAGCAGCACC AAGAGCCAGT CTTGACCGGG ATGAGGCCCA CAGACAGGTT GTCATCAGCT 2880 

TGTCCCATTC AAGCCACCGA GCTCACCACA GACACAGTGG AGCCGCGCTC TTCTCCAGTG 2940 

ACACGTGGAC AAATGCOGGC TCATCAGCCC CCCCAGAGAG GGTCAGGCCG AACCCCATTT 3000 

55 CTCCTCCTCT TAGGTCATTT TCAGCAAACT TGAATATCTA GACCTCTCTT CCAATGAAAC 3060 

CCTCCAGTCT ATTATAGTCA CATAGATAAT GGTGCCACGT GTTTTCTGAT TTGGTGAGCT 3120 

CAGACTTGGT GCTTCCCTCT CCACAACCCC CACCCCTTGT TTTTCAAGAT ACTATTATTA 3180 

TATTTTCACA GACTTTTGAA GCACAAATTT ATTGGCATTT AATATTGGAC ATCTGGGCCC 3240 

TTGGAAGTAC AAATCTAAGG AAAAACCAAC CCACTGTGTA AGTGACTCAT CTTCCTGTTG 3300 

60 TTCCAATTCT GTGGGTTTTT GATTCAACGG TGCTATAACC AGGGTCCTGG GTGACAGGGC 3360 

GCTCACTGAG CACCATGTGT CATCACAGAC ACTTACACAT ACTTGAAACT TGGAATAAAA 3420 
GAAAGATTTA TG 

Seq ID No: 109 Protein ggguence; 
Oj Protein Accession #: NP_002309.1 



1 11 21 31 41 

7A I I I I I 

/0 MERPLCSHLC SCLAMLALLS PLSLAQYDSW PHYPBYFQQP APEYHQPQAP 
AGQKRXHSEG RVEVYYDGQW GTVCDDDFSI HAAHWCRBL GYVEAKSWTA 
IMLDNLHCTG NEATLAACTS NGWGVTDCKH TKDVOWCSD KRIPGFKPDN 
IQVEDIRIRA ILSTYRKRTP VMEGYVEVKE GKTWKQICDK HWTAKNSRW 
TYNTKVYKMF ASRRKQRYWP PSMDCTGTEA HISSCKLGPQ VSLDPMKNVT 
75 CVPGQVPSPD GPSRFRKAYK PEQPLVRLRG GAYIGEGRVB VLKNGBWGTV 
SWCREDGFG SAKBAVTGSR LGQGIGPIHL NEIQCTGNEK SIIDCKPNAE 



51 
I 

ANVAKIQLRL 60 

SSSYGXGSGP 120 

8LINQIENLN 180 

CGMFGFPGER 240 

CENGLPAWS 300 

CDDKWDLVSA 360 

SQGCNHEEDA 420 
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GVROJTPAMG LQKKLRENGG RNPYEGRVEV LVERNGSLVW GMVCOQNWGI VEAMWCRQL 480 

GLGPASNAFQ BTWYWBGDVN ENKWKSGVK CSGTELSIAH CRHDGEDVAC PQGGVQYGAG S40 

VAGSBTAPDL VLNAEMVQQT TYLEDRPMFM LQCAMHENCL SASAAQTDPT T0YRRLLRF3 600 

SQIHNNGQSD PRPKNGRHAH IWHDCHRHYH SMKVFTHYDL LNLNGTKVAE GHKASFCLBD 660 

TECEGDIQKN YECANFGDQG ITMGCWDMYR HDIDCQWVDI TDVPPGDYLF QWINPNFEV 720 
AESDYSNNTM KCRSRYDGHR IWKYNCHIGG SFSEETEKKF BHFSGLLNNQ LSPQ 

Seq ID NO: 110 DMA sequence 

Nucleic Acid Accession ft: none found, CAT_73007_3 

Coding sequence: 1-49S (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 SI 

1 I. I I I I 

flSSACGCGTG GOTCOACCCA CGCGTCCGCC CACGCGTCCG TATGGACAGA OCCTOCACTQ 
GCTGCTGCCT GCCCGCCACA TACCCAOCTG ACATGGGCAC CGCAGGAOCC ATGCAGCTGT 
CTGGCTGATC CTGGGCTTCC TCCTOTTCCO AGGCCACAAC TCCCAGCCCA CAATOACCCA 
ACCTCTAGCT CTCAGGGAGG CCTTGGCGGT CTAAGTCTOA CCACAGAGCC AOTTTCTTCC 
ACCCAGGATA CATCCCTTCC TCAOAGGCTA ACAGGCCAAG CCATCTOTCC AGCACTGGTA 
CCCAOGCGCA GGTGTCCCCA QCAQTG0AA6 AGACGGAGQC ACAAGCAGAG ACACATTTCA 
ACTGTTCCCC CCAATTCAAC CACCATOAGC CTGAGCATGA GGGAAGATGC GACCATCCTG 
CCAGCCCCAC GTCAGAGACT OTGCTCACTG TGGCTGCATT TGGGATGGAG TCGGGTGGAG 
aCCCACTCTG GCTAGGGGQC GGCAGGCTGA GAGCTCACCT GTTCAGCAGA GAAGTGGAAC 
CACTTTGCTC CTGGAGCC7G TCTACCACAG TGTTATCAGC TTCATTGTCA TCCTOGTGGT 
GTGGTGATCA TCCTAGTTGG TGTGGTCAGC CTGAGGGTTC AGTGTOGGAA GAGCAAGGAG 
TCTGAAGATC CCAGAACCTG GGAGTACAGG GCGTGTCTGA CAAGCTGGTC ACAGACCATG 
GCGAGAACGA CAGCATOGCC CATTATCACA TGGAAGACAT CACACGACTT AGGGCAACAC 
GCACTCAGCA GCGAGCATCA AAGGAGCCTA CX3CATGGCCC AGACTGAGAG CAAGCACAAA 
GGGC 

Seq ID Not 111 Protein sequence; 
Protein Accession #> none found, CAT_73007_3 



1 11 21 31 41 51 • 

I I I 1 I I 

RTRGSTHASA HASVWTEPPL AAACPPHTQL THAPQKPCSC LGDPGLPPVP RPQLPAHNDP 60 
T6SSQGGLGG LSLTTEPVSS TQDTSLPQRL TGQAICPALV PRRRCPQQWK RRRHKQRHIS 120 
TVPPNSTTMS LSMREDATIb PAPRQRLCSL WLHLGWSRVE AHSG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



Seq ID NO i 112 DNA sequence 

Nucleic Acid Accession ft: NM_005424.l 

Coding sequence: 37-3453 {underlined sequences correspond to start an d stop codons) 



CGCTCGTCCT 
TTGCTCCCCA 
GCCAACCTGC 
GGGGCGGGGA 
ATCGTGCGCA 
ACGCTTCGCG 
GCTGGGGCGC 
CCAGACAAGG 
CACAAGGAGA 
GACTGGCATG 
TCGAGCGGCA 
OGGCTCATCG 
CCAGGTTGCC 
GGCTTCACTG 
CAGGAGCAGT 
TATGGCTGCT 
GGTCATTTTG 
CGGTTCAGTG 
CGGATCCCCC 
CGGATCAACT 
AAGCCAGACG 
GCTGAGTTCG 
TCCACATCTG 
CCCCTGGCTG 
GTCTCGTTCT 
AGTACCATGG 
CTGAGGCCAA 
GAGGGGGCCT 
CCGTGGTTGG 
CCCTTGGTGC 



11 
t 

GGCTGGCCTG 
TCCTCTTCTT 
GGCTCACGGA 
GGGGCTCGGA 
CCCCGCCCGG 
GCTTCTCCAA 
GGCGCACGOG 
TCACACACAC 
AGCAGACAGA 
AAGCCCAGGA 
TCTACAGTGC 
TGCGGGGTTG 
TACATGGAGG 
GCACCCGCTG 
GCCCAGGCAT 
CTTGTGGATC 
GGGCTGATTG 
GTTGTGTCTG 
AGATCCTCAA 
GTGCAGCTGC 
GCACTGTGCT 
AGGTGCCCCG 
GCGGCCAAGA 
CACCTCGGCT 
CTGGGGATGG 
ACTGGTCGAC 
AGACAGGATA 
GGGGGCCTCC 
AGGGCTGGCA 
CCGGGCCACT 



21 
I 

GGTCGGCCTC 
GGCTTCTCAT 
CCCCCAGCGC 
CGCCTGGGGC 
GCCACCCCTG 
GCCCTCGGAC 
CGTCATCTAC 
TGTGAACAAA 
CGTGATCTGG 
TGGGCGGTTC 
CACTTACCTG 
TGGGGCTGGG 
TGTCTGCCAC 
TGAACAGGCC 
ATCAGGCTGC 
TGGCTGGAGA 
CCGACTCCAG 
CCCCTCTGGG 
CATGGCCTCA 
AGGGAACCCC 
CCTGTCCACC 
CTTGGTTCTT 
CAGCCGGCGC 
CCTGACCAAG 
ACCCATCTCC 
CATTGTGGTG 
CAGTGTTCGT 
CACCCTCATG 
TGTGGAAGGC 
GGTGGGCGAC 



31 
I 

TGGAGT ATGG 
GTGGGCGCGG 
TTCTTCCTGA 
CCGCCCCTGC 
CGCCTGGCXJC 
CTCGTGGGCG 
GTGCACAACA 
GGTGACACCG 
AAGAGCAACG 
CTGCTGCAGC 
GAAGCCAGCC 
CGCTGGGGGC 
GACCATGACG 
TGCAGAGAGG 
CGGGGCCTCA 
GGAAGCCAGT 
TGCCAGTGTC 
TGGCATGGAG 
GAACTGGAGT 
TTCCCCGTGC 
AAGGCCATTG 
GCGGACAGTG 
TTCAAGGTCA 
CAGAGCCGCC 
ACTGTCCGCC 
GACCCCAGTG 
GTGCAGCTGA 
ACCACAGACT 
ACTGACCGGC 
GGTTTCCTGC 



41 
I 

TCTGGCGGGT 
CGGTGGACCT 
CTTGCGTGTC 
TGCTGGAGAA 
GCAACGGTTC 
TCTTCTCCTG 
GCCCTGGAGC 
CTGTACTTTC 
GATCCTACTT 
TCCCAAATGT 
CCCTGGGCAG 
CAGGCTGTAC 
GCGAATGTGT 
GCCGTTTTGG 
CCTTCTGCCT 
GCCAAGAAGC 
AGAATGGTGG 
TGCACTGTGA 
TCAACTTAGA 
GGGGCAGCAT 
TGGAGCCAGA 
GGTTCTGGGA 
ATGTGAAAGT 
AGCTTGrGGT 
TGCACTACCG 
AGAACGTGAC 
GCCGGCCAGG 
GTCCTGAGCC 
TGCGAGTGAG 
TGCGCCTGTG 



51 
I 

GCCCCCTTTC 
GACGCTGCTG 
TGGGGAGGCC 
GGACGACCGT 
GCACCAGGTC 
CGTGGGCGGT 
CCACCTGCTT 
TGCACGTGTG 
CTACACCCTG 
GCAGCCACCA 
CGCCTTCTTT 
CAAGGAGTGC 
ATGCCCCCCT 
GCAGAGCTGC 
CCCAGACCCC 
TTGTGCCCCT 
CACTTGTGAC 
GAAGTCAGAC 
GACGATGCCC 
AGAGCTACGC 
GAAGACCACA 
GTGCCGTGTG 
GCCCCCCGTG 
CTCCCCGCTG 
GCCCCAGGAC 
GTTAATGAAC 
GGAAGGAGGA 
TTTGTTGCAG 
CTGGTCCTTG 
GGACGGGACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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10 



15 



20 



25 



30 



35 



CGGGGGCAGG 
GGACTCACGC 
G6CC0GGCCT 
CACCTCCACG 
GCTCTGCCTG 
GACCCACTGT 
AACGCCAGCA 
AGCAACACAG 
GA6AGCCGG0 
GTGTCTGCCA 
AGCTGCCTGC 
CTGCAGTTCA 
CTGAGCTACC 
AACTTCGGCC 
ATCAAAATGC 
GAAGTTCTGT 
AACCGAGGTT 
CTGCGGAAAA 
TCTACCCTTA 
TACCTGAGTG 
GAGAACCTAG 
AAGAAOACGA 
GTCTATACCA 
CTTGGAGGTA 
GGCTACOGCA 
TGCTGGCGGG 
CGCATGCTGQ 
GCGGGCATTG 
GCTGGCCGGA 
CTTAAGCTGC 
GGTGGGCTTA 
TCCTTCTTTC 
CAAACCCCCA 
CAGCTACTCC 



AGOGGCGGGA 
CTGGCACCCA 
CGCCCCCTGC 
CCCAGGCCCT 
GGCCAATATC 
GGATAGACGT 
CGOGCTACCT 
TAGAAGAGTC 
CAGCTGAAGA 
CCTGCCTCAC 
ATCGGAGACG 
GCTCAGGGAC 
CAGTGCTAGA 
AGGTCATCCG 
TGAAAGAGTA 
GCAAATTGGG 
ACTTGTATAT 
OCCGGGTCCT 
GCTCCCGGCA 
AGAAGCAGTT 
CCTCCAAGAT 
TGGGGCGTCT 
CCAAGAGTGA 
CACCCTACTG 
TGGAGCAGCC 
ACCGTCCCTA 
AAGCCAGGAA 
ATGCCACAGC 
GCAAACTCTG 
CTCAAGGAAT 
GGGGAACTGG 
TAGTTCAGCT 
CTCCAGCTCC 
CACTCCCGGC 



GAACGTCTCA 
CTACCAGCTG 
ACAOGTGCTT 
CTCAGACTCC 
CAAGTAOGTT 
GGACAGGCCT 
CTTCOGCATS 
CACCCTGGGC 
GGGCCTGGAT 
CATCCTGGCC 
CACCTTCACC 
CTTGACACTT 
GTGGGAGGAC 
GGCCATGATC 
TGCCTCTGAA 
GCATCACCCC 
CGCTATTGAA 
AGAGACTGAC 
GCTGCTGCGT 
GATCCACAGG 
TGCAGACTTC 
CCCTGTGCGC 
TGTCTGGTCC 
TGGCATGACC 
TCGAAACTGT 
TGAGCGACCC 
GGCCTATGTG 
TGAGGAGGCC 
CTGTCTAACC 
TTTTTTAACT 
GTTCCCATGC 
GCCCCACAGG 
TTCGCTTAAG 
CTGTCATTCA 



TCCCCCCAGG 
GATGTGCAGC 
CTGCCCCCCA 
GAGATCCAGC 
GTGGAGGTGC 
GAGGAGACAA 
CGGGCCAGCA 
AACGGGCTGC 
CAGCAGCTGA 
GCCCTTTTAA 
TACCAGTCAG 
ACCCGGCGGC 
ATCACCTTTG 
AAGAAGGACG 
AATGACCATC 
AACATCATCA 
TATGCCCCCT 
CCAGCTTTTG 
TTCGCCAGTG 
GACCTGGCTG 
GGCCTTTCTC 
TGGATGGCCA 
TTTGGAGTCC 
TGTGCCGAGC 
GACGATGAAG 
CCCTTTGCCC 
AACATGTCGC 
TGAG CTGCCA 
TGTGACCAGT 
TAAGGGAGAA 
TTTGTAGGTG 
TGTGTTTCCC 
CCAGCACTCA 
GAAAAAAATA 



CCCGCACTGC 
TCTACCACTG 

TGACATGGAA 
AGGTGGCTGG 
GCACCATCAT 
TTCAGGGGCT 
AGGCTGAGGG 
TCCTGGCGGT 
CCCTGGTGTG 
GCTCGGGCGA 
CAAAACTGCA 
AOGACCTCAT 
GGCTGAAGAT 
GTGACTTTGC 
ACCTCCTGGG 
ACGGGAACCT 
CTCGAGAGCA 
ATGCGGCCAA 
CCCGGAATGT 
GGGGAGAGGA 
TTGAGTCCCT 
TTCTTTGGGA 
TCTATGAAAA 
TOTACGAGCT 
AGATTGCGCT 
TGTTTGAGAA 
TCCAGCCAGA 
CTGACCCTTA 
AAAAAGGGAT 
TCTCATAGCT 
ATCCCACTGC 
CACCACTAAC 
AATGTTCTAA 



CCTCCTGAGG 
CACCCTCCTG 
AGCCCCCOGA 
GCACCCGGAG 
GGGTGCAGGA 
COGTGGCCTC 
CGGGGACTQG 
CCCAGTCCAA 
GGTGGGCTCC 
CATCOGCAGA 
GGAGACCATC 
GCCCGAGCCC 
CGGGGAGGGG 
GAACGCAGCC 
GGGAGAACTG 
GGCCTGTAAG 
GCTAGATTTT 
TGGGACAGCC 
TGGCAT6CAG 
GCTGGTCGGA 
GGTTTATGTG 
GAACTACAGT 
GATAGTGAGC 
GCTGCCCCAG 
GATGCGTCAG 
ACAGCTAGGC 
CTTCACTTAC 
AOGTGGCTCT 
CAGCCTCTGA 
CTGGGGATGG 
ATCCTGGGCA 
TCCCCCAACA 
ATGCCCTGTT 
TAAGCTCCAA 



1860 
1920 
I960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 



Seq ID No: 113 Protajq aequence; 
Protein Accession #: np 005415.1 



40 
45 
50 
55 
60 
65 
70 
75 



MVWRVPPFLL 
LLLEKDDRIV 
NSPGAHLLPD 
QLPNVQPPSS 
DGECVCPPGP 
QCQBACAPGH 
EFNLETMPRI 
SGFWECRVST 
RLHYRPQDST 
DCPBPLLQPW 
QARTAU.TGL 
QLTWKHPEAL 
SIQGLGDWSN 
LTLVCIRRSC 
FEDLIGBGNF 
INLLGACKNR 
SDAANGMQYL 
AIESLNYSVY 
EVYELMRQCW 



11 
I 

PILFLASHVG 
RTPPGPPLRL 
KVTHTVNKGD 
GIYSATYLEA 
TGTRCEQACR 
PGADCRIiQCQ 
NCAAAGNPFP 
SGGQDSRRFK 
MDWSTIWDP 
LEGWHVEGTD 
TPGTHYQLDV 
PGPISKYWE 
TVEESTLGNG 
LHRRRTPTYQ 
GQVIRAMIKK 
GYLYIAIEYA 
8EKQPIHRDL 
TTKSDVWSFG 
RDRPYBRPPP 



21 
I 

AAVDLTLIAN 
ARNQSHQVTL 
TAVLSARVHK 
SPIGSAPPRL 
EGRPGQSCQB 
CQNGGTCDRF 
VRGSIELRKP 
VNVKVPPVPL 
SENVTLMNliR 
RLRVSWSLPL 
QLYHCTLLGP 
VQVAGGAGDP 
LQAEGPVQBS 
SGSGEBTILQ 
OGLKKNAAIK 
PYGNLLDFLR 
AARNVLVGEN 
VLLWEIVSLG 
AQIALQLGRM 



31 
I 

LRLTDPQRFP 
RGFSKPSDLV 
EKQTDVIWKS 
IVRGCGAGRW 
QCPGISGCRG 
SGCVCPSQWH 
DGTVLLSTKA 
AAPRLLTKQS 
PKTGYSVRVQ 
VPGPLVGDGP 
ASPPAHVLLP 
LWIDVDRPEE 
RAABEGLOQQ 
PSSGTLTLTR 
ML KEY AS END 
KSRVLETDPA 
LAEKIADFGL 
GTPYCGMTCA 
LEARKAYVNM 



41 

I 

LTCVSGEAGA 
GVFSCVGGAG 
NG9YPYTLDW 
GPGCTKECPG 
LTFCI4PDPYG 
GVHCEKSDRI 
IVEPEKTTAE 
RQLWSPLVS 
LSRPGEGGEG 
LLRLHDGTRG 
PSGPPAPRHL 
TSTIIRGLNA 
LILAWGSVS 
RPKLQPEPLS 
HRDFAGELEV 
FAREHGTAST 
SRGEEVYVKK 
ELYEKLPQGY 
SLFENFTYAG 



51 
I 

GRGSDAWGPP 
ARRTRVIYVH 
HEAQDGRPLL 
CLHGGVCHDH 
CSCGSGWRGS 
PQILNHASEL 
FEVPRLVLAD 
FSGDGPISTV 
AWGPPTLMTT 
QERRENVSSP 
HAQALSDSEI 
STRYLFRMRA 
ATCLTILAAL 
YPVLEWEDIT 
LCKLGHHPNI 
LSSRQLLRFA 
TMGRLPVRWM 
RMEQPRNCOO 
IDATAEEA 



60 
. 120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



Seq ID NOi 114 DMA sequence 

Nucleic Acid Accession #: NM_002632.1 

Coding sequence t 322-771 (underlined sequences correspond to start and Stop codons) 



1 
I 

GGGATTCGGG 
TCCCCGGGAC 
CCCAGCCACA 
CCACCGGCCG 
CTCCTCCTGT 
TGGCTCGGGA 
CTGGCCGGGC 
GGCTCGTCAG 



11 
I 

CCGCCCAGCT 
CCGCCTGCCC 
GCCTTACCTA 
GGGCCTCGGG 
GCCAGGGGCT 
CGTCTGAGAA 
TGGCGCTGCC 
AGGTGGAAGT 



21 
I 

ACGGGAGGAC 
CTCGGCGCCC 
CGGGCTCCTG 
GCAGCAGTGA 
CCCCGGGGGA 
QATGCCGGTC 
TGCTGTGCCC 
GGTACCCTTC 



31 
I 

CTGGAGTGGC 
CGCCCCGCCG 
ACTCCGCAAG 
GGGAGGCGTC 
TGAGCATGGT 
ATGAGGCTGT 
CCCCAGCAGT 
CAGGAAGTGT 



41 

I 

ACTGGGCGCC 
GGCCGCTCCC 
GCTTCCAGAA 
CAGCCCCCCA 
GGTTTTCCCT 
TCCCTTGCTT 
GGGCCTTGTC 



51 
I 

CGACGGACCA 60 

CGTCGGGTTC 120 

GATGCTCGAA 180 

CTCAGCTCTT 240 

CGGAGCCCCC 300 

CCTGCAGCTC 360 

TGCTGGGAAC 420 

CTACTGCCGG 480 
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GCGCTGGAGA GGCTGGTGGA CGTCGTGTCC GAGTACCCCA GCGAGGTGGA GCACATGTTC 
AGCCCATCCT OTGTCTCCCT GCTGCGCTGC ACOGGCTGCT GCGGCGATGA GAATCTGCAC 
TGTGTGCCGG TGGAGAOGGC CAATGTCACC ATGCAGCTCC TAAAQATCCG TTCTGGGGAC 
CGGCCCTCCT ACGTGGAGCT GAOGTTCTCT CAGCACGTTC GCTGCGAATG CCGGCCTCTG 
CGGGAGAAGA TGAAGCCGGA AAGGTGCGGC GAXGCTGTTC CCCQQAG GTA AC CCACCCCT 
TGGAGGAGAG AGACCCCGCA CCCGGCTCGT GTATTTATTA COGTCACACT CTTCAQTGAC 
TCCTGCTGGT ACCTGCCCTC TATTTATTAG CCAACTGTTT CCCTGCTGAA TGCCTOGCTC 
CCTTCAAGAC GAGGGGCAGG GAAGGACAGG ACCCTCAGGA ATTCAGTGCC TTCAACAACG 
TCAGAGAAAG AGAGAAGCCA GCCACAGACC CCTGGGAGCT TCCGCTTTGA AAGAAGCAAG 
ACACGTGGCC TCGTGAGGGG CAAGCTAGGC CCCAGAGGCC CTGGAGGTCT CCAGGGGCCT 
GCAGAAGGAA AGAAGGGGGC CCTGCTACCT GTTCTTGGGC CTCAGGCTCT GCACAGACAA 
GCAGCCCTTG CTTTOGGAGC TCCTGTCCAA AGTAGGGATG CGGATTCTGC TGGGGCCGCC 
ACGGCCTGOT GGTGGGAAGG CCGGCAGCGG GCGGAGGGGA TTCAGCCACT TCCCCCTCTT 
CTTCTGAAGA TCAGAACATT CAGCTCTGGA GAACAGTGGT TGCCTGGGGG CTTTTGCCAC 
TOCTTGTOCC CCGTGATCTC CCCTCACACT TTGCCATTTG CTTGTACTGG GACATTOTTC 
TTTCCGGCCG AGGTGCCACC ACCCTGCCCC CACTAAGAGA CACATACAGA GTGGGCCCCG 
GGCTSGAGAA AGAGCTGCCT GGATGAGAAA CAGCTCAGOC AGTGGGGATG AGGTCACCAG 
GGGAGGAGCC TGTGCGTCCC AGCTGAAGGC AGTGGCAGGG GAGCAGGTTC CCCAAGGGCC 
CTGGCACCCC CACAAGCTGT CCCTGCAGGG CCATCTGACT GCCAAGCCAG ATTCTCTTGA 
ATAAAGTATT CTAGTGTGGA AACGC , 

Seq ID No: 115 Pro^n, sequence;, 
Protein Accession #: NP_002623.1 

1 11 21 31 41 

I I I I I 

MPVMRLPPCP LQLLAGLALP AVPPQQWALS AGNGSSEVEV VPFQBVWGRS 
WSBYPSEVE HMFSPSCVSL LRCTGCCGDB NLHCVPVBTA NVTMQLLKIR 
TFSQHVRCBC RPLREKMKPE RCGDAVPRR 

Seq ID NO: llfi PNA sequence 
Nucleic Acid Accession *h NM_00736l.l 
Coding sequence: 1-4131 (underlined sequences correspond to start and atop codono ) 

1 11 21 31 41 51 

I I I I I I 

ATSGAGGGGG ACCGGGTGGC CGGGCGGCCG GTGCTGTCGT CGTTACCAGT GCTACTGCTG 60 

CTGCAOTTGC TAATGTTGCG GGCCGCGGCG CTGCACCCAG ACOAGCTCTT CCCACACGGG 120 

GAGTCGTGGT GGGACCAGCT CCTGCAGGAA GGCGACGACG TAAAGCTCAG CCGTGGTGAA 180 

GCTGGCGAAT CCCCTGCACT TCTTACGAAG CCCGATTCAG CAACCTCTAC GTGGGCACCA 240 

ACGGCATCAT CTCCACTCAG GACTTCCCCA GGGAAACGCA GTATGTGGAC TATGATTTCC 300 

CCACCGACTT CCCGGCCATC GCCCCTTTTC TGGCGGACAT CGACACGAGC CACGGCAGAG 360 

GCCGAGTCCT GTACCGAGAG GACACCTCCC CCGCAGTGCT GGGCCTGGCC GCCCGCTATG 420 

TGCGCGCTGG CTTCCCGCGC TCTGCGCGCT TTTTACCCCC ACCCACGCCT TCCTGGCCAC 480 

CTGGGAGCAG GTAGGCGCTT ACGAGGAGGT CAAACGCGGG CGCTGCCCTC GGGAGAGCTG 540 

AACACTTTCC AGGCAGTTTT GGCATCTGAT GGGTCTGATA GCTACGCCCT CTTTCTTTAT 600 

CCTGCCAACG GCCTGCAGTT CCTTGGAACC CGCCCCAAAG AGTCTTACAA TGTCCAGCTT 660 

CAGCTTCCAG CTCGGGTGGG CTTCTGCCGA GGGGAGGCTG ATGATCTGAA GTCAGAAGGA 720 

CCATATTTCA GCTTGACTAG CACTGAACAG TCTGTGAAAA ATCTCTATCA ACTAAGCAAC 780 

CTGGGGATCC CTGGAGTGTG GGCTTTCCAT ATCGGCAGCA CTTCCCCGTT GGACAATGTC 840 

AGGCCAGCTG CAGTTGGAGA CCTTTCCGCT GCCCACTCTT CTGTTCCCCT GGGACGTTCC 900 

TTCAGCCATG CTACAGCCCT GGAAAGTGAC TATAATGAGG ACAATTTGGA TTACTACGAT 960 

GTGAATGAGG AGGAAGCTGA ATACCTTCCG GGTGAACCAG AGGAGGCATT GAATGGCCAC 1020 

AGCAGCATTG ATGTTTCCTT CCAATCCAAA GTGGATACAA AGCCTTTAGA GGAATCTTCC 1080 

ACCTTGGATC CTCACACCAA AGAAGGAACA TCTCTGGGAG AGGTAGGGGG CCCAGATTTA 1140 

AAAGGCCAAG TTGAGCCCTG GGATGAGAGA GAGACCAGAA GCCCAGCTCC ACCAGAGGTA 1200 

GACAGAGATT CACTGGCTCC TTCCTGGGAA ACCCCACCAC CGTACCCCGA AAACGGAAGC 1260 

ATCCAGCCCT ACCCAGATGG AGGGCCAGTG CCTTCGGAAA TGGATGTTCC CCCAGCTCAT 1320 

CCTGAAGAAG AAATTGTTCT TCGAAQTTAC CCTGCTTCAG GTCACACTAC ACCCTTAAGT 1380 

CGAGGGACGT ATGAGGTGGG ACTGGAAGAC AACATAGGTT CCAACACCGA GGTCTTCACG 1440 

TATAATGCTG CCAACAAGGA AACCTGTGAA CACAACCACA GACAATGCTC CCGGCATGCC 1500 

TTCTGCACGG ACTATGCCAC TGGCTTCTGC TGCCACTGCC AATCCAAGTT TTATGGAAAT 1560 

GGGAAGCACT GTCTGCCTGA GGGGGCACCT CACCGAGTGA ATGGGAAAGT GAGTGGCCAC 1620 

CTCCACGTGG GCCATACACC CGTGCACTTC ACTGATGTGG ACCTGCATGC GTATATCGTG 1680 

GGCAATGATG GCAGAGCCTA CACGGCCATC AGCCACATCC CACAGCCAGC AGCCCAGGCC 1740 

CTCCTCCCCC TCACACCAAT TGGAGGCCTG TTTGGCTGGC TCTTTGCTTT AGAAAAACCT 1800 

GGCTCTGAGA ACGGCTTCAG CCTCGCAGGT GCTGCCTTTA CCCATGACAT GGAAGTTACA 1860 

TTCTACCCGQ GAGAGGAGAC GGTTCGTATC ACTCAAACTG CTGAGGGACT TGACCCAGAG 1920 

AACTACCTGA GCATTAAGAC CAACATTCAA GGCCAGGTGC CTTACGTCCC AGCAAATTTC 1980 

ACAGCCCACA TCTCTCCCTA CAAGGAGCTG TACCACTACT CCGACTCCAC TGTGACCTCT 2040 

ACAAGTTCCA GAGACTACTC TCTGACTTTT GGTGCAATCA ACCAAACATG GTCCTACCGC 2100 

ATCCACCAGA ACATCACTTA CCAGGTGTGC AGGCACGCCC CCAGACACCC GTCCTTCCCC 2160 

ACCACCCAGC AGCTGAACGT GGACCGGGTC TTTGCCTTGT ATAATGATGA AGAAAGAGTG 2220 

CTTAGATTTG CTGTGACCAA TCAAATTGGC CCGGTCAAAG AAGATTCAGA CCCCACTCCG 2280 

GTGAATCCTT GCTATGATGG GAGCCACATG TGTGACACAA CAGCACGGTG CCATCCAGGG 2340 

ACAGGTGTAG ATTACACCTG TGAGTGCGCA TCTGGGTACC AGGGAGATGG ACGGAACTGT 2400 
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GTGGATGAAA ATGAATGTGC AACTGGCTTT CATCGCTGTG GCCCCAACTC TGTATGTATC 2460 

AACTTGCCTG GAAGCTACAG GTGTGAGTGC CGGAGTGGTT ATGAGTTTGC AGATGACCGO 2520 

CATACTT6CA TCTTGATCAC CCCACCTQCC AACCCCTGTG AGGATGGCAQ TCATACCTGT 2580 

GCTCCTGCTO GGCAGGCCCG GTGTGTTCAC CATGGAGGCA OCACGTTCAG CTGTGCCTGC 2640 

5 CTGCCTGGTT ATGCOGGCGA TGGGCACCAG TGCACTGATG TAOATGAATG CTCAGAAAAC 2700 

AGATGTCACC CTGCAGCTAC CTGCTACAAT ACTCCTGGTT CCTTCTCCTQ CCGTTGTCAA 2760 

CCCGGATATT ATGGGGATGG ATTTCAGTGC ATACCTGACT CCACCTCAAG CCTGACACCC 2820 

TGTGAACAAC AGCAGCGCGA TGCCCAGGCC CAGTATGCCT ACCCTGGGGC CCGGTTCCAC 2880 

ATCCCCCAAT GOGAOGAOCA GGGCAACTTC CTGCCCCTAC AGTGTCATGG CAGCACTGGT 2940 

10 TTCTGCTGGT GCGTGGACCC TGATGGTCAT GAAGTTCCTG GTACCCAGAC TCCACCTGGC 3000 

TCCACCCCGC CTCACTGTGG ACCATCACCA GAGCCCACCC AGAGGCCCCC GACCATCTGT 3060 

GAGCGCTGGA GGGAAAACCT GCTGGAGCAC TACGGTGGCA CCCCCCGAGA TGACCAGTAC 3120 

GTGCCCCAGT GCGATGACCT GGGCCACTTC ATCCCCCTGC AGTGCCACGG AAAGAGCGAC 3180 

TTCTGCTGGT GTGTGGACAA AGATGGCAGA GAGGTGCAGG GCACCCGCTC CCAGCCAGGC 3240 

15 ACCACCCCTG CGTGTATACC CACCGTCGCT CCACCCATGG TCCGGCCCAC GCCCCGGCCA 3300 

GATGTGACCC CTCCATCTGT GGGCACCTTC CTGCTCTATA CTCAGGGCCA GCAGATTGGC 3360 

TACTTACCCC TCAATGGCAC CAGGCTTCAG AAGGATGCAG CTAAGACCCT GCTGTCTCTG 3420 

CATGGCTCCA TAATCGTGGG AATTGATTAC GACTGCCGGG AGAGGATGGT GTACTGGACA 3480 

GATGTTGCTG GACGGACAAT CAGCCGTGCC GGTCTGGAAC TGGGAGCAGA GCCTGAGACG 3540 

20 ATCGTGAATT CAGGTCTGAT AAGCCCTGAA GGACTTGCCA TAGACCACAT CCGCAGAACA 3600 

ATGTACTGGA CGGACAGTGT CCTGGATAAG ATAGAGAGCG CCCTGCTGGA TGGCTCTGAG 3660 

OGCAAGGTCC TCTTCTACAC AGATCTGGTG AATCCCCGTG CCATCGCTGT GGATCCAATC 3720 

CGAGGCAACT TGTACTGGAC AGACTGGAAT AGAGAAGCTC CTAAAATTGA AACGTCATCT 3780 

TTAGATGGAG AAAACAGAAG AATTCTGATC AATACAGACA TTGGATTGCC CAATGGCTTA 3840 

25 ACCTTTGACC CTTTCTCTAA ACTGCTCTGC TGGGCAGATG CAGGAACCAA AAAACTGGAG 3900 

TGTACACTAC CTGATGGAAC TGGACGGCGT GTCATTCAAA ACAACCTCAA GTACCCCTTC 3960 

AGCATCGTAA GCTATGCAGA TCACTTCTAC CACACAGACT GGAGGAGGGA TGGTGTTGTA 4020 

TCAGTAAATA AACATAGTGG CCAGTTTACT GATGAGTATC TCCCAGAACA ACGATCTCAC 4080 

CTCTACGGGA TAACTGCAGT CTACCCCTAC TGCCCAACAQ GAAGAAA OTA AG TACAGTAA 4140 

30 TGTAAAGGAA GACTTGGAGT TTACAATCAG AACCTGGACC CTAAAGAACA GTGACTGCAA 4200 

AGGCAAAGAA AGTAAAAAAG GAATTGGCCA TTAGACGTTC CTGAGCATCC AAGATGAACA 4260 

TTTTGTAGTG CAAAAAGACT TTTGTGAAAA GCTGATACCT CAATCTTTAC TACTGTATTT 4320 

TTAAAAATGA AGGTTGTTAT TGCAAGTTTA AAAAGGTAAC AGAATTTTAA CTGTTGCTTA 4380 

TTAAAGCAAC TTCTTGTAAA CATTTATCAT TAATATTTAA AAGATCAAAT TCATTCAACT 4440 

35 AAGAATTAGA GTTTAAGACT CTAAACCTGA TTTTTGCCAT GGATTCCTTC TGGCCAAGAA 4500 

ATTAAAGCAC ATGTGATCAA TATAACAATA TAATCCTAAA CCTTGACAGT TGGAGAAGCC 4560 

AATGCAGAAC TGATGGGAAA GGACCAATTA TTTATAGTTT CCCAACAAAA GTTCTAAGAT 4620 

TTTTTACCTC TGCATCAGTG CATTTCTATT TATATCAAAA GGTGCTAAAA TGATTCAATT 4680 

TGCATTTTCT GATCCTGTAG TGCCTCTATA GAAGTACCCA CAGAAAGTAA AGTATCACAT 4740 

40 TTATAAATAC CAAAGATGTA ACAATTTTAA AATTTTCTAG ATTACTCCAA TAAAGTGTTT 4800 
TAAGTTTAAA AAAAAAAAAA AAAAAAAAA 



Seq ID No: 117 Protein seance, ; 
40 Protein Accession #: NP_031387.1 

1 11 21 31 41 51 

I I I ' I I I 

MEGDRVAGRP VLSSLPVLLL LQLLMLRAAA LHPDELFPHG ESWWDQLLQE GDDVKLSRGE 

50 AGBSPALLTK PDSATSTWAP TASSPLRTSP GKRSMWTMIS PPTSRPSPLF WRTSTRATAE 
ABSCTERTPP PQCWAWPPAM CALASRALRA PYPHPRLPGH LGAGRRLRGG GTRALPSGEL 
NTFQAVLASD GSDSYALPLY PANGLQFLGT RPKESYNVQL QLPARVGPCR GBADDLKSEG 
PYPSLTSTEQ SVKNLYQLSN LGIPGVWAFK IGSTSPLDHV RPAAVGDLSA AHSSVPLGRS 
PSHATALESD YNEDNLDYYD VNEBBABYLP GBPEBALNGH SSIDVSFQSK VDTKPLBESS 

55 TLDPHTKEGT SLGEVGGPDL KGQVEPWDER BTRSPAPPEV DRDSLAPSWB TPPPYPENGS 
IQPYPDGGPV PSBMDVPPAH PEEEIVLRSY PASGHTTPLS RGTYEVGLED NIGSNTEVFT 
YNAANKETCB HNHRQCSRHA PCTDyATGPC CHCQSKFYGN GKHCLPBGAP HRVNGKVSGH 
LHVGBTPVHP TDVDLHAYIV GNDGRAYTAI SHIPQPAAQA LLPLTPIGGL FGWLPALEKP 
GSENGFSLAG AAPTHDMEVT FYPGBETVRl TQTAEGLDPE NYLSIKTNIQ GQVPYVPANF 

60 TAHISPYKEL YHYSDSTVTS TSSRDYSLTF GAINQTWSYR IHQNITYQVC RHAPRHPSFP 
TTQQLHVDRV FALYNDEERV LRFAVTNQIG PVKEDSDPTP VNPCYDGSHM CDTTARCHPG 
TGVDYTCECA SGYQGDGRKC VDENECATGF HRCGPNSVCI NLPGSYRCEC RfiGYEFADDR 
HTCILITPPA NPCEDGSHTC APAGQARCVH HGG5TFSCAC LPGYAGDGHQ CTDVDECSKN 
RCHPAATCYN TPGSFSCRCQ PGYYGDGFQC IPDSTSSLTP CEQQQRHAQA QYAYPGARFH 

65 IPQCDEQGNF LPLQCHGSTG FCWCVDPDGH BVPGTQTPPG STPPHCGPSP EPTQRPPTIC 
ERWRENLLEH YGGTPRDDQY VPQCODLGKF IPLQCHGKSO FCWCVDKDGR EVQGTRSQPG 
TTPACIPTVA PPMVRPTPRP DVTPPSVGTF LLYTQGQQIO YLPLNGTRLQ KDAAKTLLSL 
HGSIIVGIDY DCRERMVYWT DVAGRTISRA GLELGAEPET IVNSGLISPE GLAIDHIRRT 
MYWTDSVLDK IESALLDGSE RKVLFYTDLV NPRAIAVDPI RGNLYWTDWN REAPKIETSS 

70 LDGBNRRILI NTDIGLPNGL TFDPFSKliLC WADAGTKKLE CTLPDGTGRR VIQNNIiKYPF 
SIVSYAJDHFY HTDWRRDGW SVNKHSGQPT DBYLPEQRSH LYGITAVYPY CPTGRK 

Seq ID NOi 118 DNA secnience 
Nucleic Acid Accession #: NM 003088.1 
75 Coding sequence: 112-1593 (underlined sequences correspond to start and stop codons) 
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I 11 21 31 41 SI 

II III! 

GCGGAGGGTG CGTGCGGGCC GCGGCAGCCG AACAAAGGAG CAGGGGCGCC GCCGCAGGGA 60 

CCCGCCACCC ACCTCCOGGG GCCGCGCAGC GGCCTCTCGT CTACTGCCAC CATGACCGCC X20 

5 AACGGCACAG CCGAGGCGGT QCAGATCCAG TTOGGCCTCA TCAACTGCGG CAACAAGTAC 180 

CTGACGGCCG AGGCGTTCGG OTTCAAGGTG AACGCGTCCG CCAGCAOCCT GAAOAAGAAG 240 

CAGATCTGGA CX3CTGGAGCA GCCCCCTGAC GAGGCGGGCA GCGCGGCCGT GTGCCTGCGC 300 

AGCCACCTGG GCOGCTACCT GGCGGCGGAC AAGGACGGCA ACGTQACCTG CGAGCGCGAG 360 

GTGCCCGGTC CCGACTGCCO TTTCCTCATC GTGGOGCACG ACGACGGTCQ CTGGTOGCTG 420 

10 CAGTCCGAGG CQCAC06GCG CTACTTCGGC GGCACCOAGG ACCGCCTOTC CTGCTTOGCG 480 

CAGACGGTGT CCCCCGCCGA GAAGTGGAGC GTGCACATCG CCATGCACCC TCAGGTCAAC 540 

ATCTACAGTG TCACCCGTAA GCGCTAOOCG CACCTGAGCG CGCGGCCGGC CGAOGAGATC 600 

GCCGTGGACC GCGACGTGCC CTGGGGCGTC GACTCGCTCA TCACCCTCGC CTTCCAGGAC 660 

CAGCGCTACA GCGTGCAGAC CGCCOACCAC CGCTTCCTGC GCCACGACGG GOGCCTGGTG 720 

15 GCGCGCCCCG AGCCGGCCAC TGGCTACAGG CTGGAGTTCC GCTCCGGCAA GGTGGCCTTC 780 

CGCCACTGCG AGGGCCGTTA CCTGGCGCCG TCGGGGCCCA GCGGCACGCT CAAGGCOGGC 840 

AAGQCCACCA AGGTGGGCAA GGACGAGCTC TTTGCTCTGG AGCAGAGCTO CGCCCAGGTC 900 

GTGCTGCAGG CGGCCAACGA GAGGAAOGTG TCCACGCGCC AGGGTATGGA CCTGTCTGCC 960 

AATCAGGACG AGGAGACCGA CCAGGAGACC TTCCAGCTGG AGATCGACCG OGACACCAAA X020 

20 AAGTGTGCCT TCCGTACCCA CACGGGCAAG TACTGGACGC TGAOGGCCAC OGGGGGCGTG X080 

CAOTCCACCG CCTCCAGCAA GAATGCCAGC TGCTACTTTG ACATCGAGTG GCOTGACCGG 1140 

CGCATCACAC TGAGGGOGTC CAATGGCAAG TTTGTGACCT CCAAGAAGAA TGGGCAQCTG 1200 

GCCGCCTCGG TGGAGACAGC AGGGGACTCA GAGCTCTTCC TCATGAAGCT CATCAACCGC 1260 

CCCATCATCG TGTTCCGCGG GGAGCATGGC TTCATCGGCT GCCGCAAGGT CACGGGCACC 1320 

25 CTGGACGCCA ACCGCTCCAG CTATGACGTC TTCCAGCTGG AGTTCAAOGA TGGCGCCTAC 1380 

AACATCAAAG ACTCCACAGG CAAATACTGG ACGGTGGGCA GTGACTCCGC GGTCACCAGC 1440 

AGCGGCGACA CTCCTGTGGA CTTCTTCTTC GAGTTCTGCG ACTATAACAA GGTGGCCATC 1500 

AAGGTGGGCG GGCGCTACCT GAAGGGCGAC CACGCAGGCG TCCTGAAGGC CTCGGCGGAA 1560 

ACCGTGGACC CCGCCTCGCT CTGGGAGTAC TAGGGCCGGC CCGTCCTTCC CCGCCCCTGC 1620 

30 CCACATGGCG GCTCCTGCCA ACCCTCCCTG CTAACCCCTT CTCCGCCAGG TGGGCTCCAG 1680 

GGCGGGAGGC AAGCCCCCTT GCCTTTCAAA CTGGAAACCC CAGAGAAAAC GGTGCCCCCA 1740 

CCTGTCGCCC CTATQGACTC CCCACTCTCC CCTCCGCCCG GGTTCCCTAC TCCCCTCGGG 1800 

TCAGCGGCTG CGGCCTGGCC CTGGGAGGGA TTTCAGATGC CCCTGCCCTC TTGTCTGCCA I860 

CGGGGCGAGT CTGGCACCTC TTTCTTCTGA CCTCAGACGG CTCTGAGCCT TATTTCTCTG 1920 

35 GAAGCGGCTA AGGGAOGGTT GGGGGCTGGG AGCCCTGGGC GTOTAGTGTA ACTGGAATCT 1980 

TTTGCCTCTC CCAGCCACCT CCTCCCAGCC CCCCAGGAGA GCTGGGCACA TGTCCCAAGC 2040 

CTGTCAGTGG CCCTCCCTGG TGCACTGTCC CCGAAACOCC TGCTTGGGAA GGGAAGCTGT 2100 

CGGGAGGGCT AGGACTGACC CTTGTGGTGT TTTTTTGGGT GGTGGCTGGA AACAGCCCCT 2160 

CTCCCACGTG GGAGAGGCTC AGCCTGGCTC CCTTCCCTGG AGCGGCAGGG CGTGACGGCC 2220 

40 ACAGGGTCTG CCCGCTGCAC GTTCTGCCAA GGTGGTGGTG GCGGGCGGGT AGGGGTGTGG 2280 

GGGCCGTCTT CCTCCTGTCT CTTTCCTTTC ACCCTAGCCT GACTGGAAGC AGAAAATQAC 2340 

CAAATCAGTA TTTTTTTTAA TGAAATATTA TTGCTGGAGG CGTCCCAGGC AAGCCTOGCT 2400 

GTAGTAGCGA GTGATCTGGC GGGGGGCGTC TCAGCACCCT CCCCAGGGGG TGCATCTCAG 2460 

CCCCCTCTTT CCGTCCTTCC CGTCCAGCCC CAGCCCTGGG CCTGGGCTGC CGACACCTGG 2520 

45 GCCAGAGCCC CTGCTGTGAT TGGTGCTCCC TGGGCCTCCC GGGTGGATGA AGCCAGGCGT 2580 

CGCCCCCTCC GGGAGCCCTG GGGTGAGCCG CCGGGGCCCC CCTGCTGCCA GCCTCCCCCG 2640 

TCCCCAACAT GCATCTCACT CTGGGTGTCT TGGTCTTTTA TTTTTTGTAA GTGTCATTTG 2700 

TATAACTCTA AACGCCCATG ATAGTAGCTT CAAACTGGAA ATAGCGAAAT AAAATAACTC 2760 
AGTCTGC 

50 

Seq ID No: 119 Protean sequence: 
Protein Acceasion #: NPJJ03079.1 

55 1 11 21 31 41 51 

I I I I I I 

MTANGTAEAV QIQFGLINCG NKYLTABAFG FKVNASASSL KKKQIWTLEQ PPDBAG8AAV 60 

CLRSHLGRYL AADKDGNVTC EREVPGPDCR FLIVAHDDGR WSLQSEAHRR YPGGTEDRLS 120 

CFAQTVSPAE KWSVHIAHHP QVNIYSVTRK RYAHLSARPA DBIAVDRDVP WGVDSLITLA 180 

60 FQDQRYSVQT ADHRFLRHDG RLVARPEPAT GYTLEFRSGK VAFRDCEGRY LAPSGPSGTL 240 

KAGKATKVGK DELFALEQSC AQWLQAANE RNVSTRQGMD LSANQDEETD QETFQLSIDR 300 

DTKKCAFRTH TGKYWTLTAT GGVQSTASSK NASCYFDIEK RDRRITLRAS NGKFVTSKKN 360 

GQLAASVBTA GDSELFLMKL INRPIIVFRG EHGFIGCRKV TGTLDANRSS YDVFQLEFND 420 

GAYNIKDSTG KYWTVGSDSA VTSSGDTPVD FFFEFCDYNK VAIKVGGRYL KGDHAGVLKA 460 

65 SAETVDPASL WEY 

Seq ID NO: 120 pNA sequence 

Nucleic Acid Accession #i NMJ)06404.1 

Coding sequence: 25-741 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CAGGTCCGGA GCCTCAACTT CAGGATGTTG ACAACATTGC TGCCGATACT GCTGCTGTCT 60 
75 GGCTGGGCCT TTTGTAGCCA AGACGCCTCA GATGGCCTCC AAAGACTTCA TATGCTCCAG 120 
ATCTCCTACT TCCGCGACCC CTATCACGTG TGGTACCAGG GCAACGCGTC GCTOGGGGGA 180 



236 



WO 02/079492 



PCT/US02/04915 



CACCTAACGC ACGTGCTGGA AGGCCCAGAC ACCAACACCA CGATCATTCA GCTGCAGCCC 240 

TTGCAGGAOC CCGA6AGCTG GGCGOGCACG CAGAGTGGCC TGCAGTCCTA CCTGCTCCAG 300 

TTCCACGGCC T0GTGO6CCT GGTGCACCAG GAGCGGACCT TCGCCTTTCC TCTGACCATC 360 

CGCTGCTTCC TGGGCTGTGA GCTGCCTCCC GAGGGCTCTA GAGCCCATGT CTTCTTCGAA 420 

GTGGCTGTGA ATGGGAGCTC CTTTOTGAGT TTCCGGCCGG AGAGAGCCTT GTGGCAGOCA 4B0 

GACACCCAGQ TCACCTCOGG AGTGGTCACC TTCACCCTGC AGCAGCTCAA TGCCTACAAC 540 

OGCACTCGGT ATGAACTGCG GGAATTCCTG GAGGACACCT GTGTGCAGTA TOTGCAflAAA €00 

CATATTTCCG CGGAAAACAC GAAAGGGAGC CAAACAAGCC GCTCCTACAC TTCGCTGOTC 660 

CTGGGCGTCC TGGTGGGOGG TTTCATCATT GCTGGTGTGG CTGTAGGCAT CTTCCTGTGC 720 

ACAGGTGGAC GGCGATG TTA AT TACTCTCC AGCCCCGTCA GAAGGGGCTG GATTGATGGA 780 

GGCTGGCAAG GGAAAGTTTC AGCTCACTGT GAAGCCAGAC TCCCCAACTG AAACACCAGA 840 

AGGTTTGGAG TGACAGCTCC TTTCTTCTCC CACATCTGCC CACTGAAGAT TTGAGGGAGG 900 

GGAGATGGAG AGGAGAGGTG GACAAAGTAC TTGGTTTGCT AAGAACCTAA GAACGTGTAT 960 

GCTTTGCTGA ATTAGTCTGA TAAGTGAATG TTTATCTATC TTTGTGGAAA ACAGATAATG 1020 

GAGTTGGGGC AGGAAGCCTA TGCGCCATCC TCCAAAGACA GACAGAATCA CCTGAGGOGT 1080 

TCAAAAGATA TAACCAAATA AACAAGTCAT CCACAATCAA AATACAACAT TCAATACTTC 1140 

CAGGTGTGTC AGACTTGGGA TGGGACGCTG ATATAATAGG GTAOAAAGAA GTAACACGAA 1200 

GAAGTGGTGG AAATGTAAAA TCCAAGTCAT ATGGCAGTGA TCAATTATTA ATCAATTAAT 1260 
AATATTAATA AATTTCTTAT ATTT 



Seq ID Mo: 121 protean sequence; 
Protein Accession S: NP_006395.1 

1 11 21 

1 I I 

MLTTLLPILL LSGWAFCSQD ASDGLQRLHM 
PDTNTTIIQL QPLQBPESWA RTQSGLQSYb 
PPEGSRAHVF FBVAVNGSSF VSPRPERALW 
FLEDTCVQYV QKHISAENTK GSQTSRSYTS 



31 41 51 

I I 1 

LQISYFRDPY HVWYQGNASL GGHLTHVLBG 
LQFHGLVRLV HQERTLAPPL TIRCFLGCEL 
QADTQVTSGV VTPTLQQiaiA YNRTRYELRE 
LVLGVXiVGQP IIAGVAVGIP LCTGGRRC 



6eq ID NO: 122 DHA sequence 

Nucleic Acid Accession 8: none found 

Coding sequence: 2-505 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CGAGAAGCTG GGAGAGACAC CACTTGTCCC TGAACAAGAC AATTCAGTAA CATCTATTCC 60 

TGAGATTCCT OGATGGGGAT CACAGAGCAC GATGTCTACC CTTCAAATGT CCCTTCAAGC 120 

CGAGTCAAAG GCCACTATCA CCCCATCAGG GAGCGTGATT TCCAAGTTTA ATTCTACGAC 180 

TTCCTCTGCC ACTCCTCAGG CTTTCGACTC CTCCTCTGCC GTGGTCTTCA TATTTGTGAG 240 

CACAGCAGTA GTAGTGTTGG TGATCTTGAC CATGACAGTA CTGGGGCTTG TCAAGCTCTG 300 

CTTTCACGAA AGCCCCTCTT CCCAGCCAAG GAAGGAGTCT ATGGGCCOGC CGGGCCTGGA 360 

GAGTGATCCT GAGCCCGCTG CTTTGGGCTC CAGTTCTGCA CATTGCACAA ACAATGGGGT 420 

GAAAGTCGGG GACTGTGATC TGCGGGACAG AGCAGAGGGT GCCTTGCTGG CGGAGTCCCC 480 

TCTTGGCTCT AGTGATGC AT AGG GAAACAG GGGACATGGG CACTCCTGTG AACAGTTTTT 540 

CACTTTTGAT GAAACGGGGA ACCAAGAGGA ACTTACTTGT GTAACTGACA ATTTCTGCAG 600 

AAATCCCCCT TCCTCTAAAT TCCCTTTACT CCACTGAGGA GCTAAATCAG AACTGCACAC 660 

TCCTTCCCTG ATGATAGAGG AAGTGGAAGT GCCTTTAGGA TGGTGATACT GGGGGACCGG 720 

GTAGTGCTGG GGAGAGATAT T1TCTTATGT TTATTCGGAG AATTTGGAGA AGTGATTGAA 780 

CTTTTCAAGA CATTGGAAAC AAATAGAACA CAATATAATT TACATTAAAA AATAATTTCT 840 

ACCAAAATGG AAAGGAAATG TTCTATGTTG TTCAGGCTAG GAGTATATTG GTTCGAAATC 900 
CCAGGGAAAA AAATAAAAAT AAAAAATTAA AGGATTGTTG ATAAAA 

Seq ID No: 123 Pro^ej,n sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

EKLGETPLVP BQDNSVTSIP BIPRWGSQST MSTLQMSLQA ESKATITPSG SVISKFNSTT 60 

SSATPQAFDS SSAWPIFVS TAVWLVILT MTVLGLVKLC FHESPSSQPR KBSMGPPGLB 120 
SDPBPAALGS SSAHCTNNGV XVGDCDLRDR AEGALLAESP LGSSDA 



Seq ID HO: 124 DNA sequence 

Nucleic Acid Accession #: NM_O06500.1 

Coding sequence: 27-1967 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I 1.1 I 1 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 

TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 

TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 
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GCATCTTCTT GTGCCAGGGC AAOCQCCCTC GGTCCCAGQA GTACCGCATC CAGCTCCGGG 420 

TCTACAAAGC TOCGGAGGAO CCAAACATCC AGGTCAACCC CCTGGGCATC CCTGTGAACA 480 

GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA COGGTACCCC ATTCCTCAAG 54 0 

TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 

TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

GGAACCACAT GAAGGAGTCC AGGGAAGTCA CGGTCCCTGT TTTCTACCOG ACAGAAAAAG 780 

TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 

GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG CAAGCAGAAC COCAGCACCA 900 

GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT GOTGCTGGAG CCTGCCOGGA 960 

AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT GGACACCATG ATATCGCTGC 1020 

TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA CGTCCGAGTO AGTCCCGCAG 1080 

CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGOGGCTA TCGCTGCGTG GOGTCTGTGC 1260 

CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT GGCCATTTTT GGCCOCCCTT 1320 

GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA CGTCAAOGGC AOGGCAAGTG 1440 

AACAAGACCA AGATCCACAG CGAGTCCTGA GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

TGTTGGAGAC AGGTGTTGAA TGCAOGGCCT CCAAOGACCT GGGCAAAAAC ACCAGCATCC 1560 

rCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 

TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA GGGCAAGCTG COGTGCAGGC 1800 

GCTCAGGGAA GCAGGAGATC ACGCTGCCCC OGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC AAOCAAGGAG CCCCAGTCTC 2220 

CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCOGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2S20 

ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 

GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 2640 

TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT AGTCCCAGCT ACTOGGAAGG 2760 

CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

CACTGCACTC CAGCCTGGGC AACACAGCGA GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

ACGOGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG GGATTAAAGC TATGGTTATA 3060 

TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

AGAATGGTAC TTAGGGATGQ AAAACGGGGC CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 3240 

TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA TATATATATA TATGAAAAAT 3300 

AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA AACOGTTTCC AGTTGGCAGC 3420 

AGAGATCAGO GGTTACCTCT GCTTCTGAGC AAATGGCTCA AGCTCTACCA GAGCAGACAG 3480 

CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA CTT 

Seq ID No: 125 protein, aequencq; 
Protein Accession ft: NP_Q06491.1 



1 11 21 31 41 51 

I I I I I I 

MGLPRLVCAF LLAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGNIiSHV 60 

DWFSVHKEKR TTiIFRVRQGQ GQSEPGEYEQ RLSLQDRGAT LALTQVTPQD ERIFLCQGKR 120 

PRSQEYRIQL RVYKAPBBPN IQVNPLGIPV NSKEPEEVAT CVGRNGYPIP QVTWYKNGRP 180 

LKEEKNRVHI QSSQTVESSG LYTLQSILKA QLVKEDKDAQ FYCELNYRLP SGNHMKESRE 240 

VTVPVPYPTE KVWLEVEPVG MLKEGDRVEI RCLADGNPPP HFSISKQNPS TREAEBBTTN 300 

DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSBPQBLLV NYVSDVRVSP AAPBRQEGSS 360 

LTLTCEAESS QDLEPQWLRE ETDQVLERGP VliQLHDLKRE AGGGYRCVAS VPSIPGLNRT 420 

QLVKLAIPGP FWMAFKERKV WVKENMVUIL SCEASGHPRP TIQWNVNGTA SEQDQDPQRV 480 

IiSTLNVLVTP ELLETGVBCT ASNDLGKNTS ILFLELVNLT TLTPDSNTTT GLSTSTASPH 540 

TRANSTSTKR KLPEPESRGV VIVAVrVCIL VUWIiGAVLY FLYKKGKLPC RRSGKQEITL 600 
PPSRKTEIiW BVKSDKLPBS MGLLQGSSGD KRAPGDQGEK YIDLRH 



Seq ID NO: 126 pNft sequence, 

Nucleic Acid Accession ft: NNJ)019S5.1 
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Coding sequence: 337-975 (underlined sequences correspond to start and atop codons) 



10 



15 



20 



25 



i 
I 

GGAGCTGTTT 
AAGTCAGACG 
AGCTCTCCAC 
A.GGCGCTGCC 
AGGAACCCGC 
GTTTGAACGG 
TCTCTGCTGT 
AGCGCGGTGG 
CGGTCCAAGC 
CTGGACATCA 
AGGTCCAAGA 
TGCCAATGTG 
CTCAGGGCTG 
TCCAAGCTTG 
AGTTCA3AGO 
TCTTTTCATG 
COAGCACATT 
TGTGGCCGAC 
TTCCTGACTG 
TCCCCCAACC 
TGGGGATGAC 



11 
I 

ACCCCCACTC 
CGCCTCTQCA 
CACCGCCGCO 
TTTTCTCCCC 
AGCGCTTTGA 
GAGGTTTTTG 
TTGTGGCTTG 
GTGAGAACGG 
GCTGCTCCTG 
TTTGGGTCAA 
GAGCCTTGGA 
CTAGCCAAAA 
AAGACATTAT 
GGAAAAAGTG 
AACACCTAAG 
ATCCCAAGCT 
GGTGACAGAC 
TCTGCACTCT 
GCAAAGGACC 
ATCTTCACTG 
AATGGACCTC 



21 
I 

TAATAGGGGT 
TCTGCGCCAG 
TOCaCCTGCA 
GTTAAAGGGC 
GGGACCTGAA 
ATCCCTTTTT 
CCAAGGAGCT 
OGGGGAGAAA 
CTOGTCCCTG 
CACTCCCGAG 
GAATTTACTT 
AOACAAGAAG 
GGAGAAAGAC 
TATTTATCAG 
ACAAACCAGG 
GAAAGGCAAG 
TTCGGGGCCT 
CCACCCTGGC 
AGCGTCCTCG 
GCTTCCATCA 
TCAGCAGAAA 



31 
I 

TCAATATAAA 



GACGCTCOGC 
ACTTGGGCTG 
GCTGTTTTTC 
TTCAGAATGQ 
CCAGAAACAG 
CCCACTCCCA 
ATGGATAAAG 
CACGTTGTTC 
CCCACAAAGG 
TGCTGGAATT 
TGGAATAATC 
CAGTTAGTGA 
TCGGAGACCA 
CCCTCCAOAG 
GTCTGAAQCC 
TGGGATCAGA 
TTCAAAACAT 
GTGGTAACTG 
CACACAGTCA 



41 
I 

AAGCCGGCAG 
CCTGOGCCTC 
TCGCTGCCTT 
AAGGATCGCT 
TTCGTTTTCC 
ATTATTTOCT 
CAGTCTTAGG 
GTCCACCCTG 
AGTGTGTCTA 
CGTATGGACT 
CAACAGACCG 
TTTGCCAAGC 
ATAAGAAAGG 
GAGGAAGAAA 
TGAGAAACAG 
AGCQTTATGT 
ATAGCCTCCA 
GCAGGAGCAT 
TOCAAGAAAG 
CTTTGGTCTC 
CATTCGAATT 



51 
I 

AGAGCTGTCC 
CTGCAGTCCC 
CTCTCCTGGC 
TTGAGATCTG 
TTTGGGTTCA 
CATGATTTTC 
CGCTGAGCTC 
GCGGCTCCGC 
CTTCTGCCAC 
TGGAAGCCCT 
TGAGAATAGA 
AGGAAAAGAA 

aaaagactgt 
aatcagaaga' 
cgtcaaatca 

GACCCACAAC 
CGQAGAGCCC 
CCTCTGCTGG 
GTTAAGGAGT 
TTCTTTCATC 
C 



€0 
120 
180 
240 
300 
360. 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1060 
1140 
1200 
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Seq ID No: 127 
Protein Accession #: NP_001946.1 
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I i I I I I 

MDYLLMIFSIi LFVACQGAPB TAVLGAELBA VGENGGBKPT PSPPWRLRRS KRCSCSSLMD 60 
35 KBCVYFCHLD IIWVNTPEHV VPYGLGSPRS KRALENLLPT KATDRENRCQ CASQKDKKCW 120 
NFCQAGKELR AEDIMEKDWN NHKKGKDCSK LGKKCIYQQL VRGRKIRRSS EEHLRQTRSB 180 
TMRNSVKSSF HDPKLKGKPS RERYVTHNRA HW 

40 Seq ID NO: 128 PNA, sequence 

Nucleic Acid Accession #; NM_001721.1 

Coding sequence: 34-2061 (underlined sequences correspond to start and stop codons) 



45 



50 



55 



60 



65 



70 



75 



i 
I 

GCAAGCACGG 
CTTCTTCTCA 
CTTTTTGTTT 
AGCAGAAAAG 
GAGCAGACGC 
TATGTCTATG 
ATAAGGGGTA 
TTCCTGTGTT 
GCTAATCTGC 
GTGCTGAAGA 
ACTCTAGCCC 
AGTACCAGTC 
TTCAACATGC 
AAAAGTAGCA 
CACACCACCT 
AACCTGGATG 
CTCAGACAAA 
TACACAQTGT 
CACGTGCATA 
ATTCCAAAGC 
CACCCTGTGT 
TGGGAACTGA 
GTGGTCCAGC 
GGCTCCATGT 
CCCAAGCTGG 
GAATATATAA 
CCTTCCCAGC 
CACCAATTCA 
GTGAAAGTAT 



11 

I 

AACAAGCTGA 
AAAGATCACA 
TGACCAAAAC 
GATCCATTGA 
CTGTAGAGAG 
CATCAAATGA 
ACCCCCACCT 
GCCAGCAGAG 
ATACTGCAGT 
TACCTCGGGC 
AATATGACAA 
TAGCGCAATA 
AGTATATTCC 
GCAGCAGTGA 
CAAAGATTTC 
ATTATGACTG 
AGGGAAAAGA 
CCTTATTTAG 
CAAATGCTGA 
TTATTCATTA 
CAACAAAGGC 
AAAGAGAAGA 
TGGGCAAGTG 
CAGAAGATGA 
TTAAATTCTA 
GCAATGGCTG 
TCTTAGAAAT 
TACACCGGGA 
CTGACTTTGG 



21 
I 

GAOGGATGAT 
GCAAAAGAAG 
AAACCTTTCC 
AATTAAGAAA 
ACAGTACCCA 
AGAGAGCCGA 
GCTGGTCAAG 
CTGTAAAGCA 
CAATGAAGAG 
AGTTCCTGTT 
CGAATCAAAG 
TGACAGCAAC 
AAGGGAAGAC 
AGATGTTGCA 
ATGGGAATTC 
GTTTGCTGGT 
AGGAGCATTT 
TAAGGCTGTG 
GAACAAATTA 
TCATCAACAC 
CAACAAGGTC 
GATTACCTTG 
GAAGGGGCAG 
ATTCTTTCAG 
TGGAGTGTGT 
CTTGCTGAAT 
GTGCTACGAT 
CTTGGCTGCT 
AATGACAAGG 



31 
I 

AATATGGATA 
AAAATGTCAC 
TACTATGAAT 
ATCAGATGTG 
TTTCAQATTG 
AGTCAGTGGT 
TACCATAGTG 
GCCCCAGGAT 
AAACACAGAG 
CTCAAAATGG 
AAAAACTATG 
TCAAAGAAAA 
TTCCCTGACT 
AGCAGTAACC 
CCTGAGTCAA 
AACATCTCCA 
ATGGTTAGAA 
AATGATAAAA 
TACCTGGCAG 
AATTCAGCAG 
CCCGACTCTG 
TTGAAGGAGC 
TATGATGTTG 
GAGGCCCAGA 
TCAAAGGAAT 
TACCTGAGGA 
GTCTGTGAAG 
CGTAACTGCT 
TATGTTCTTG 



41 

I 

CAAAATCTAT 
CAAATAATTA 
ATGACAAAAT 
TGGAGAAAGT 
TCTATAAAGA 
TGAAAGCATT 
GGTTCTTCGT 
GTACCCTCTG 
TTCCCACCTT 
ATGCACCATC 
GCTCCCAGCC 
TCTATGGCTC 
GGTGGCAAGT 
AAAAAGAAAG 
GTTCATCTGA 
GATCACAATC 
ATTCGAGCCA 
AAGGAACTGT 
AAAACTACTG 
GCATGATCAC 
TGTCCCTGGG 
TGGGAAGTGG 
CTGTTAAGAT 
CTATGATGAA 
ACCCCATATA 
GTCACGGAAA 
GCATGGCCTT 
TGGTGGACAG 
ATGACCAGTA 



51 
I 

TCTAGAAGAA 
CAAAGAACGG 
GAAAAGGGGC 
AAATCTCGAG 
TGGGCTTCTC 
ACAAAAAGAG 
GGACGGGAAG 
GGAAGCATAT 
CCCAGACAGA 
TTCAAGTACC 
ACCATCTTCA 
CCAGCCAAAC 
AAGAAAACTG 
AAATGTGAAT 
AGAAGAGGAA 
TGAACAGTTA 
AGTGGGAATG 
CAAACATTAC 
TTTTGATTCC 
ACGGCTCCGC 
AAATGGAATC 
CCAGTTTGGA 
GATCAAGGAG 
ACTCAGCCAT 
CATAGTGACT 
AGGACTTGAA 
CTTGGAGAGT 
AGATCTCTGT 
TGTCAGTTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
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GTCGGAACAA AGTTTCCAGT CAAGTGGTCA GCTCCAGAGG TGTTTCATTA CTTCAAATAC 1800 

AGCAGCAAGT CAGACGTATG OGCATTTGGG ATCCTGATGT GGGAGGTGTT CAGCCTGGGG 1860 

AAGCAGCCCT ATGACTTQTA TGACAACTCC CAGGTGGTTC TGAAGGTCTC CCAGGGCCAC 1920 

AGGCTTTACC GGCCCCACCT GGCATCGGAC ACCATCTACC AGATCATGTA CAGCTGCTGG 1980 

CACQAGCTTC CAGAAAAGCG TCCCACATTT CAGCAACTCC TGTCTTCCAT TGAACCACTT 2040 

CG6GAAAAAG ACAAGCA TTQ AA GAAGAAAT TAGGAOTGCT OATAAOAATO AATATAGATO 2100 

CTGGC CAGCA TTTTCATTCA TTTTAAGGAA AGTAGGAAGG CATAAOTAAT TTTAGCTAGT 2160 

TTTTAATAGT GTTCTCTOTA TTQTCTATTA TTTAGAAATQ AAGAAGGCAG QAAACAAAAO 2220 

ATTCCCTTC3A AATTTAGATC AAATTAGTAA TTTTGTTTTA TGCTGCTCCT GATATAACAC 2280 

TTTCCAGCCT ATAGCAGAAG CACATTTTCA QACTGCAATA TAGAGACTGT GTTCATGTGT 2340 

AAAGACTGAG CAGAACTGAA AAATTACTTA TTGGATATTC ATTCTTTTCT TTATATTGTC 2400 
ATTGTCACAA CAATTAAATA TACTACCAAG TACAGAAATG TGGAAAAAAA AAACCG 

Seq ID No i 129 Protfijn. sequence; 
Protein Accession #t HP 001712.1 



1 11 21 31 41 51 

I I I I I I 

MDTKSILEEL LliKRSQQKKK MSPNNYKERL PVLTKTNLSY YBYDKMKRG3 RXGSIEIKKI 60 

RCVBKVNIxEE QTPVERQYPP QIVYKDGLLY VYASNEESRfl QWLKALQKEI RGNPHLLVKY 120 

HSGFPVDGKP LCCQQSCKAA PGCTLWBAYA NLHTAVNEEK HRVPTPPDRV LKIPRAVpVL 180 

KMDAPSSSTT LAQYDNESKK NYGSQPPSSS TSLAQYDSNS KKIYGSQPNP NMQYIPREDF 240 

PDWWQVRKLK 8SSSSEDVAS SNQKBRNVNH TTSKISWEFP ESSSSBEESN LDDYDWFAGN 300 

ISRSQSBQLXj RQKGKEGAFM VRNSSQVGMY TVSLF6KAVN DKKGTVKHYH VHTNAENKLY 360 

LAENYCFDSI PKLIHYHQHN SAGMITRLRH PVSTKANKVP DSVSLGNGIW ELKREEITLL 420 

KELGSGQFGV VQLGKMKGQY DVAVKMIKEG SM5BOBFFQE AQTMMKLSHP KLVKPYGVCS 480 

KEYPIYIVTE YISNGCLLNY LRSHGKGLEP SQLItEMCYDV CEGMAFLESH QFIHRDIAAR 540 

NCLVDRDLCV KVSDFGMTRY VLDDQYVSSV GTKFPVKWSA PBVFHYFKYS SKSDVHAFGI 600 

LMWEVFSLGK QPYDLYDNSQ WLKVSQGHR LYRPHLASDT IYQIMYSCWH ELPEKRPTFQ 660 
QLLSSIEPLR EKDKK 



Seq ID NO; 130 DNA sequence 

Nucleic Acid Accession #j NM_012072.2 

Coding sequence: 149-2107 (underlined sequences correspond to start and stop codons) 



l 11 21 31 41 51 
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AAAGCCCTCA GCCTTTGTGT CCTTCTCTGC GCCGGAGTGG CTGCAGCTCA CCCCTCAGCT 60 

CCCCTTGGGG CCCAGCTGGG AGCCGAGATA GAAGCTCCTG TCGCCGCTGG GCTTCTCGCC 120 

TCCCGCAGAG GGCCACACAG AGACCGG GAT GG CCACCTCC ATGGGCCTGC TGCTGCTGCT 180 

GCTGCTGCTC CTGACCCAGC CCGGGGCGGG GACGGGAGCT GACACGGAGG CGGTGGTCTG 240 

CGTGGGGACC GCCTGCTACA CGGCCCACTC GGGCAAGCTG AGCGCTGCCG AGGCCCAGAA 300 

CCACTGCAAC CAGAACGGGG GCAACCTGGC CACTGTGAAG AGCAAGGAGG AGGCCCAGCA 360 

CGTCCAGCGA GTACTGGCCC AGCTCCTGAG GCGGGAGGCA GCCCTGACGG CGAGGATGAG 420 

CAAGTTCTGG ATTGGGCTCC AGCGAGAGAA GGGCAAGTGC CTGGACCCTA GTCTGCOGCT 480 

GAAGGGCTTC AGCTGGGTGG GCGGGGGGGA GGACACGCCT TACTCTAACT GGCACAAGGA 540 

GCTCCGGAAC TCGTGCATCT CCAAGCGCTG TOTGTCTCTG CTGCTGGACC TGTCCCAGCC 600 

GCTCCTTCCC AACCGCCTGC CCAAGTGGTC TGAGGGCCCC TGTGGGAGCC CAGGCTCCCC 660 

CGGAAGTAAC ATTGAGGGCT TCGTGTGCAA GTTCAGCTTC AAAGGCATGT GCCGGCCTCT 720 

GGCCCTGGGG GGCCCAGGTC AGGTGACCTA CACCACCCCC TTCCAGACCA CCAGTTCCTC 780 

CTTGGAGGCT GTGCCCTTTG CCTCTGCGGC CAATGTAGCC TGTGGGGAAG GTGACAAGGA 940 

CGAGACTCAG AGTCATTATT TCCTGTGCAA GGAGAAGGCC CCCGATGTGT TCGACTGGGG 900 

CAGCTCGGGC CCCCTCTGTG TCAGCCCCAA GTATGGCTGC AACTTCAACA ATGGGGGCTG 960 

CCACCAGGAC TGCTTTGAAG GGGGGGATGG CTCCTTCCTC TGCGGCTGCC GACCAGGATT 1020 

CCGGCTGCTG GATGACCTGG TGACCTGTGC CTCTCGAAAC CCTTGCAGCT CCAGCCCATG 1080 

TCGTGGGGGG GCCACGTGCG TCCTGGGACC CCATGGGAAA AACTACACGT GCCGCTGCCC 1140 

CCAAGGGTAC CAGCTGGACT CGAGTCAGCT GGACTGTGTG GACGTGGATG AATGCCAGGA 1200 

CTCCCCCTGT GCCCAGGAGT GTGTCAACAC CCCTGOGGGC TTCCGCTGOG AATGCTGGGT 1260 

TGGCTATGAG CCGGGCGGTC CTGGAGAGGG GGCCTGTCAG GATGTGGATG AGTGTGCTCT 1320 

GGGTCGCTCG CCTTGCGCCC AGGGCTGCAC CAACACAGAT GGCTCATTTC ACTGCTCCTG 1380 

TGAGGAGGGC TACGTCCTGG CCGGGGAGGA CGGGACTCAG TGCCAGGAOG TGGATGAGTG 1440 

TGTGGGCCCG GGGGGCCCCC TCTGCGACAG CTTGTGCTTC AACACACAAO GGTCCTTCCA 1500 

CTOTGGCTGC CTGCCAGGCT GGGTGCTGGC CCCAAATGGG GTCTCTTGCA CCATGGGGCC 1560 

TGTGTCTCTG GGACCACCAT CTGGGCCCCC GGATGAGGAG GACAAAGGAG AGAAAGAAGG 1620 

GAGCACCGTG CCCCGCGCTG CAACAGCCAG TCCCACAAGG GGCCCCGAGG GCACCCCCAA 1680 

GGCTACACCC ACCACAAGTA GACCTTCGCT GTCATCTGAC GCCCCCATCA CATCTOCCCC 1740 

ACTCAAGATG CTGGCCCCCA GTGGGTCCTC AGGCGTCTGG AGGGAGCCCA GCATCCATCA 1800 

CGCCACAGCT GCCTCTGGCC CCCAGGAGCC TGCAGGTGGG GACTCCTCOG TGGCCACACA I860 

AAACAACGAT GGCACTGACG GGCAAAAGCT GCTTTTATTC TACATCCTAG GCACCGTGGT 1920 

GGCCATCCTA CTCCTGCTGG CCCTGGCTCT GGGGCTACTG GTCTATCGCA AGCGGAGAGC 1980 

GAAGAGGGAG GAGAAGAAGG AGAAGAAGCC CCAGAATGCG GCAGACAGTT ACTCCTGGGT 2040 

TCCAGAGCGA GCTGAGAGCA GGGCCATGGA GAACCAGTAC AGTCCGACAC CTGGGACAGA 2100 

CTGCJGAAAG TGAGGTGGCC CTAGAGACAC TAGAGTCACC AGCCACCATC CTCAGAGCTT 2160 
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TGAACTCCCC ATTCCAAAGG GGCACCCACA 
CAAACAATTG TAAGTCTCCT CCTTAAAGGC 
TGTTTGATGT TCCTGAAGTG GAAQCTGTOT 
TCTATAATGA TTGTTACTCC CCCTCCCTTT 
5 GGGTGTGAGG AGOCTGGGGC TAAGGGGCTC 
ATCTAAGAGG AAAAGOTGAG TTGCTCATGC 
CCTAGGATGA AAACTAAATC AATTAATTAT 
TCAAAOOGAA CATGTTCGGA CTGGAAACAT 
AGCACAAGTC TTGCTAAATG TGATACTGTT 

10 TAACCTCTTA GGTGGCAAGG AGGCAGGAAG 
CTTGGGTTTA TTTGCAAAGG AAGCTTGAAA 
CAGGTGTTTG TGAAGTCACA TAATCTACGG 
CACAGATACT TOAATTAATT CATCCAAATG 
TGTGATCAAC ACTAACAAGG AAACAAATTC 

15 CCTCAGACAC CCTGCCTOTO GCCCCGCCTC 
CGAGCTCAGA CAGAGGAAGC CCTGCAGAAA 
TGAACGGGAG ATGATGCACT GTGTTTTGAA 
TCATAGTCCA CAGTTGATGC AGCATCCTGA 
CACACCAAOT AGGGAGCTAG TCAGGCAGTT 

20 TTCCTTAAAA TTGGGGGTAA GGAGGGAAGG 
TTTTTACAGC AAAAACTGCT CAAAGCCATT 
TTGCAAATAT TTCTCCCTAT GATAATGCAG 
TCTCTCTCAC ACACACACAC ACACACACAC 
CCTGGGGCAC TGGAACACAT TCCTGGGGGT 

25 TGAGTATCTC TGGGAGGCCT CATGTCTCCT 
ACAGACAGAG GAAATGTGTC TCCCTCCAAG 
GGTTTTGCCT TAGCAATGCA TCGGTCTCTG 
CAAGGTGCAG GGTTAATACT CTTGCCAGTT 
TTTTAATAGA AAACTAAAGG GGCAGGGGAA 

30 TCGATGGGGC ATTTGGAACT TCTTTTTAAA 
AACTCTGGTG TTTAACACTT AAGGGAGACA 
GGCCACGAGA CTCTAGGTGA TGTGTGAAGC 
CTGTCTGGCC ATTCAGAGGA TTCTAAAGAC 
CACTTAAATA AATGCAAATQ CAACATTTCT 

35 TCATTTGGGG TGAAGGAGAC ATTTCTGTCC 
GTATGATTCC TGGGATCCAA OGAGCCCTCC 
GCCCAGGCCC ATCGTCTGTT CTCTGAATGC 
GAACCCCTCT GTGGAACCCA CAAGGGGAGA 
ACCTTCCCTG GCAGGCTGGG TCCCTCTCCT 

40 CCCACCACGG GGGGAGAGCC AGCAACCCAA 
AACCACTGGG CTCAAACACG TGCTTTATTC 
ATGGAAATTC TTGTTTGGGG GATCTTGGGG 
GCCAAGAGGC CATTAACAAA TCGTCCTTGT 
CACAGTGGGG AATCCAAGGG TCACAGTATG 

45 TCTCGCTAGA CACAGTGTTT CTGCCCAGGT 
CATGGGGACG GGGGAAGTTT TCACTTGGAG 
CCAAATAGGT CAATAATTCT GGGAGACTCT 
TCTCTCCCTC CCCTCATCCC ACATCTCAAA 
CACCCAGCTC GCCATGCCTA CTCATTCCTG 

50 CTTCTTTGTC ATTTQAGAAA GGATGCAGGA 
CAGAAAAACC AGGGCAGGAC AGTTATCGAC 
TAGAGGGACT CCACCCCTGC TCAACAGCTT 
CTCTGCCTTC GGTGGCCCAC ACACCTAAGC 
AACACATCTA CGTGTAGCAC TACGACGTTA 

55 AGGCTCTGAT TAAGGATGTG GGGAAGTGGG 
CTGGAGGCCT GTCTGTTAGC CAGTGGTGGA 
TGCCATCTTC CCTGCGATCA GGCAAAAAAG 
TGTGTTATGT CCATTTTGCA GGATGAACTG 
TTGCTTTGTC TTTTCCATCC TCATCACAAG 

60 TCTTTCGATG OATGGAGATG ATCATTAGGT 
ATTTCTGTGA AAACTAGGAG AACAGAGATG 
TAACACAGTC TTTTTAAAAC TAACATAGOA 
TCTCCATTGT CTAAATCAQG AAAACAGGAA 
TTAATGCCCC CTACATATTT CCATCACCTT 

65 TATGATCCCA GAAAACATCT GTCTCTACTT 
TGGTTTGTGC ATTTTCTCAA CTAAAAATAG 
CTAATCAAAG ACACTATTTT CATACTAGAT 
TTTAAAAATA AATTGTGTTT TGGTCTGTTC 
AAGCTCTGGA ATCCCTTTAT TGTGCTGTTG 

70 TTCAGCAGAT TTTGCCCACT ATTCCTCTGA 
GCTTGAATTA GATCCCTGCA AAGGCTTGCT 
GTAATCACTT CATGAATCCT AAATGAGAAT 
TTTGTTTGAC TAATTCTGGA ATTACAAGAT 
ATGTTTCCCA AACTGTGAGG AGGGAAGGCT 

75 CAAAATGGTG CTTTGAGGGT CAGCCTTTAG 
TCTGTTATGT OCCTATCCTA ATAAACTCTT 



TTTTTTTGAA AGACTGGACT GGAATCTTAG 2220 

CCCTTGGAAC ATGCAGGTAT TTTCTACGGG 2280 

GTTGGCGTGC CAOGGTGGGG ATTTOGTGAC 2340 

TCAAATTCCA ATGTGACCAA TTCOGGATCA 2400 

CCCTGAATAT CTTCTCTGCT CACTT CCACC 2460 

TGATTAGGAT TGAAATGATT TGTTTCTCTT 2520 

TCAATTAGGT AAGAAGATCT GGTTTTTTGG 2580 

TTCTTTACAT TTGCATTCCT CCATTTCGCC 2640 

GACATCCTCC AGAATGGCCA GAAOTGCAAT 2700 

TGCCTCTTTA GTTCTTACAT TTCTAATAGC 2760 

AATATGAGAA AAGTTGCTTG AAGTGCATTA 2620 

GQCTAGGGCG AGAGAGGCCA GGGATTTGTT 2880 

TACTGAGGTT ACCACACACT TGACTACGGA 2940 

AAGGACAACC TGTCTTTQAG CCAGGGCAGG 3000 

CACTTCATCC TGCCCGGAAT GCCAGTGCTC 3060 

GTTCCATCAG GCTOTTTCCT AAAGGATGTG 3120 

AGTTGTCATT TTAAAGCATT TTAGCACAGT 3180 

GATTTTAAAT CCTGAAGTGT GGGTGGCGCA 3240 

TGCTTAAGGA ACTTTTGTTC TCTGTCTCTT 3300 

AAGAQGGAAA GAGATGACTA ACTAAAATCA 3360 

TAAATTATAT CCTCATTTTA AAAGTTACAT 3420 

TCGATAGTGT GCACTCTTTC TCTCTCTCTC 3460 

ACACACACAC AGAGACACGG CACCATTCTG 3540 

CACCGATGGT CAGAGTCACT AGAAGTTACC 3600 

GTGGGCTTTT TACCACCACT GTGCAGGAGA 3660 

GCCCCAAAGC CTCAGAGAAA GGGTGTTTCT 3720 

AGGTGACACT CTGGAGTGGT TGAAGGGCCA 3780 

TTGAAATATA GATGCTATGG TTCAGATTGT 3840 

GTGAAAGGAA AGATGGAGGT TTTGTGCGGC 3900 

GTCATCTCAT GGTCTCCAGT TTTCAGTTGG 3960 

AAGGCTGTGT CCATTTGGCA AAACTTCCTT 4020 

TGGGCAGTCT GTGGTGTGGA GAGCAGCCAT 4080 

ATGGCTGGAT GCGCTGCTGA CCAACATCAG 4140 

CCCTCTGGGC CTTGAAAATC CTTGCCCTTA 4200 

TTGGCTTCCC ACAGCCCCAA CGCAGTCTGT 4260 

TATTTTCACA GTGTTCTGAT TGCTCTCACA 4320 

AGCCCTGTTC TCAACAACAO GGAGGTCATG 4380 

AATGGGTGAT AAAOAATCCA GTTCCTCAAA 4440 

GCTGGGTGGT GCTTTCTCTT GCACACCACT 4500 

CCAGACAGCT CAGGTTGTGC ATCTGATGGA 4560 

TCCTGTTTAT TTTTGCTGTT ACTTTGAAGC 4620 

CTACAGTAGT GGGTAAACAA ATGCCCACCG 4680 

CCTOAGGGGC CCCAGCTTGC TCGGGCGTGG 4740 

GGGAGAGGTG CACCCTGOCA CCTGCTAACT 4800 

GACCTGTTCA GCAGCAOAAC AAGCCAGGGC 4860 

ATGGACACCA AGACAATGAA GATTTGTTGT 4920 

TGGAAAAAAC TGAATATATT CAGGACCAAC 4960 

GCAGACAATG TAAAGAGAGA ACATCTCACA 5040 

AATTTCAGGT GCCATCACTG CTCTTTCTTT 5100 

GGACAATTCC CACAGATAAT CTGAGGAATG 5160 

AATGCATTAG AACTTGGTGA GCATCCTCTG 5220 

GGCTTCCAGG CAAGACCAAC CACATCTGGT 5280 

GTCATCGTCA TTGCCATAGC ATCATGATGC 5340 

TGTTTGGGTA ATQTGGGGAT GAACTGCATG 5400 

CTGCGGTCAC TGTCGGCCTT GCAAGGCCAC 5460 

GGAGCAAGGC TTCAGGAAGG GCCAGCCACA 5520 

TGGAATTAAA AAGTCAAACC TTTATATGCA 5580 

AGTTTAAAAG AATTTTTTTT TCTCTTCAAG 5640 

CCCTTGTTTG AGTGTCTTAT CCCTGAGCAA 5700 

ACTTTTGTTT CAACCTTTAT TCCTGTAAAT 5760 

AGATTTGACA AAAAAAAATT GAATTAAAAA 5820 

AAGCCTTTCC TATTATTTCT CTTCTTAGCT 5880 

AACACAGCTT TCTAGCAGCT GCAAAATGGT 5940 

GAACAATAGC TTTAGCTTGG GAATCTGAGA 6000 

CGGCTGCAAA ACCCATGGTT TAAATCTATA 6060 

AGATGATAAT COGAATTCTC CATATATTCA 6120 

TCCTGAGACA AATACTCACT GAAGGGCTTG 6180 

TTGTAGATAA TGCCCTTCTA TTTTAGGTAG 6240 

CTCTTATCTG CAAGGTGGCA AGCAGTTCTT 6300 

GCTGAAGTTC TTTGCATAGA TTTGGCTTAA 6360 

CTGTGATGTC AGATGTAATT GTAAATGTCA 6420 

GTAAGTATTT TTAAATGTGT GTATTTCAAA 6460 

TTCTATGCAG GATTTACCTT CATCCTGTGC 6540 

CAGAGATCGA GCTTCTCCTC TGAGTTCTAA 6600 

GAAGGTGCAG CTTTGTTQTC CTTTGAGCTT 6660 
AAACACATT 
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Seq ID Noi 131 j 
Protein Acceeaion ft: NP 036204.1 



MATSMGLLLL 
ATVKSKEEAQ 
BDTPYSNWHK 
KPSFKGMCRP 
KEKAPDVFDW 



11 
I 

LLLLLTQPGA 
HVQRVLAQLL 
BLRNSCISKR 
LALGGPGQVT 
GSSGPZjCVSP 
CRGGATCVLG 



TPGGFRCBCW 
DGTQCQDVDE 



SGVWREPSIH 
LGLLVYRKRR 



CVGPGGPLCD 
GSTVPRAATA 
HATAASGPQE 



21 
I 

GTGADTEAW 
RREAALTARM 
CVSLLLDLSQ 
YTTPPQTTSS 
KYGCNFNNGG 
PHGKNYTCRC 
GACQDVDECA 
SLCPNTQGSP 
SPTRGPEGTP 
PAGGDSSVAT 
PQNAAD8YSW 



31 
I 

CVGTACYTAH 
8KFWIGLQRB 
PLLPNRLPKW 
SLEAVPFASA 
CHQDCFBGGD 
PCGYQLDSSQ 
LGRSPCAQGC 



KATPTTSRPS 
QNNDGTDGQK 



41 
I 

SGKLSAABAQ 
KGKCLDPSLP 
SBGPCGSPGS 
ANVACGEGDK 
GSFLCGCRPG 
tiDCVDVDECQ 
TNTDGSFHCS 
APNGV8CTMG 
LSSDAPITSA 
LLLFYILGTV 
BNQYSPTPGT 



51 
I 

NHCNQNGGNIj 
LKGPSWVGGG 
PGSNIEGFVC 
DETQSHYFLC 
FRIiLDDLVTC 
DSPCAQECVN 
CEEGYVLAGE 
PVSLGPPSGP 
PLKMLAPSGS 
VAILLLLALA 
DC 



60 
120 
180 
240 
300 
380 
420 
480 
540 
600 



Seq ID NO: 132 PNA sequence 

Nucleic Acid Accession #> NMJ)00963.1 

coding sequence: 135-1949 (underlined sequences correspond to start and atop codons) 



CAATTGTCAT 
CTCCTTCAGC 
TGCCOGCCGC 
ATACAGCAAA 
GATTTGACCA 
CACOGGAATT 
ACATACTTAC 
ATGCAATTAT 
ACAATGCTGA 
GAGCCCTTCC 
AGCTTCCTGA 
ATCCCCAGGG 
TCAAGACAGA 
TAAATCATAT 
GAAAAATGAA 
AGGCAGAGAT 
AGGTCTTTGG 
ACAGAQTATG 
AGACAAGCAG 
AACACTTGAG 
AATTCCAGTA 
TTCTGCCTGA 
ACAACTCTAT 
TTGCTGGCAG 
CTTCCATTGA 
TTATGCTGAA 
AGTTGGAAGC 
AAAAGCCTCO 
CCTTGAAAGG 
TTGGTGGAGA 
ATAACGTGAA 
CAGTCACCAT 
TACTAAAAGA 
GAACCATGTC 
AACATCTTCT 
GACTTTTATG 
TATTCTGTTT 
TTATATTATA 
AAATGCTGAA 
GTAACTAATQ 
AGQTATCAGT 
ACTTTTTAAA 
ACCTGTAAAA 
GCTGTCTTGG 
TATTTTATAA 
AAACTTCCTT 
TCTCAAAATA 
GTAAAATCTA 
CAAATTATTG 



11 
I 

ACGACTTGCA 
TCCACAGCCA 
TCGGATGCTC 
TCCTTGCTGT 
GTATAAGTGC 
TTTGACAAGA 
CGACTTCAAG 
GAGTTATGTC 
CTATGGCTAC 
TCCTGTGCCT 
TTCAAATGAG 
CTCAAACATG 
TCATAAGCGA 
TTACGGTGAA 
ATATCAGATA 
GATCTACCCT 
TCTGGTGCCT 
CGATGTGCTT 
GCTAATACTG 
TGGCTATCAC 
CCAAAATCGT 
CACCTTTCAA 
ATTGCTGGAA 
GGTTGCTGGT 
CCAGAGCAGG 
GCCCTATGAA 
ACTCTATGGT 
GCCAGATGCC 
ACTTATGGGT 
AGTGGGTTTT 
GGGCTGTCCC 
CAATGCAAGT 
ACGTTCGACT 
TATTAATTTA 
GTAACAGAAG 
TCACTACTCT 
TATAAACCAG 
AGAACGAAAG 
AGTTTTTACA 
TTTGAAATTT 
GCATTATTAA 
ATCAGCAATG 
GCTTGTTTGA 
ATTTAAATCT 
GTGATGTTCC 
TTAAATCAAA 
AGAATATTTT 
TATCAGCAAA 
TTCAAATTTA 



21 
I 

GTGAGCGTCA 
GACGCCCTCA 
GCCCGCGCCC 
TCCCACCCAT 
GATTY3TACCC 
ATAAAATTAT 
GGATTTTGGA 
TTGACATCCA 
AAAAGCTGGG 
GATGATTGCC 
ATTGTGGAAA 
ATGTTTGCAT 
GGGCCAGCTT 
ACTCTGGCTA 
ATTGATGGAG 
CCTCAAGTCC 
GGTCTGATGA 
AAACAGGAGC 
ATAGGAGAGA 
TTCAAACTGA 
ATTGCTGCTG 
ATTCATGACC 
CATGGAATTA 
GGTAGGAATG 
CAGATGAAAT 
TCATTTGAAG 
GACATCGATG 
ATCTTTGGTG 
AATGTTATAT 
CAAATCATCA 
TTTACTTCAT 
TCTTCCCGCT 
GAACTGTAGA 
ATTATTTAAT 
TCAGTACTCC 
AAAGATTTTG 
AGAGAAATGA 
TAAAGATOTT 
CTGTCGATGT 
TAAAGTACTT 
ATGAATATTT 
AAACAATAAT 
TTTCTTAAAG 
GTAAAATCAG 
TTTTTCACCA 
ATGCCAAATT 
GTTGAGATAT 
AGGGTCTACC 
GGTTTAAACT 



31 
1 

GGAGCACGTC 
GACAGCAAAG 
TGCTGCTGTG 
GTCAAAACCG 
GGACAGGATT 
TTCTGAAACC 
ACGTTGTGAA 
GATCACATTT 
AAGCCTTCTC 
CGACTCCCTT 
AATTGCTTCT 
TCTTTGCCCA 
TCACCAACGG 
GACAGCGTAA 
AGATGTATCC 
CTGAGCATCT 
TGTATGCCAC 
ATCCTGAATG 
CTATTAAGAT 
AATTTGACCC 
AATTTAACAC 
AGAAATACAA 
CCCAQTTTGT 
TTCCACCCGC 
ACCAGTCTTT 
AACTTACAGG 
CTGTGGAGCT 
AAACCATGGT 
GTTCTCCTGC 
ACACTGCCTC 
TCAGTGTTCC 
CCGGACTAGA 
AGTCTAATGA 
AATATTTATA 
TGTTGCGGAG 
CTGTTGCTGT 
GTTTTGACGT 
TGAATACTTA 
TTCCAATGCA 
TTGGTTATTT 
AAATTAGACA 
TTGAAATTTC 
TTATTAAACT 
ATGAAATTTT 
AGAGTATAAA 
TATTAAGGTG 
TCCAGAATTT 
TTTAAAATAA 
TTTGAAGCAA 



41 

I 

CAGGAACTCC 
CCTACCCCCG 
CGCGGTCCTG 
AGGTGTATGT 
CTATGGAGAA 
CACTCCAAAC 
TAACATTCCC 
GATTGACAGT 
TAACCTCTCC 
GGGTGTCAAA 
AAGAAGAAAG 
GCACTTCACG 
GCTGGGCCAT 
ACTGCGCCTT 
TCCCACAGTC 
ACGGTTTGCT 
AATCTGGCTG 
GGGTGATGAG 
TGTGATTGAA 
AGAACTACTT 
CCTCTATCAC 
CTATCAACAG 
TGAATCATTC 
AGTACAGAAA 
TAATGAGTAC 
AGAAAAGGAA 
GTATCCTGCC 
AGAAGTTGGA 
CTACTGGAAG 
AATTCAGTCT 
AGATCCAGAG 
TGATATCAAT 
TCATATTTAT 
TTAAACTCCT 
AAAGGAOTCA 
TAAGTTTGGA 
CTTTTTACTT 
AACACTATCA 
TCTTCCATGA 
TTCTOTCATC 
TTACCAGTAA 
TAAATTCATA 
TGTACATATA 
ACTACAATTG 
CCTTTTTAGT 
GTGGAGCCAC 
GTTTATATGG 
GCAATAACAA 
ACTTTTTTTT 



51 
I 

TCAGCAGCGC 
CGCCGCGCCC 
GCGCTCAGCC 
ATGAGTGTGG 
AACTGCTCAA 
ACAGTGCACT 
TTCCTTCGAA 
CCACCAACTT 
TATTATACTA 
GGTAAAAAGC 
TTCATCCCTG 
CATCAGTTTT 
GGGGTGGACT 
TTCAAGGATG 
AAAGATACTC 
GTGGGGCAGG 
CGGGAACACA 
CAOTTGTTCC 
GATTATGTGC 
TTCAACAAAC 
TGGCATCCCC 
TTTATCTACA 
ACCAGGCAAA 
GTATCACAGG 
CGCAAACGCT 
ATGTCTGCAG 
CTTCTGGTAG 
GCACCATTCT 
CCAAGCACTT 
CTCATCTGCA 
CTCATTAAAA 
CCCACAGTAC 
TTATTTATAT 
TATGTTACTT 
TACTTGTGAA 
AAACAQTTTT 
GAATTTCAAC 
CAAGATGGCA 
TGCATTAGAA 
AAACAAAAAC 
TTTCATGTCT 
GGGTAGAATC 
CCAAAAAGAA 
CTTGTTAAAA 
GTGACTGTTA 
TGCAQTGTTA 
CTGGTAACAT 
AGAAGAAAAC 
ATCCTTGTGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
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ACTGCAGGCC TGGTACTCAO ATTTTGCTAT GAGGTTAATG AAGTACCAAG CTGTGCTTOA 3000 

ATAACGATAT GTTTTCTCAO ATTTTCTGTT GTACAGTTTA ATTTAGCAOT CCATATCACA 3060 

TTGCAAAAGT AGCAATGAOC TCATAAAATA CCTCTTCAAA ATGCTTAAAT TCATTTCACA 3120 

CATTAATTTT ATCTCAGTCT TGAAGCCAAT TCAGTAGGTG CATTGGAATC AAGCCTGGCT 3180 

5 ACCTGCATQC TGTTCCTTTT CTTTTCTTCT TTTAGCCATT TTOCTAAGAO ACACAGTCTT 3240 

CTCATCACTT OQTTTCTCCT ATTTTQTTTT ACTAOTTTTA AGATCA6A6T TCACTTTCTT 3300 

TGGACTCTGC CTATATTTTC TTACCTGAAC TTTTGCAAGT TTTCAQGTAA ACCTCAGCTC 3360 

AGGACTGCTA TTTAGCTCCT CTTAAGAAGA TTAAAAGAGA AAAAAAAAGG CCCTTTTAAA 3420 

AATAGTATAC ACTEATTTTA AOTGAAAAGC AGAGAATTTT ATTTATAGCT AATTTTAOCT 3460 

10 ATCTGTAACC AAGATGGATG CAAAOAGGCT AGTGCCTCAG AGAGAACTGT AOGGGGTTTG 3540 

TGACTGGAAA AAGTTAOGTT CCCATTCTAA TTAATGCCCT TTCTTATTTA AAAACAAAAC 3600 

CAAATGATAT CTAAGTAGTT CTCAGCAATA ATAATAATGA CGATAATACT TCTTTTCCAC 3660 

ATCTCATTGT CACTGACATT TAATOOTACT GTATATTACT TAATTTATTG AAGATTATTA 3720 

TTTATGTCTT ATTAGGACAC TATGGTTATA AACTGTGTTT AAGCCTACAA TCATTGATTT 3780 

15 TTTTTTGTTA TGTCACAATC AGTATATTTT CTTTGGGGTT ACCTCTCTGA ATATTATGTA 3840 

AACAATCCAA AGAAATGATT GTATTAAGAT TTQTGAATAA ATTTTTAGAA ATCTGATTGG 3900 

CATATTGAGA TATTTAAGGT TGAATOTTTG TCCTTAGGAT AGGCCTATGT GCTAGCCCAC 3960 

AAAGAATATT GTCTCATTAG CCTGAATGTG CCATAAGACT GACCTTTTAA AATGTTTTGA 4020 

GGGATCTGTG GATGCTTCGT TAATTTQTTC AGCCACAATT TATTGAGAAA ATATTCTGTG 4080 

20 TCAAGCACTQ TGGGTTTTAA TATTTTTAAA TCAAACGCT3 ATTACAGATA ATAGTATTTA 4140 

TATAAATAAT TGAAAAAAAT TTTCTTTTGG GAAGAGGGAG AAAATGAAAT AAATATCATT 4200 

AAAGATAACT CAGGAGAATC TTCTTTACAA TTTTACGTTT AGAATGTTTA AGGTTAAGAA 4260 

AGAAATAGTC AATATGCTTG TATAAAACAC TGTTCACTGT TTTTTTTAAA AAAAAAACTT 4320 

GATTTGTTAT TAACATTGAT CTGCTGACAA AACCTGGGAA TTTGGGTTGT GTATGCGAAT 4380 

25 GTTTCAGTGC CTCAGACAAA TGTGTATTTA ACTTATGTAA AAGATAAGTC TGGAAATAAA 4440 

TGTCTGTTTA TTTTTGTACT ATTTA 

Seq ID Mo: 133 Protein sequence: 
Protein Acceeaion 8: NP_000954.i 



1 11 21 31 41 51 

I I I I I I 

MLARALLLCA VIALSHTANP CCSHPCQNRG VCMSVGFDQY KCDCTRTGFY GENCSTPEFL 60 

35 TRIKLFLKPT PNTVHYILTH FKGFWNWNN IPFLRNAIMS YVLTSRSHLI DSPPTYNADY 120 

GYKSWEAFSN LSYYTRALPP VPDDCPTPLG VKGKKQLPDS NBIVBKLLLR RKFIPDPQGS 180 

KMMFAPPAQH FTKQFFKTDH KRGPAPTNGL GHGVDLNHIY GETLARQRKL RLPKDGKMKY 240 

QIIDGEMYPP TVKDTQAEMI YPPQVPEHLR FAVGQBVFGL VPGLKMYATI WLREHNRVCD 300 

VLKQEHPBWG DEQLFQTSRL ILIGETIKIV IEDYVQHLSG YHFKLKPDPB LLFNKQFQYQ 360 

40 NRIAABFNTL YHWHPLLPDT FQIHDQKYNY QQFIYNNSIL LEHGITQFVH SFTRQIAGRV 420 

AGGRNVPPAV QKVSQASIDQ SRQMKYQSFN EYRKRFMLKP YESFEELTGE KEMSABLBAti 460 

YGDIDAVELY PALLVEKPRP DAIFGETHVE VGAPFSLKGL MGNVICSPAY WKPSTFGGEV 540 

GFQIINTASI QSLICNNVKG CPFTSFSVPD PELIKTVTIN ASSSRSGLDD INPTVLLKER 600 



Seq ID NO: 134 PNA sequence 

Nucleic Acid Accession #i XNJJS9648.1 

Coding sequence: 35-664 (underlined sequences correspond to start and stop codons) 

50 

1 11 21 31 41 51 

I I I I I I 

AGGCTGCTGA GACTTCCCTC TAGAATCCTC CAACATGGAG CCTCTTGCAG CTTACCCGCT 60 

AAAATGTTCC GGGCCCAGAG CAAAGGTATT TGCAGTTTTG CTGTCTATAG TTCTATGCAC 120 

55 AGTAACGCTA TTTCTTCTAC AACTAAAATT CCTCAAACCT AAAATCAACA GCTTTTATGC 180 

CTTPGAAGTG AAGGATGCAA AAGGAAGAAC TGTTTCTCTG GAAAAGTATA AAGGCAAAGT 240 

TTCACTAGTT GTAAACGTGG CCAGTGACTG CCAACTCACA GACAGAAATT ACTTAGGGCT 300 

GAAGGAACTG CACAAAGAGT TTGGACCATC CCACTTCAGC GTGTTGGCTT TTCCCTGCAA 360 

TCAGTTTGGA GAATCGGAGC CCCGCCCAAG CAAGGAAGTA GAATCTTTTG CAAGAAAAAA 420 

60 CTACGGAGTA ACTTTCCCCA TCTTCCACAA GATTAAGATT CTAGGATCTG AAGGAGAACC 480 

TCCATTTAGA TTTCTTQTTQ ATTCTTCAAA GAAGGAACCA AGGTGGAATT TTTGGAAGTA 540 

TCTTGTCAAC CCTGAGGGTC AAGTTGTGAA GTTCTGGAAG CCAGAGGAGC CCATTGAAGT 600 

CATCAGGCCT GACATAGCAG CTCTGGTTAG ACAAGTGATC ATAAAAAAGA AAGAGGATCT 660 

ATGAGAATGC CATTGCGTTT CTAATAGAAC AGAGAAATGT CTCCATGAGG GTTTGGTCTC 720 

65 ATTTTAAACA TTTTTTTTTT GGAGACAGTG TCTCACTCTG TCACCCAGGC TGGAGTGCAG 780 

TAGTGCGTTC TCAGCTCATT GCAACCTCTG CCTTTTTAAA CATGCTATTA AATGTGGCAA 840 

TGAAGGATTT TTTTTTAATG TTATCTTGCT ATTAAGTGGT AATGAATGTT CCCAGGATGA 900 

GGATGTTACC CAAAGCAAAA ATCAAGAGTA GCCAAAGAAT CAACATGAAA TATATTAACT 960 

ACTTCCTCTG ACCATACTAA AGAATTCAGA ATACACAGTG ACCAATGTGC CTCAATATCT 1020 

70 TATTGTTCAA CTTGACATTT TCTAGGACTG TACTTGATGA AAATGCCAAC ACACTAGACC 1080 

ACTCTTTGGA TTCAAGAGGA CTGTGTATGA CTGAAATTTC TGGAATAACT GTAAATGGTT 1140 

ATGTTAATGG AATAAAACAC AAATGTTGAA AAATGTAAAA TATATATACA TAGATTCAAA 1200 

TCCTTATATA TGTATGCTTG TTTTGTGTAC AGGATTTTGT TTTTTCTTTT TAAGTACAGG 1260 

TTCCTAGTGT TTTACTATAA CTGTCACTAT GTATGTAACT GACATATAXA AATAGTCATT 1320 

75 TATAAATGAC CGTATTATAA CA 
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Seq ID No: 135 Profceftn Sequence; 
Protein Accession #: XP_059648.1 



1 11 21 31 41 51 

I I I I I I 

MBPLAAYPLK CSOPRAKVFA VliSIVLCTV TLFLLQLKFL KPKINSFYAF BVKDAKGBTV 60 
SLEKYKGKVS LWNVASDCQ LTDRNYLQIiK BLHKSFGPSH FSVIAFPCNQ FGESEPRPSK 120 
EVESFARKNY GVTFPIFHKI KILG8EGEPA FRFLVDSSKK EPRWNFWKYL VNPEGQWKP 180 
WKPBEPISVI RPDIAALVRQ V1IKKKEDL 



Seq ID NO: 136 pNA sequence. 

Nucleic Acid Accession #: NM_003003.1 

Coding sequence: 304-2451 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CAAGTGCCGT CGCCGCGCCC CTTCCCCCTC CCGOCTCCCC GGCCCCCTCC CCGGAACCGG 60 

CGGTCGAGCT ACGGTCO CQQ ACGAOTGGAA CCGAGACTGC CCCOCGGAGC OGCCGOTAT0 120 

AGCGCCCCTC OCCACCCCGT GTCCCAGGCC CGGCCTTTCT GACAAGAQCT AGACTTCGGG 180 

CTCCTTGAGG ATATTCAGTT TTGTATGTTT GAATATCCTC TCACCATOTT CAQCATAAAQ 240 

TACCATTCTT AATQATTATC CTCAACAAGA CAGGTGTGAQ AGGGTTGCTG TTGCATTQCA 300 

ATCATGCTGC AAAAATACCA GTCCCCAGTG AGAGTGTACA AATACCCCTT TGAATTAATT 360 

ATGGCTGCCT ATGAAAGGAG GTTCCCTACA TGTCCTTTGA TTCCGATGTT CGTGGGCAGT 420 

6ACACTGTGA GTGAATTCAA GAGCGAAGAT GGGGCTATTC ATGTCATTGA AAGGCGCTCC 480 

AAGCTGGATG TAGATGCACC CAGACTGCTG AAGAAGATTG CAGGAGTTGA TTATGTTTAT 540 

TTTGTCCAGA AAAACTCACT GAATTCTCGG GAACGTACTT TQCACATTGA GGCTTATAAT 600 

GAAACGTTTT CCAATOGGGT CATCATTAAT GAGCATTQCT GCTACACCGT TCACCCTGAA 660 

AATGAAGATT GGACCTGTTT TGAACAGTCT GCAAGTTTAG ATATTAAATC TTTCTTTGGT 720 

TTTGAAAGTA CAGTGGAAAA AATTGCAATG AAACAATATA CCAGCAACAT TAAAAAAGGA 780 

AAGGAAATCA TCGAATACTA CCTTCGCCAA TTAGAAGAAG AAGGCATAAC CTTTGTGCCC 840 

CQTTQGAGTC CGCCTTCCAT CACGCCCTCT TCAGAGACAT CTTCATCATC CTCCAAGAAA 900 

CAAGCAGCGT CCATGGCCGT CGTCATCCCA GAAGCTGCCC TCAAGGAGGG GCTGAGTGGT 960 

GATGCCCTCA GCAGCCCCAG TGCACCTGAG CCCGTGGTGG GCACCCCTGA CGACAAACTA 1020 

GATGCCGACC ACATCAAGAG ATACCTGGGC GATTTGACTC CGCTGCAGGA GAGCTGCCTC 1080 

ATTAGACTTC GCCAGTGGCT CCAGGAGACC CACAAGGGCA AAATTCCAAA AGATGAGCAT 1140 

ATTCTTCOGT TCCTCCGTGC ACGGGATTTT AATATTGACA AAGCCAGAGA GATCATGTGT 1200 

CAGTCTTTGA CGTGGAGAAA GCAGCATCAG GTAGACTACA TTCTTGAAAC CTGGACCCCT 1260 

CCTCAGGTCC TTCAGGATTA CTACGCGGGA GGCTGGCATC ATCACGACAA AGATGGGCGG 1320 

CCCCTCTACG TGCTCAGGCT GGGGCAGATG GACACCAAAG GCTTGGTGAG AGCGCTOGGG 1380 

GAGGAAGCCC TGCTGAGATA CGTTCTCTCC GTAAATGAAG AACGGCTAAG GCGATGGGAA 1440 

GAGAATACAA AAGTCTTTGG TCGGCCTATC AGCTCATGGA CCTGCCTGGT GGACTTGGAA 1500 

G6GCTGAACA TGCGCCACTT GTGGAGACCT GGTGTGAAAG CGCTGCTGCG GATCATCGAG 1560 

GTGGTGGAGG CCAACTACCC TGAGACACTG GGCCGCCTTC TCATCCTGCG GGCGCCCAGG 1620 

GTATTTCCTG TOCTCTGGAC GCTGGTTAGT CCGTTCATTG ATGACAACAC CAGAAGGAAG 1680 

TTCCTCATTT ATGCAGGAAA TGACTACCAG GGTCCTGGAG GCCTGCTGGA TTACATCGAC 1740 

AAAGAGATTA TTCCAGATTT CCTGAGTGGG GAGTGCATGT GOGAAGTGCC AGAGGGTGGA 1800 

CTGGTCCCCA AATCTCTGTA CCGGACTGCA GAGGAGCTGG AGAACGAAGA CCTGAAGCTC 1860 

TGGACTOAGA CCATCTACCA GTCTGCAAGC GTCTTCAAAG GAGCCCCACA TGAGATTCTC 1920 

ATTCAGATTG TGGATGCCTC GTCAGTCATC ACTTGGGATT TCGACGTGTG CAAAGGGGAC 1980 

ATTGTGTTTA ACATCTATCA CTCCAAGAGG TCGCCACAAC CACCCAAAAA GGACTCCCTG 2040 

GGAGCCCACA GCATCACCTC TCCGGGTGGG AACAATGTGC AGCTCATAGA CAAAGTCTGG 2100 

CAGCTGGGCC GCGACTACAG CATGGTGGAG TCGCCTCTGA TCTGCAAAGA AGGAGAAAGC 2160 

GTGCAGGGTT CCCATGTGAC CAGGTGGCOG GGCTTCTACA TCCTGCAGTG GAAATTCCAC 2220 

AGCATGCCTG CGTGCGCCGC CAGCAGCCTT CCCCGGGTGG ACGACGTGCT TGCGTCCCTG 2280 

CAGGTCTCTT CGCACAAGTG TAAAGTGATG TACTACACCG AGGTGATCGG CTCGGAGGAT 2340 

TTCAGAGGTT CCATGACGAG CCTGGAGTCC AGCCACAGCG GCTTCTCCCA GCTGAGTGCC 2400 

OCCACCACCT CCTCCAGCCA GTCCCACTCC AGCTCCATGA TCTCCAGGJA_gTGCCGCGCT 2460 

GCCTGCACCT AGTGTGCAGA GGGGACGGCC GCCCCTCCTC GGACAGCAGC TGCACCCGCC 2520 

CACCCAGCGG CGACATTGTA CAGACTCCTC TCACCTCTAG ATAGCAAATA GCTCTCAGAT 2580 

GGTAAACGTA GTCGTTTGAT CCCAAAACTA CCTTGQCAGG TAGTTTTAAC TCTGATCCTA 2640 

ACTTAACTCA ATAGCCATAG ATTTTOTATA CGTTGTGCAC AAAATCCAAC CAGAGCGCAA 2700 

GGGCTCTCTT GAAAGAAAAG TAGTTTCTGT ACCAATTAAA GGATTGACGT GGTCTCAGAT 2760 

ATTGATGCAA AAAATTTTTC CAAOGAACTC CGCATTOTCC ATTAGTGAAT GAATTCCTGT 2820 

GACATCCTCC AGAGATGGCC CCTCCTCACC TGGGACGGAA GCTQCCAGCT CGCTTCCCCC 2880 

AAGCTQCCTC ATGGCCCGCA CGCCGCCTCA CGGCCCCCAT GCTTCCCGCC AGTCAAGATG 2940 

GTCTGTGGAC TTAGGGCCAG CCCTTGAGGT CCTTATCCTC TGAGGATTCA GAGGTTGCCT 3000 

GCGGAGTACC TTGTCCCAGG GCCAGACACA CCCACACCAC CCACTGTCTG CAGTGGGGCC 3060 

GGGGGCTCAG GAGGGGCTCT CAGGGACTCC TGGTGACTCC AGGAAAATGC TGCCATOGTT 3120 

AAACATTACT TTCTCTTTCC TCCTTTTCAA ATCTTTTTGA TACTTTTTAG AGCAGGATTT 3180 

TTCTGTATGT GAACTTGGGT GGGGGGGTTC TTCCCGTTTC CTTCCGTGCG TCGCCCCTCT 3240 

CACCTGCAGT CAGCTCCCAG CCCAGTGTAG GCCATCTCCT CTGTGCCCTC TGGAGGCTCA 3300 

TTGTCTCAGA GCCCAGACAG TTCCAGCCAC TAGGAGGCCG TCTTGGAACC AGCAAGTCGC 3360 

ATTTGCCACT TGACACTGTC CATGGGGTTT TATTAGTAGC TAAGCAGCAG CTCTCGCATC 3420 
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CACTTCAGGG TGGOGT6TGO CATGTAGGAG TCCTGCTTCT TTGTACATGG GAATTGTGGA 3480 

CTCATGCGTG T8T0TOTOTQ CATGTGCTGT GTGTQTGCAT GTGTGCATGA CGGTGGGGGT 3540 

OCTGGGGGGA CGGGGTGAOT GGAAACTTAG TTTGAGTAAT GAAGGAATCT TCACAGAAGC 3600 

AAATCAGAAT ATOOGATTTG TTTGCCTTTT ACATTTTGTT TAATTCCTGA TTTTAAAOCC 3660 

5 TOCTCTATCT GGTACAGGCC CTTATTTTTT CAOCTTTTTA TGGGAAAAGC AGGTTATTTG 3720 

AGAATCTGTC CAGAAGTTGC ATAGGGGATG GCCTCCAOGA TAAGOACATO CAACACGTGT 3780 

TTCTQTGTGC AGCAGAGGCC GTGTTTTTCA TGCCAAACCC CACGCGGCTG TCAACTGTGT 3840 

GCGTGOTAGG CATGGAGATC CTGGTTGTGC CGTCTCACCT COGCTCTGAA GGCACTOTGT 3900 

GGOTGCTGCG TGACTGGAGA GCTGTGTGQA GGCCATGTGT 0CCCCGT3CA GOOATCAGGA 3960 

10 GGGCGGGGOA GGGACCGAGC AGCCCTCTTG CCCGGTCGGG TCAGCCCTAG TGGCTGCCTG 4020 

CACACTGTAG ACOTCCCAGG GCCTGTGCTG TGATCACCTG CCTTTGGACC ACATTTGTGT 4080 

TTGCTCTTAG AGATCGAGCT CCTCAGTGGT ACCTGAAGCC TTTGCTTCOG GAAAGOGOGG 4140 

TAGGGTTOGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT AGGGCTAGTA GGTAGGGCTA 4200 

GTAGGTAGGG TTAGTAGGTA GGGTTOGTAG GTAGGGCTGG TAGGTAGGGT TAGTAGGTAG 4260 

15 QGCTAGTAGG TAGGGTTOGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT AGGGCTAGTA 4320 

GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTOGTAG GTAGGGCTGG TAGGTAGGGT 4380 

TAGTAGGTAG GGCTAGTAGG TAGGGTTOGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT 4440 

AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTOGTAG GTAGGGCTGG 4500 

TAGGTAGGGT TAGTAGGTAG GGCTAGTAGG TAGGGCTAGT AGGTAGGGCT AGTAGGTAGG 4560 

20 GTTAGTAGGT AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTOGTAG 4620 

GTAGGGCTGG TAGGTAGGGT TAGTAGGTAG GGCTAGTAGG TAGGGCTAGT AGGTAGGGCT 4680 

AGTAGGTAGG GCTAGTAGGT AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG CTAGTAGGTA 4740 

GGGTTOGTAG GTAGGGTTCG TAGGTAGGGT TCGTAGGTAG GGTTAGTAGC GCGTCTGTGC 4800 

TGCTTCCACC TGGTGCTTCC TGTTCCCAAA TCACAAGGGC CTGAAGGTGG TCCCTGCTTT 4660 

25 CTCTTTCTCT TTCTCTGTGT CTCAGATGGC GATTTTGCTG ACAGCTGCCA AGAAAATGCT 4920 

TCACTCAACA GTCCTCATGT GCCCAGAGAT GTTTATAGAA CTGTTTGAAT TGCAGCCATC 4980 

CCCTGCCCCC TOCCAGGCTG AAGATCTGTT CTTTTTAAGT TGATTCGGGA GTGGCATTCT 5040 

TTTATACCCA AAGACTGTAG TGCATCTTGA AGAGCTCAAA QCACATGACC GCACAAATGC 5100 

TTACAGGGTT TCCTCCCGAG TAATCCAATC TCACTCCCCT TGTAAGGGAA TTCTGGGGCA 5160 

30 GCTATGGTTT GAGTATGCAG TTTGCATCGT GTTTCTACCT TTAGTACCTT GCCACTCTTT 5220 

TAAAACGCTG CTGTCATTTC CCATTTCTTA GTACTAATGA TTCTTTGATT CTCCCTCTAT 5280 

TATGTCTTAA TTCACTTTCC TTCCTAAATT TGTTATTTGC ATATCAAATT CTGTAAATOT 5340 

TTTGTAAACA TATTACCTCA CTTGGTAATA CAATACTGAT AGTCTTTAAA AGATTTTTTT 5400 
ATTGTTATCA ATAATAAATG TGAACTATTT AAAG 
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Seq ID No: 137 Prgfrein sequence? 
Protein Accession #: MP 002994.1 



40 1 11 21 31 41 51 

I I I I 1 I 

MVQKYQSPVR VYKYPFELIM AAYBRRPPTC PLIPMPVGSD TVSEFKSEDG AIHVIERRCK 60 

LDVDAPRLLK KIAGVDYVYF VQKNSLNSRE RTLHIEAYNE TFSNRVIINB HCCYTVHPEN 120 

EDWTCFEQSA SIiDIKSFFGP ESTVEKIAMK QYTSNIKKGK EIIEYYEjRQL EBEGITFVPR 180 

45 WSPPSITPSS ETSSSSSKKQ AASMAWIPE AALKEGLSGD ALSSPSAPBP WGTPDDKItD 240 

ADHIKRYLGD LTPLQESCLI RXjRQHLQETH KGKIPKDBHI LRFLRARDFN IDKARBIHOQ 300 

SLTWRKQHQV DYILETWTPP QVLQDYYAGG WHHHDKDGRP LYVLRLGQMD TKGLVRALGB 360 

EALLRYVLSV NEERLRRCEB KTKVFGRPIS SWTCLVDLEG LNMRHLWRPG VKALLRIIEV 420 

VEANYPETLG RLLILRAPRV FPVLWTLV8P FIDDNTRRKF LIYAGNDYQG PGGLLDYIDK 480 

50 EIIPDFLSGB CMCBVPEGGL VPKSLYRTAE KLENEDLKLW TETIYQSASV PKGAPHEIM 540 

QIVDASSVIT WDFDVCKGDI VPNIYHSKRS PQPPKKDSLG AHSITSPGGN NVQLIDKVWQ 600 

LGRDYSMVES PLICKEGESV QGSHVTRWPG FYILQWKFHS MPACAASSLP RVDDVLASLQ 660 
VSSHKCKVKY YTEVIGSEDF RGSMTSLESS HSGFSQLSAA TTSSSQSHSS SNISR 

55 Seq ID NO: 138 DNA eecnience 

Hucleic Acid Accession ft< NM^O 04181.1 

Coding sequence: 32-670 (underlined sequences correspond to start and stop codons) 

60 1 11 21 31 41 51 

I I I I I I 

GCAGAAATAG CCTAGGGAGA TCAACCCOGA GATGCTGAAC AAAGTGCTGT CCCGGCTGGG 60 

GGTCGCCGGC CAGTGGCGCT TCGTGGACGT GCTGGGGCTG GAAGAGGAGT CTCTGGGCTC 120 

GGTGCCAGCG CCTGCCTGCG CGCTGCTGCT GCTGTTTCCC CTCACGGCCC AGCATGAGAA 180 

65 CTTCAGGAAA AAGCAGATTG AAGAGCTGAA GGGACAAGAA GTTAGTCCTA AAGTGTACTT 240 

CATGAAGCAG ACCATTGGGA ATTCCTGTGG CACAATCGGA CTTATTCACG CAGTGGCCAA 300 

TAATCAAGAC AAACTGGGAT TTGAGGATGG ATCAGTTCTG AAACAGTTTC TTTCTGAAAC 360 

AGAGAAAATG TCCCCTGAAG ACAGAGCAAA ATGCTTTGAA AAGAATGAGG CCATACAGGC 420 

AGCCCATGAT GCCGTGGCAC AGGAAGGCCA ATGTCGGGTA GATGACAAGG TGAATTTCCA 480 

70 TTTTATTCTG TTTAACAACG TGGATGGCCA CCTCTATGAA CTTGATGGAC GAATGCCTTT 540 

TCCGGTGAAC CATGGCGCCA GTTCAGAGGA CACCCTGCTG AAGGACGCTG CCAAGGTGTG 600 

CAGAGAATTC ACOGAGCGTG AGCAAGGAGA AGTCCGCTTC TCTGCCGTGG CTCTCTGCAA 660 

GGCAGCCiaa TGCTCTGTGG GAGGGACTTT GCTGATTTCC CCTCTTCCCT TCAACATGAA 720 

AATATATACC CCCCATGCAG TCTAAAATGC TTCAGTACTT GTGAAACACA GCTGTTCTTC 780 

75 TGTTCTGCAG ACACGCCTTC CCCTCAGCCA CACCCAGGCA CTTAAGCACA AGCAGAGTGC 840 

ACAGCTGTCC ACTGGGCCAT TGTGGTGTGA GCTTCAGATG GTGAAGCATT CTCCCCAGTG 900 
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TATGTCTTGT ATCCGATATC TAACGCTTTA AATGGCTACT TTGCTTTCTO TCTGTAAQTT 960 
AAGACCTTOG ATGTGGTTAT GTTOTCCTAA AGAATAAATT TTGCTGATAG TAGC 



Seq ID No: 139 Protein, sequence^ 
Protein Accession #i NPJ>04172 .1 



1 11 21 31 41 51 

I I III I 

MLNKVLSRLG VAGQWRFVDV LGLEEESLQS VPAPACALLL LFPLTAQHEN PRKKQIBBLK 60 
GQBVSPKVYF MKQTIGNSCG TIGLIHAVAN NQDKLGFSDG SVLKQFLSET EKMSPEDRAK 120 
CFEKNBAIQA AHDAVAQBGQ CRVDDKVNPR FILFNNVDGH LYELDQRMPF PVNHGASSED 180 
TLLKDAAKVC RSFTEREQGE VRFSAVALCK AA 

Seq ID NO: 140 DNA sequence 

Nucleic Acid Accession ft: NM_000201.1 

Coding sequence: 56-1656 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GCGCCCCAGT CGACGCTGAG CTCCTCTGCT ACTCAGAOTT GCAACCTCAG CCTCGCTATG 60 

GCTCCCAGCA GCCCCCGGCC CGCGCTGCCC GCACTCCTGG TCCTGCTCGG GGCTCTGTTC 120 

CCAGGACCTG GCAATGCCCA GACATCTGTG TCCCCCTCAA AAGTCATCCT GCCCCGGGGA 180 

GGCTCCGTGC TGGTGACATG CAGCACCTCC TGTGACCAGC CCAAGTTGTT GGGCATAGAG 240 

ACCCCGTTGC CTAAAAAGGA GTTGCTCCTG CCTQGGAACA ACCGGAAGGT GTATGAACTG 300 

AGCAATGTGC AAGAAGATAG CCAACCAATG TOCTATTCAA ACTGCCCTGA TGGGCAGTCA 360 

ACAGCTAAAA CCTTCCTCAC CGTGTACTGG ACTCCAGAAC GGGTGGAACT GGCACCCCTC 420 

CCCTCTTGGC AGCCAGTGGG CAAGAACCTT ACCCTAOGCT GCCAGGTGGA GGGTSGGGCA 480 

CCCCGGGCCA ACCTCACCGT GGTGCTGCTC CGTGGGGAGA AGGAGCTGAA ACGGGAGCCA 540 

GCTGTGGGGG AGCCCGCTGA GGTCACGACC ACGGTGCTGG TGAGGAGAGA TCACCATGGA 600 

GCCAATTTCT CGTGCCGCAC TGAACTGGAC CTGCGGCCCC AAGGGCTGGA GCTGTTTGAG 660 

AACACCTCGG CCCCCTACCA GCTCCAGACC TTTGTCCTGC CAGCGACTCC CCCACAACTT 720 

GTCAGCCCCC GGGTCCTAGA GGTGGACACG CAGGGGACCG TGGTCTGTTC CCTGGACGGG 780 

CTGTTCCCAG TCTCGGAGGC CCAGGTCCAC CTGGCACTGG GGGACCAGAG GTTGAACCCC 840 

ACAGTCACCT ATGGCAACGA CTCCTTCTGG GCCAAGGCCT CAGTCAGTGT GACCGCAGAG 900 

GACGAGGGCA CCCAGCGGCT GAOGTGTGCA GTAATACTGG GGAACCAGAG CCAGGAGACA 960 

CTGCAGACAG TGACCATCTA CAGCTTTCCG GCGCCCAACG TGATTCTGAC GAAGCCAGAQ 1020 

GTCTCAGAAG GGACCGAGGT GACAGTGAAG TGTGAGGCCC ACCCTAGAGC CAAGGTGACG 1080 

CTGAATGGGG TTCCAGCCCA GCCACTGGGC CCGAGGGCCC AGCTCCTGCT GAAGGCCACC 1140 

CCAGAGGACA ACGGGCGCAG CTTCTCCTGC TCTQCAACCC TGGAGGTGGC CGGCCAGCTT 1200 

ATACACAAGA ACCAGACCCG GGAGCTTCGT GTCCTGTATG GCCCCCGACT GGACGAGAGG 1260 

GATTGTCCGG GAAACTGGAC GTGGCCAGAA AATTCCCAGC AGACTCCAAT GTGCCAGGCT 1320 

TGGGGGAACC CATTGCCCGA GCTCAAOTOT CTAAAGGATG GCACTTTCCC ACTGCCCATC 1380 

GGGGAATCAG TGACTGTCAC TCGAGATCTT GAGGGCACCT ACCTCTGTCG GGCCAGGAGC 1440 

ACTCAAGGGG AGGTCACCCG CGAGGTGACC GTGAATGTGC TCTCCCCCCG GTATGAGATT 1500 

GTCATCATCA CTGTGGTAGC AGCCGCAGTC ATAATGGGCA CTGCAGGCCT CAGCACGTAC 1560 

CTCTATAACC GCCAGCGGAA GATCAAGAAA TACAGACTAC AACAGGCCCA AAAAGGGACC 1620 

CCCATGAAAC CGAACACACA AGCCACGCCT CCCTGAACCT ATCCCGGGAC AGGGCCTCTT 1680 

CCTCGGCCTT CCCATATTGG TGGCAGTGGT GCCACACTGA ACAGAGTGGA AGACATATGC 1740 

CATGCAGCTA CACCTACCGG CCCTGGGAOG CCGGAOQACA GGGCATTGTC CTCAGTCAGA 1800 

TACAACAGCA TTTGGGGCCA TGGTACCTGC ACACCTAAAA CACTAGGCCA CGCATCTGAT 1860 

CTGTAGTCAC ATGACTAAGC CAAGAGGAAG GAGCAAGACT CAAGACATGA TTGATGGATG 1920 

TTAAAGTCTA GCCTGATGAG AGGGGAAGTG GTGGGGGAGA CATAGCCCCA CCATGAGGAC 1980 

ATACAACTGG GAAATACTGA AACTTGCTGC CTATTGGGTA TGCTGAGGCC CACAGACTTA 2040 

CAGAAGAAGT GGCCCTCCAT AGACATGTGT AGCATCAAAA CACAAAGGCC CACACTTCCT 2100 

GACGGATGCC AGCTTGGGCA CTGCTGTCTA CTGACCCCAA CCCTTGATGA TATGTATTTA 2160 

TTCATTTGTT ATTTTACCAG CTATTTATTQ AGTGTCTTTT ATOTAGGCTA AATGAACATA 2220 

GGTCTCTGGC CTCACGGAGC TCCCAGTCCA TGTCACATTC AAGGTCACCA GGTACAGTTG 2280 

TACAGGTTGT ACACTGCAGG AGAGTGCCTG GCAAAAAGAT CAAATGGGGC TGGGACTTCT 2340 

CATTGGCCAA CCTGCCTTTC CCCAGAAGGA GTGATTTTTC TATCGGCACA AAAGCACTAT 2400 

ATGGACTGGT AATGGTTCAC AGGTTCAGAG ATTACCCAGT GAGGCCTTAT TCCTCCCTTC 2460 

CCCCCAAAAC TGACACCTTT GTTAGCCACC TCCCCACCCA CATACATTTC TGCCAGTGTT 2520 

CACAATGACA CTCAGOGGTC ATGTCTGGAC ATGAGTGCCC AGGGAATATG CCCAAGCTAT 2560 

GCCTTGTCCT CTTGTCCTGT TTGCATTTCA CTGGGAGCTT GCACTATTGC AGCTCCAGTT 2640 

TCCTGCAGTG ATCAGGGTCC TGCAAGCAGT GGGGAAGGGG GCCAAGGTAT TGGAGGACTC 2700 

CCTCCCAGCT TTGGAAGGGT CATCCGCGTG TGTGTGTGTG TGTATGTGTA GACAAGCTCT 2760 

CGCTCTGTCA CCCAGGCTGG AGTGCAGTGG TGCAATCATG GTTCACTGCA GTCTTGACCT 2820 

TTTGGGCTCA AGTGATCCTC CCACCTCAGC CTCCTGAOTA GCTGGGACCA TAGGCTCACA 2880 

ACACCACACC TGGCAAATTT GATTTTTTTT TTTTTTTTCA GAGACGGGGT CTCGCAACAT 2940 
TGCCCAGACT TCCTTTGTGT TAGTTAATAA AGCTTTCTCA ACTGCC 

Seq ID No: 141 Protejn sequence: 
Protein Accession #: NP_000192.1 
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1 11 21 31 41 51 

I I I I I I 

MLQFVRAGAR AWLRPTGSQG LSSLAEBAAR ATENPBQVAS EGLPEPVLRK VELPVPTHRR 60 
PVQAWVESLR GFEQERVGLA DLHPDVFATA PRLDILHQVA MWQKMPKRIS YAKTKTRAEV 120 
RGGGGKPLAA BRKWAGPAWQ HPLSALARRR CCFWPPGPTS YYYMLPMKVR AU3LKVALTV 180 
KLAQDDLHIM DSLELPTGDP QYLTELAHYR RWGDSVLLVD LTHEEMPQSI VEATSRLKTF 240 
NLIPAVGLNV KHKLKHQTLV LTLPTVAFLE DKLLWQDSRY RPLYPFSLPY SDFPRPLPHA 300 
TQGPAATPYH C 

Seq ID NO: 142 PWA qequence 

Nucleic Acid Accession #: NM_O00270.1 

Coding sequence i 110-979 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I i I III 

AACTGTGCGA ACCAGACCCG GCAGCCTTGC TCAGTTCAQC ATAGCGQAGC GOATCCGATC 60 

GGATCGGAGC ACACCGGAQC AG6CTCATC0 AGAAQGOOTC TGCGAGAC CA TG GAGAACGG 120 

ATACACCTAT OAAGATTATA AGAACACTGC AOAATQOCTT CTGTCTCATA CTAAGCACCG 180 

ACCTCAAOTT GCAATAATCT GTGGTTCTGG ATTAGGAGGT CTGACTGATA AATTAACTCA 240 

GGCCCAQATC TTTGACTACA GTGAAATCCC CAACTTTCCT CGAAGTACAG TGCCAGGTCA 300 

TGCTGGCCGA CTGGTGTTTG GGTTCCTGAA TGGCAGGGCC TGTGTGATGA TGCAGGGCAG 360 

GTTCCACATG TATGAAGGGT ACCCACTCTG GAAGGTGACA TTCCCAGTGA GGGTTTTCCA 420 

CCTTCTGGGT GTGGACACCC TGGTAQTCAC CAATGCAGCA GGAGOGCTOA ACCCCAAGTT 480 

TGAGGTTGGA GATATCATGC TGATCCGTGA CCATATCAAC CTACCTQGTT TCAGTGGTCA 540 

OAACCCTCTC AQAGGGCCCA ATGATOAAAG GTTTGGAGAT COTTTCCCTG CCATGTCTGA 600 

TGCCTACGAC CGGACTATGA GQCAGAGGOC TCTCAGTACC TGGAAACAAA TGGGGOAOCA 660 

ACGTGAGCTA CAGGAAGGCA CCTATGTGAT GGTGGCAGGC CCCAGCTTTG AGACTGTGGC 720 

AGAATGTCGT GTGCTGCAGA AGCTGGGAGC AGACGCTGTT GGCATGAGTA CAGTACCAGA 780 

AGTTATCGTT GCACGGCACT OTGGACTTCG AGTCTTTGGC TTCTCACTCA TCACTAACAA 840 

GGTCATCATG GATTATGAAA GCCTGGAGAA GGCCAACCAT GAAGAAGTCT TAGCAGCTGG 900 

CAAACAAGCT GCACAGAAAT TGGAACAGTT TGTCTCCATT CTTATGGCCA GCATTCCACT 960 

CCCTQACAAA QCCAQTTQAC CTGCCTTGGA GTCGTCTGGC ATCTCCCACA CAAGACCCAA 1020 

GTAGCTGCTA CCTTCTTTGG CCCCTTGCTG GAGTCATGTG CCTCTGTCCT TAGGTTGTAG 1080 

CAGAAAGGAA AAGATTCCTG TCCTTCACCT TTCCCACTTT CTTCTACCAG ACCCTTCTGG 1140 

TGCCAGATCC TCTTCTCAAA GCTGGGATTA CAGGTGTGAS CATAGTGAGA CCTTGGOGCT 1200 

ACAAAATAAA GCTGTTCTCA TTCCTGTTCT TTCTTACACA AGAGCTGGAG CCCGTGCCCT 1260 

ACCACACATC TCTGGAGATG CCCAGGATTT GACTCGGGCC TTAGAACTTT GCATAGCAGC 1320 

TGCTACTAGC TCTTTGAGAT AATACATTCC GAGGGGCTCA GTTCTGCCTT ATCTAAATCA 1380 
CCAGAGACCA AACAAGGACT AATCCAATAC CTCTTGGA 

Seq ID Noi 143 Pr^ejR sequence: 
Protein Accession #: NP_000261.1 



1 11 21 31 41 51 

i I I I I I 

KENGYTYEDY KNTAEWLLSH TKHRPQVAII CGSGLGGLTD KLTQAQIFDY SEIPNFPRST 60 
VPGHAGRLVP GFLNGRACVM KQGRFHMYEG YPLWKVTFPV RVPHLLGVDT LWTOAAGGL 120 
NPKFEVGDIM LIRDHINLPG FSGQNPLRGP NDERFGDRFP AMSDAYDRTM RQRALSTHKQ 180 
NGEQRELQEG TYVMVAGPSF ETVAECRVLQ KTiGADAVGMS TVPEVIVARH CGLRVFGFSfc 240 
ITNKVIMDYE SLEKANHEEV LAAGKQAAQK LEQFVSILMA SIPLPDKAS 

Seq ID NO: 144 DNA, sequence 

Nucleic Acid Accession #: NM_015577.1 

Coding sequencet 112-3054 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GAAGCGGCGG GCGGGGTGGA GCAGCCAGCT GGGTCCGGGG AGCGCCGCCG CCGCCTCGAT 60 

GGGGTGTTGA AAAGTCTCCT CTAGAGCTTT GGAAGGCTQA ATGCACTAAA CATGAAGAGC 120 

TTGAAAGCGA AGTTCAGGAA GAGTGACACC AATGAGTGGA ACAAOAATGA TGACCGGCTA 180 

CTGCAGGCCQ TGGAGAATGG AGATGCGGAG AAGGTGGCCT CACTGCTCGG CAAGAAGGGG 240 

GCCAGTGCCA CCAAACACGA CAGTGAGGGC AAGACCGCTT TCCATCTTGC TGCTGCAAAA 300 

GGACACGTGG AATGCCTCAG GGTCATGATT ACACATGGTS TGGATGTGAC AGCCCAAGAT 360 

ACTACCGGAC ACAGCGCCTT ACATCTCGCA GCCAAGAACA GCCACCATGA ATGCATCAGG 420 

AGGCTGCTTC AGTCTAAATG CCCAGCCGAA AGTGTCGACA GCTCTQGGAA AACAGCTTTA 480 

CATTATGCAG CGGCTCAGGG CTGCCTTCAA GCTGTGCAGA TTCTCTGCGA ACACAAGAGC 540 

CCCATAAACC TCAAAGATTT GOATGGGAAT ATACOGCTGC TTCTTGCTGT ACAAAATGGT 600 

CACAGTGAGA TCTGTCACTT TCTCCTCGAT CATGGAGCAG ATGTCAATTC CAGGAACAAA 660 

AGTGGAAGAA CTGCTCTCAT GCTGGCCTGT GAGATTGGCA GCTCTAACGC TGTGGAAGCC 720 

TTAATTAAAA AGGGTGCAGA CCTAAACCTT GTAGATTCTC TTGGATACAA TGCCTTACAT 780 

TATTCCAAAC TCTCAGAAAA TGCAGGAATT CAAAGCCTTC TATTATCAAA AATCTCTCAG 840 

GATGCTGATT TAAAGACCCC AACAAAACCA AAGCAGCATG ACCAAGTCTC TAAAATAAGC 900 
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TCAGAAAGAA GTGGAACTCC AAAAACACGC AAAGCTCCAC CACCTCCTAT CAGTCCTACC 960 

CAGTTCAGTG ATGTCTCTTC CCCAAGATCA ATAACTTOOA CTCCACTATC GGGAAAGGAA 1020 

TCGGTATTTT TTGCTGAACC ACCCTTCAAO GCTGAGATCA GTTCTATACG AGAAAACAAA 1080 

GACAGACTAA GTGACAGTAC TACAGGTGCT GATAGCTTAT TGGATATAAG TTCTGAAGCT 1140 

GACCAACAAG ATCTTCTCTC TCTATTGCAA GCAAAAGTTG CTTCCCTTAC CTTACACAAT 1200 

AAGGAGTTAC AAGATAAATT ACAGGCCAAA TCACCCAAGG AGGCGGAAGC AGACCTAAGC 1260 

TTTGACTCAT ACCATTCCAC CCAAACTGAC TTGGGCCCAT CCCTGGGAAA ACCTGGTGAA 1320 

ACCTCTCCCC CAGACTCCAA ATCATCTCCA TCTGTCTTAA TACATTCTTT AGGTAAATCC 1380 

ACTACTGACA ATGATGTCAG AATTCAGCAA CIGCAAGAGA TTTTGCAAGA TCTACAGAAG 1440 

AGATTAGAGA GCTCTGAAGC AGAGAGAAAA CAGCTACAGG TCGAACTCCA ATCCCGAAGQ 1S00 

GCAGAACPGG TATGCTTAAA CAACACTGAG ATTTCAGAGA ACAGCTCTGA CCTCAGCCAG 1560 

AAACTTAAAG AAACTCAGAG CAAATACGAG GAGGCTATGA AAGAAGTCCT TAOTGTGCAG 1620 

AACCAGATGA AACTCGGTCT TGTCTCACCT GAAAGCATGG ATAATTATTC ACATTTCCAC 1680 

GAGCTGAGGG TCACGGAAGA GGAAATAAAT GTGCTAAAGC AGGATCTGCA GAATGCATTA 1740 

GAAGAAAGTG AAAGAAATAA AGAGAAAGTG AGAGAGTTAG AGGAAAAACT GGTAGAGAGG 1800 

GAGAAAGGTA CAQTGATTAA GCCACCTGTG GAAGAGTACG AGGAAATGAA AAGTTCATAT I860 

TGCTCTGTTA TTGAGAATAT GAATAAGGAG AAAGCATTTT TGTTTGAGAA ATACCAAGAA 1920 

GCCCAAGAAG AAATCATGAA ATTAAAAGAC ACACTAAAAA GTCAGATGAC ACAGGAAGCC 1980 

AOTGATGAAG CTGAGGACAT GAAAGAAGCC ATGAATAGGA TGATAGATGA ACTCAATAAA 2040 

CAGGTGAGCG AGCTGTCACA GCTGTACAAA GAAGCCCAGG CTGAGCTGGA GGATTACAGG 2100 

AAGAGGAAAT CTCTAGAGGA TGTCACAGCT GAATATATCC ATAAAGCAGA GCATGAGAAA 2160 

CTGATGCAAT TGACAAAOGT GTCCAGGGCT AAAGGAGAAG ATGCACTGTC TGAAATGAAG 2220 

TCTCAGTATT CAAAAGTGTT GAATGAGTTG ACCCAGCTCA AACAACTGGT GGATGCACAA 2280 

AAAGAGAACT CTGTCTCTAT CACAGAACAT TTGCAAGTGA TAACCACGCT GCGGACTGCA 2340 

GCAAAAGAGA TGGAAGAAAA AATAAGCAAT CTTAAGGAAC ACCTTGCAAG CAAGGAAGTG 2400 

GAAGTAGCAA AGCTGGAGAA ACAACTCTTA GAAGAGAAAG CTGCTATGAC TGATGCAATG 2460 

GTACCTCGGT CTTCCTATGA AAAACTCCAG TCATCCTTAG AGAGTGAAGT GAGTGTGTTG 2520 

GCATCGAAAT TAAAGGAATC TGTGAAAGAG AAAGAGAAGG TCCATTCAGA GGTTGTCCAG 2580 

ATTAGAAGTG AGGTCTCACA GGTGAAAAGA GAAAAGGAAA ATATTCAGAC TCTCTTGAAA 2640 

TCCAAAGAGC AAGAAGTAAA TGAACTTCTG CAAAAATTCC AGCAAGCTCA GGAAGAACTT 2700 

GCAGAAATGA AAAGATACGC TGAGAGCTCT TCAAAACTGG AGGAAGATAA AGATAAAAAG 2760 

ATAAATGAGA TGTCGAAGGA AGTCACCAAA TTGAAGGAGG CCTTGAACAG CCTCTCCCAG 2820 

CTCTCCTACT CAACAAGCTC ATCCAAAAGG CAGAGTCAGC AGCTGGAGGC GCTGCAGCAG 2880 

CAAGTCAAAC AGCTCCAGAA CCAGCTGGCG GAATGCAAGA AACAACACCA GGAGGTCATA 2940 

TCAGTTTACA GAATGCATCT TCTGTATGCT GTGCAGGGCC AGATGGATGA AGATGTCCAG 3000 

AAAGTACTGA AGCAAATCCT TACCATGTGT AAAAACCAGT CTCAAAAGAA GTAAAGTGGA 3060 

TTCCTTGGCA GGACACTGCC CCTTGTCATC TGTCTTTGTG TTAGATCCAG AGTTGTCGGC 3120 

AGCCGCTGCC ATTGTTCTCA TTCGTGGTAT GCACTGTGGC CTAGOOTAGC TTCTTCCCTT 3180 

TCCAAAGGTT TCTGAGGACT TCTCCCAGGA GAAGACTGCC CGCCTCAGAA CTGCTTAGAG 3240 

ACTTCAAACC AGCAGAGGTG AAAGTCCCTG TCATCCCTTC AGATTCCAGA GCTGGGATCA 3300 

GCCATGCCCA GAGGTCTGGT CCTGATGCTG GCAGGGGGGC CCCCTCCTCC ATCCCTGACT 3360 

GGCTGAGTGG CTTTATCACC ACCGAGTGAT GTGCTGAGGC CTCCTGCAGT GAATGCTCCT 3420 

TCCATTCCTG TACTCGGGCA GTGCCATTCA GCACAGGAGA GCTCTTTTTG CCTTTGGCTT 3460 

TCAATTCCAA AACATGATTT AATTTCTAAC TAAATTAGTA TGGCACTAGT TATGAAGTAT 3540 

CTGCTTAAAA CCCTTCATCA TGATATCCTG TGGATTTAAA AACTCTAATT CCATGTTTTC 3600 

TTCCCATCTG CCTTATATAT CTCATCACCC TGCTTATCAA TATTCAGTTT GATGAGCACT 3660 

ATTAACTAAA ATATGAAACT TAAAAACAAA AGCAACTTGT CCTTAAAAGT TCTTTTTTTA 3720 

AGTAAATTGT TGACATACTG CAAATTTTCT ATGCAAACTT GCCTCCTGCT GTTATCTGTG 3780 

AAGCTCAGGA AATCCAAACA TTTGTGTTTC AACAAGGGAC AGTAAACTGT GTGTTTACAG 3840 

CCAAAAGAAA TGCCTCATAG TTCTTAACCT CAACTTTTGT AGAAGTATTT TTTTCTCTGT 3900 

AATATTTTTA TTGGCTCATA AAGATGTTTT CATATCTGAA CTCCTAAATA AGTGAAATTA 3960 

CAGTAGATTA TATTAACAAA ATACTTTTTA GGTAGCCATG CTTGAGACTT TTTAAAAATA 4020 

TAACTTTTTC CTTAAAGTTT TCAGCTATAG CAAAAGGTAG TTATGTATGC CAGACCTAAT 4080 

ATGAGCTGCC ACCAACACCC CTAGAACTTT CAGCCATGGT GTCTTCAGAA TTGTAGCGCA 4140 

TTTCTGAATC TAGCAAATCC TCCTTTTACC CGTTGAATGT TTTGAATGCC CTGACTCTAC 4200 

CAGCGCCCAT AAATGATCTC TAGAAGGACT GTTAGTACCA ATCTGTTTTT CAACTTTGAA 4260 

GCTAAAAACC CTGATATGGT AATATTATGG TGCATAGCAG AGGTCTCGGA AAAAAAATAT 4320 

TTCTGTTCAC TTTACTTTCA GGTTAAAAAT GTTTCTAACA CGCTTGCAAC TTCCCTTATG 4380 

GCATTAATCT TGTTGAGGGA GAGAGACAGA ATCCTGGACT CTCCAAAGTA TTTAACTGAA 4440 

AGTAGGGCCT GCTCTGACAG GGCCCATGTC CCACAAGGCT GCTTGGCCTC AGTGGGTGCT 4500 

TGGCTGTGCT GGATGATATG TTGATCTGTA TTGGATAAGG ACCAATGACA GCAAAGCAAA 4560 

AATGGCTTTA AAGCTTGGTG TTACTTTTCT TAAGTTGTTT AATTATAGTT AAGCAATTTC 4620 

AAAAATGCTC CAAAGAAATG TGAAAGGACC TTTTGTCACA GCACTTCAGA AAATACACAA 4680 

CAGCCCCTTC TGCCCCCGCA CAGAAATGCT GCAGAGTATA TAAAACTTGA GACATTTTTG 4740 

TAGGATGCCT GAOGAGGTGT AGCCTTTTAT CTTGTTTCCG GATGCATATT TATTACGAGT 4800 

ACTCTGGTTA AATATTGAAA AGTTATATGC TGTAGTTTTT AGTATTTTGT CTTTGTAATT 4860 

TACAGAAGTT ATTGGAGAAA ATAAACTTGT TTCATTTTGC AAAAAAAAAA AAAAAAAAAA 4920 
AAAAA 

Seq ID No: 145 Profrgjlp aequeqce? 
Protein Acceaaion #> NP_056392.1 
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MKSLKAKFRK 5DTNEWNKND DRLLQAVENQ DABKVASLLG KKGASATKHD 8BGKTAFHLA 60 



248 



WO 02/079492 



PCT/US02/04915 



AAKGHVECLR VNITHGVDVT AQDTTGHSAL HLAAKNSHHE CIRRLLQSKC PAESVDSSOK 120 

TALHYAAAQG CLQAVQILCE HKSPINLKDL DGNIPLLLAV QNGHSEICHP LLDHQADVNS 180 

RNKSGRTALM LACEIGSSNA VEALIKKGAD LNLVDSLGYN AI4HYSKLSEN AGIQSLLLSK 240 

ISQDADLKTP TKPKQHDQVS KISSERSGTP KTRKAPPPPI SPTQLSDVSS PR5ITSTPLS 300 

5 GKBSVFFABP PPKABISSIR BNKDRLSDST TGADSLLDIS SSADQQDLLS LLQAKVASLT 360 

LHNKELQDKL QAKSPKEAEA DI*SPDSYHST QTDLGPSLGK PGBTSPPDSK SSPSVLIHSL 420 

GKSTTDNDVH IQQLQEILQD LQKRLBSSBA ERKQLQVBLQ SRRAKLVCLH NTBISBNSSD 480 

LSQKLKBTQS KYBBAMKEVL SVQKQMKUGL VSPBSMDNYS HFHELRVTEE EINVLKQDLQ 540 

KAI*EHSERNK EKVRELEBKL VERBKGTVTK PPVEEYBEMK S5YC5VIENM NKBKAFLFBK 600 

10 YQEAQEEIMK LKDTLKSQMT QEASDBABDM KEAMNRMIDE LHKQVSELSQ LYKEAQAEXtE 660 

DYRKRKSLED VTABYIHKAE HBKLKQLTNV SRAKASOALS EMKSQYSKVL NKLTQLKQLV 720 

DAQKENSVSI TEHLQVTTTL RTAAKBMEEK ISNLKEHLAS KEVBVAKLEK QltLBBKAAMT 780 

DAMVPRSSYS KLQSSLESEV SVLASKLKES VKEKEKVHSB WQIRSEVSQ VKREKENIQT 840 

LfcKSKEQEVN BLLQKPQQAQ EELAEMKRYA BSSSKLEEDK DKKXNEMSKE VTKLKEALNS 900 

15 LSQLSYSTS8 SKRQSQQLEA LQQQVKQLQN QLABCKKQHQ EVISVYRNHL LYAVQGQMDE 960 
DVQKVLKQIL TKCKNQSQKK 

8eq ID NO: 146 PNA spqyencq 
Nucleic Acid Accession #: NM_0004S9.1 

20 Coding sequence: 149-3523 (underlined sequences correspond to start and stop codonej 
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25 CTTCTGTGCT GTTCCTTCTT GCCTCTAACT TGTAAACAAG AOGTACTAGG ACGATGCTAA 60 

TGGAAAGTCA CAAACCGCTG GGTTTTTGAA AGGATCCTTG GGACCTCATG CACATTTGTG 120 

GAAACTGGAT GGAGAGATTT GGGGAAGCAXJiQACTCTTTA GCCAGCTTAG TTCTCTGTGG 180 

AGTCAGCTTO CTCCTTTCTG GAACTGTGGA AGGT3CCATG GACTTGATCT TGATCAATTC 240 

CCTACCTCTT GTATCTGATG CTGAAACATC TCTCACCTGC ATTGCCTCTG GGTGGCCCCC 300 

30 CCATGAGCCC ATCACCATAG GAAGGGACTT TGAAGCCTTA ATGAACCAGC ACCAGGATCC 360 

GCTGGAAGTT ACTCAAGATC TCACCAGAGA ATQGGCTAAA AAAGTTGTTT GGAAGAGAGA 420 

AAAGGCTAGT AAGATCAATG GTGCTTATTT CTGTGAAGGG CGAGTTCGAG GAGAGGCAAT 480 

CAGGATACGA ACCATGAAGA TGCGTCAACA AGCTTCCTTC CTACCAGCTA CTTTAACTAT 540 

GACTGTGGAC AAGGGAGATA ACGTGAACAT ATCTTTCAAA AAGGTATTGA TTAAAGAAGA 600 

35 AGATGCAGTG ATTTACAAAA ATGGTTCCTT CATCCATTCA GTGCCCCGGC ATGAAGTACC 660 

TGATATTCTA GAAGTACACC TGCCTCATGC TCAGCCCCAG GATGCTGGAG TGTACTCGGC 720 

CAGGTATATA "GGAGGAAACC TCTTCACCTC GGCCTTCACC AGGCTGATAG TCCGGAGATG 780 

TGAAGCCCAG AAGTGGGGAC CTGAATGCAA CCATCTCTGT ACTGCTTGTA TGAACAATGG 840 

TOTCTGCCAT GAAGATACTG GAGAATGCAT TTGCCCTCCT GGGTTTATGG GAAGGACGTG 900 

40 TGAGAAGGCT TGTGAACTGC ACACGTTTGG CAGAACTTGT AAAGAAAGGT GCAGTGGACA 960 

AGAGGGATGC AAGTCTTATG TGTTCTGTCT CCCTQACCOC TATGGGTGTT CCTGTGCCAC 1020 

AGGCTGGAAG GGTCTGCAGT GCAATGAAGC ATGCCACCCT GGTTTTTACG GGCCAGATTG 1080 

TAAGCTTAGG TGCAOCTGCA ACAATGGGGA GATGTGTGAT CGCTTCCAAG GATGTCTCTG 1140 

CTCTCCAGGA TGGCAGGGGC TCCAGTGTGA GAGAGAAGGC ATACCGAGGA TGACCCCAAA 1200 

45 GATAGTGGAT TTGCCAGATC ATATAGAAGT AAACAGTGGT AAATTTAATC CCATTTGCAA 1260 

AGCTTCTGGC TGGCCGCTAC CTACTAATGA AGAAATGACC CTGGTOAAGC CGGATGGGAC 1320 

AGTGCTCCAT CCAAAAGACT TTAACCATAC GGATCATTTC TCAGTAGCCA TATTCACCAT 1380 

CCACCGGATC CTCCCCCCTG ACTCAGGAGT TTGGGTCTGC AGTGTGAACA CAGTGGCTGG 1440 

GATGGTGGAA AAGCCCTTCA ACATTTCTGT TAAAGTTCTT CCAAAGCCCC TGAATGCCCC 1500 

50 AAACGTGATT GACACTGGAC ATAACTTTGC TGTCATCAAC ATCAGCTCTG AGCCTTACTT 1560 

TGGGGATGGA CCAATCAAAT CCAAGAAGCT TCTATACAAA CCCGTTAATC ACTATGAGGC 1620 

TTGGCAACAT ATTCAAGTGA CAAATGAGAT TGTTACACTC AACTATTTGG AACCTCGGAC 1680 

AGAATATGAA CTCTGTGTGC AACTGGTCCG TCGTOGAGAG GGTGGGGAAG GGCATCCTGG 1740 

ACCTGTGAGA CGCTTCACAA CAGCTTCTAT CGGACTCCCT CCTCCAAGAG GTCTAAATCT 1800 

55 CCTGCCTAAA AGTCAGACCA CTCTAAATTT GACCTGGCAA CCAATATTTC CAAGCTCGGA 1860 

AGATGACT7T TATGTTGAAG TGGAGAGAAG GTCTGTGCAA AAAAGTGATC AGCAGAATAT 1920 

TAAAGTTCCA GGCAACTTGA CTTCGGTGCT ACTTAACAAC TTACATCCCA GGGAGCAGTA 1980 

CGTGGTCCGA GCTAGAGTCA ACACCAAGGC CCAGGGGGAA TGGAGTGAAG ATCTCACTGC 2040 

TTGGACCCTT AGTGACATTC TTCCTCCTCA ACCAGAAAAC ATCAAGATTT CCAACATTAC 2100 

60 ACACTCCTCG GCTGTGATTT CTTGGACAAT ATTGGATGGC TATTCTATTT CTTCTATTAC 2160 

TATCCGTTAC AAGGTTCAAG GCAAGAATGA AGACCAGCAC GTTGATGTGA AGATAAAGAA 2220 

TGCCACCATC ATTCAGTATC AGCTCAAGGG CCTAGAGCCT GAAACAGCAT ACCAGGTGGA 2280 

CATTTTTGCA GAGAACAACA TAGGGTCAAG CAACCCAGCC TTTTCTCATG AACTGGTGAC 2340 

CCTCCCAGAA TCTCAAGCAC CAGOOGACCT CG3AGGGGOG AAGATGCTGC TTATAGCCAT 2400 

65 CCTTGGCTCT* GCTGGAATGA CCTGCCTGAC TGTGCTGTTG GCCTTTCTGA TCATATTGCA 2460 

ATTGAAGAGG GCAAATGTGC AAAGGAGAAT GGCCCAAGCC TTCCAAAACG TGAGGGAAGA 2520 

ACCAGCTGTG CAGTTCAACT CAGGGACTCT GGCCCTAAAC AGGAAGGTCA AAAACAACCC 2580 

AGATCCTACA ATTTATCCAG TGCTTGACTG GAATGACATC AAATTTCAAG ATGTGATTGG 2640 

GGAGGGCAAT TTTGGCCAAQ TTCTTAAGGC GCGCATCAAG AAGGATGGGT TACGGATGOA 2700 

70 TGCTGCCATC AAAAGAATGA AAGAATATGC CTCCAAAGAT GATCACAGGG ACTTTGCAGG 2760 

AGAACTGGAA GTTCTTTOTA AACTTGGACA CCATCCAAAC ATCATCAATC TCTTAGGAGC 2820 

ATGTGAACAT CGAGGCTACT TGTACCTGGC CATTGAGTAC GCGCCCCATG GAAACCTTCT 2880 

GGACTTCCTT CGCAAGAGCC GTGTGCTGGA GACGGACCCA GCATTTGCCA TTGCCAATAG 2940 

CACCGCGTCC ACACTGTCCT CCCAGCAGCT CCTTCACTTC GCTGCCGACG TGGCCCGGGG 3000 

75 CATGGACTAC TTGAGCCAAA AACAGTTTAT CCACAGGGAT CTGGCTGCCA GAAACATTTT 3060 

AGTTGGTGAA AACTATGTGG CAAAAATAGC AGATTTTGGA TTGTCCCGAG GTCAAGAGGT 3120 
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GTACGTGAAA AAGACAATGG GAAGGCTCCC AGTGCGCTGQ ATGGCCATCG AGTCACTGAA 3180 

TTACAGTOTG TACACAACCA ACAGTGATGT ATGGTCCTAT GGTGTGTTAC TATGGGAGAT 3240 

TGTTAGCTTA OGAGGCACAC CCTACTGCGG GATGACTTGT GCAGAACTCT AOGAGAAGCT 3300 

GCCCCAGGGC TACAGACTGG AGAAGCCCCT GAACTGTGAT GATGAOGTGT ATGATCTAAT 3360 

GAGACAATGC TGGCGGGAGA AOCCTTATGA GAOGCCATCA TTTGCCCAGA TATTGGTGTC 3420 

CTTAAACAGA ATGTTAGAGG AGCGAAAGAC CTAOGTGAAT ACCACGCTTT ATGAGAAGTT 3480 

TACTTATGCA GGAATTGACT GTTCTGCTGA AGAAGOGGCC TAGGACAQAA CATCTGTATA 3540 

CCCTCTGTTT CCCTTTCACT GGCATGGGAG ACCCTTGACA ACTGCTGAGA AAACATGCCT 3600 

CTGCCAAAGG ATGTGATATA TAAGTGTACA TATGTGCTGG AATTCTAACA AGTCATAGGT 3660 

TAATATTTAA GACACTGAAA AATCTAAGTG ATATAAATCA GATTCTTCTC TCTCATTTTA 3720 

TCCCTCACCT GTAGCATGCC AGTCCCGTTT CATTTAGTCA TGTGACCACT CTGTCTTGTG 3780 

TTTCCACAGC CTGCAAGTTC AOTCCAGGAT GCTAACATCT AAAAATAGAC TTAAATCTCA 3B40 

TTGCTTACAA GCCTAAGAAT CTTTAGAGAA GTATACATAA GTTTAGGATA AAATAATGGG 3900 

ATTTTCTTTT CTTTTCTCTG GTAATATTGA CTTGTATATT TTAAGAAATA ACAGAAAGCC 3960 

TGGGTGACAT TTGGGAGACA TGTGACATTT ATATATTGAA TTAATATCCC TACATGTATT 4020 

GCACATTGTA AAAAGTTTTA GTTTTGATGA GTTGTGAGTT TACCTTGTAT ACTGTAGGCA 4080 
CACTTTGCAC TGATATATCA TGAQTGAATA AATGTCTTGC CTACTCAAAA AAAAAAAA 

Seq ID Mot 147 Protein sequence i 
Protein Accession #: KP_0004S0.1 
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I I I I I I 

MDSIASLVLC GV8LLLSGTV EGAMDLILIN SLPL.VSDABT SLTCIASGWR PHEPITIGRD 60 

FBALMNQHQO PLEVTQDVTR EWAKKWWKR EKASKINGAY FCEGRVRGBA IRIRTMKMRQ 120 

QASFLPATLT MTVDKGDNVN ISPKKVLIXE BDAVIYKNGS, FIHSVPRHSV PDILBVHLPH 180 

AQPQDAGVYS ARYIGGNLFT SAPTRLIVRR CKAQKKGPEC KHLCTACKNN GVCHSDTGBC 240 

ICPPGPMGRT CEKACKLHTP GRTCKBRCSG QEGCKSYVFC LPDPYGCSCA TGWKGLQCNE 300 

ACHPGFYGPD CKLRCSCNNG EMCDRFQGCL CSPGWQGLQC ERBGIPRMTP KIVDLPDHIE 360 

VNSGKFNPIC KASGWPLPTN EEMTLVKPDG TVLHPKDPNH TDH7SVAIPT IHRILPPDSG 420 

VWVCSVNTVA GMVEKPFNIS VKVLPKPLNA PNVIDTGHNP AVINISSEPY FGDGPIKSKK 480 

LLYKPVNHYE AWQHIQVTNB IVTLNYLEPR TEYELCVQLV RRGEGGEGHP GPVRRFTTAS 540 

IGLPPPRGLN LLPKSQTTLN LTWQPIFPSS EDDFYVBVBR RSVQKSDQQN IKVPGNLTSV 600 

LLNNLHPREQ YWRARVNTK AQGHWSBDLT AWTL5DILPP QPENIKISKI THSSAVXSWT 660 

ILDGYS1SSI TIRYKVQGKN EDQHVDVKIK NAT1IQYQLK GLEPBTAYQV DIFAENNIGS 720 

SNPAFSHELV TLPESQAPAD LGGGKMLLIA ILGSAGMTCfc TVLLAFLIIL QLKRANVQRR 780 

MAQAFQNVRE EPAVQFNSGT LALNRKVKNN PDPTIYPVLD WNDIKPQDVI GBGNFGGVLK 840 

ARIKKDGIiRM DAAIKRMKEY ASKDDHRDFA GBLBVLCKIia HHPHI1NLI/3 ACEHRGYLYL 900 

AIEYAPHGNL LDFLRKSRVL ETDPAFAIAM STASTLSSQQ LLHFAADVAR GMDYLSQKQF 960 

IHRDLAARNI LVGENYVAKI ADFGLSRGQK VYVKKTMGRL PVRWMAIESL NYSVYTTNSD 1020 

VWSYGVLLWE IVSLGGTPYC GMTCAELYEK LPQGYRLEKP LNCDDEVYDL NRQCWRBKPY 1080 
ERPSFAQILV SLWRMLBBRK TYVNTTLYEK FTYAGIDCSA EEAA 

Seq ID NO: 148 DNA sequence 

Nucleic Acid Accession #: NM 000552.2 

Coding sequence: 311-8752 (underlined sequences correspond to start and stop codons) 
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111)11 

AGCTCACAGC TATTGTGGTG GGAAAGGGAG GGTGGTTGOT GGATGTCACA GCTTGGGCTT 60 

TATCTCCCCC AGCAGTGGGG ACTCCACAGC CCCTGGGCTA CATAACAGCA AGACAGTCCG 120 

GAGCTGTAGC AGACCTGATT GAGCCTTTGC AGCAGCTGAG AGCATGGCCT AGGGTGGGCG 180 

GCACCATTGT CCAGCAGCTG AGTTTCCCAG GGACCTTGGA GATAGCOGCA GCCCTCATTT 240 

GCAGGGGAAG GCACCATTGT CCAGCAGCTG AGTTTCCCAG GGACCTTGGA GATAGCCGCA 300 

GCCCTCATTT ATCATTCCTG CCAGATTTGC CGGGGTGCTG CTTGCTCTGG CCCTCATTTT 360 

GCCAGGGACC CTTTGTGCAG AAGGAACTCG CGGCAGGTCA TCCACGGCCC GATGCAGCCT 420 

TTTCGGAAGT GACTTCGTCA ACACCTTTGA TGGGAGCATG TACAGCTTTG CGGGATACTG 480 

CAGTTACCTC CTGGCAGGGG GCTGCCAGAA ACGCTCCTTC TCGATTATTG GGGACTTCCA 540 

GAATGGCAAG AGAGTGAGCC TCTCCGTGTA TCTTGGGGAA TTTTTTGACA TCCATTTGTT 600 

TGTCAATGGT ACCGTGACAC AGGGGGACCA AAGAGTCTCC ATGCCCTATG CCTC CAAAG G 660 

GCTGTATCTA GAAACTGAGG CTGGGTACTA CAAGCTGTCC GGTGAGGCCT ATGGCTTTGT 720 

GGCCAGGATC GATGGCAGCG GCAACTTTCA AGTCCTGCTG TCAGACAGAT ACTTCAACAA 780 

GACCTGCGGG CTGTGTGGCA ACTTTAACAT CTTTGCTGAA GATGACTTTA TGACCCAAGA B40 

AGGGACCTTG ACCTCGGACC CTTATGACTT TGCCAACTCA TGGGCTCTGA GCAGTGGAGA 900 

ACAGTGGTGT GAAOGGGCAT CTCCTCCCAG CAGCTCATGC AACATCTCCT CTGGGGAAAT 960 

GCAGAAGGGC CTGTGGGAGC AGTGCCAGCT TCTGAAGAGC ACCTCGGTGT TTGCCCGCTG 1020 

CCACCCTCTG GTGGACCCCG AGCCTTTTGT GGCCCTGTGT GAGAAGACTT TGTGTGAGTG 1080 

TGCTGGGGGG CTGGAGTGCG CCTGCCCTGC CCTCCTGGAG TACGCCCGGA CCTGTGCCCA 1140 

GGAGGGAATG GTGCTGTACG GCTGGACCGA CCACAGCGCG TGCAGCCCAG TGTGCCCTGC 1200 

TGGTATGGAG TATAGGCAGT GTGTGTCCCC TTGCGCCAGG ACCTGCCAGA GCCTGCACAT 1260 

CAATGAAATG TGTCAGGAGC GATGCGTGGA TGGCTGCAGC TGCCCTGAGG GACAGCTCCT 1320 

GGATGAAGGC CTCTGCGTGG AGAGCACCGA QTOTCCCTGC GTGCATTCCG GAAAGCGCTA 1380 

CCCTCCCGGC ACCTCCCTCT CTCGAGACTG CAACACCTGC ATTTGCCGAA ACAGCCAOTG 1440 

GATCTGCAGC AATGAAGAAT GTCCAGGGGA GTGCCTTGTC ACTGGTCAAT CCCACTTCAA 1500 
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GAGCTTTGAC AACAGATACT TCACCTTCAG 
TTGCCAGGAC CACTCCTTCT CCATTGTCAT 
CGCTGTQTQC ACCCQCTCCG TCACCQTCOG 
ACTGAAGCAT GGGGCAGGAG TTGCCATGGA 
5 AGGTGACCTC CGCATCCAGC ATACAGTGAC 
CCTGCAGATG GACTGGGATG GCCGCGGGAG 
CGGOAAGACC TGCGGCCTGT GTGGGAATTA 
CCCCTCTGGG CTGGCAGAGC CCCGGGTGGA 
GGACTGCCAG GACCTGCAGA AGCAGCACAG 

10 CAGQTTCTCC GAGGAGGCGT GCGOGGTCCT 
TGCCGTCAGC CCGCTGCCCT ACCTGCGGAA 
CGGCCGCGAG TGCCTGTGCG GOGCCCTGGC 
CGTGCGCGTC GCGTGGCGCG AGCCAGGCCG 
GTACCTGCAG TGCGGGACCC CCTGCAACCT 

15 GGAATGCAAT GAGGCCTGCC TGGAGGGCTG 
GAGGGGGGAC TGCGTGCCCA AGGCCCAGTG 
GCCAGAAGAC ATCTTCTCAG ACCATCACAC 
CTGTACCATG AGTGGAGTCC CCGGAAGCTT 
GTCTCATCGC AGCAAAAGQA GCCTATCCTG 

20 CGCTOACAAC CTGCGGGCTG AAGGGCTCGA 
GGAGTGCATG AGCATGGGCT GTGTCTCTGG 
GGAGTGCATG AGCATGGGCT GTGTCTCTGG 
TGAGAACAGA TGTGTGGCCC TGGAAAGGTG 
CCCTGGAGAA ACAGTGAAGA TTGGCTGCAA 

25 CTGCACAGAC CATGTGTGTG ATGCCACGTG 
CTTCGACGGG CTCAAATACC TGTTCCCCGG 
CTGCGGCAGT AACCCTGGGA CCTTTCGGAT 
CTCAGTGAAA TGCAAGAAAC GGGTCACCAT 
TGACGGGGAG GTGAATGTGA AGAGGCCCAT 

30 GTCTGGCCGG TACATCATTC TGCTGCTGGG 
CCTGAGCATC TCCGTGGTCC TGAAGCAGAC 
GAATTTTGAT GGCATCCAGA ACAATGACCT 
CCCTGTGGAC TTTGGGAACT CCTGGAAAGT 
GCCTCTGGAC TCATCCCCTG CCACCTGCCA 

35 TTCCTCCTGT AGAATCCTTA CCAGTGACGT 
CGAGCCATAT CTGGATGTCT GCATTTACGA 
CGCCTGCTTC TGCGACACCA TTGCTGCCTA 
GGTGACCTGG AGGACGGCCA CATTGTGCCC 
GAACGGGTAT GAGTGTGAGT GGCGCTATAA 

40 TCAGCACCCT GAGCCACTGG CCTGCCCTGT 
CCCTCCAGGG AAAATCCTGG ATGAGCTTTT 
AGTGTGTGAG GTGGCTGGCC ' GGCGTTTTGC 
TGACCCTGAG CACTGCCAGA TTTGCCACTQ 
CCAGGAGCCG GGAGGCCTGG TGGTGCCTCC 

45 GTATGTGGAG GACATCTCGG AACCGCCGTT 
CCTGGTCTTC CTGCTGGATG GCTCCTCCAG 
GGCCTTTGTG GTGGACATGA TGGAGCGGCT 
CGTGGTGGAG TACCACGACG GCTCCCACGC 
GTCAGAGCTG CGGCGCATTG CCAGCCAGGT 

50 CAGCGAGGTC TTGAAATACA CACTGTTCCA 
CTCCCGCATC GCCCTGCTCC TGATGGCCAG 
TGTCCGCTAC GTCCAGGGCC TGAAGAAGAA 
GCCCCATGCC AACCTCAAGC AGATCCGCCT 
CTTCGTGCTG AGCAGTGTGG ATGAGCTGGA 

55 CTGTGACCTT GCCCCTGAAG CCCCTCCTCC 
TGTGGGCCCO GGGCTCTTGG GGGTTTCGAC 
GGATGTGGCG TTCGTCCTGG AAGGATCGGA 
CAAGGAGTTC ATGGAGGAGG TGATTCAGCG 
CACGOTGCTG CAGTACTCCT ACATGGTGAC 

60 CAAAGGGGAC ATCCTGCAGC GGGTGCGAGA 
CACTGGGCTG GCCCTGCGGT ACCTCTCTGA 
GGAGCAGGCG CCCAACCTGG TCTACATGGT 
GAGGCTGCCT GGAGACATCC AGGTGGTGCC 
GGAGCTGGAO AGGATTGGCT GGCCCAATGC 

65 CCCCCGAGAG GCTCCTGACC TGGTGCTGCA 
CCCCACCCTC TCCCCTGCAC CTGACTGCAG 
TGGCTCCTCC AGTTTCCCAG CTTCTTATTT 
CATTTCAAAA GCCAATATAG GGCCTCGTCT 
CATCACCACC ATTGACGTGC CATGGAACGT 

70 TGTGGACGTC ATGCAGCGGG AGGGAGGCCC 
TGTGCGATAC TTGACTTCAG AAATGCATGG 
CATCCTGGTC ACGGACGTCT CTGTGGATTC 
. CAACAGAGTG ACAGTGTTCC CTATTGGAAT 
GATCTTGGCA GGCCCAGCAG GCGACTCCAA 

75 CCCTACCATG GTCACCTTGG GCAATTCCTT 
GATTTGCATG GATGAGGATG GGAATGAGAA 



TGGGATCTGC CAGTACCTGC TGGCCCGGGA 1S60 

TGAGACTGTC CAGTGTGCTG ATGACCGCGA 1620 

GCTGCCTGGC CTGCACAACA GCCTTGTGAA 1680 

TGGCCAGGAC ATCCAGCTCC CCCTCCTGAA 1740 

GGCCTCCGTG CGCCTCAGCT ACGGGGAGGA 1800 

GCTGCTGGTG AAGCTGTCCC CCGTCTACGC 1060 

CAATGGCAAC CAGGGCGACG ACTTCCTTAC 1920 

GGACTTCGGG AAOGCCTGGA AGCTGCACGG 1980 

CGATCCCTGC GCCCTCAACC CGCGCATGAC 2040 

GACGTCCCCC ACATTCGAGG CCTGCCATCG 2100 

CTGCCGCTAC GACGTGTGCT CCTGCTCGGA 2160 

CAGCTATGCC GCGGCCTGCG CGGGGAGAGG 2220 

CTGTGAGCTG AACTGCCCGA AAGGCCAGGT 2280 

GACCTGCCGC TCTCTCTCTT ACCCGGATGA 2340 

CTTCTGCCCC CCAGGGCTCT ACATGGATGA 2400 

CCCCTGTTAC TATGACGGTG AGATCTTCCA 2460 

CATGTGCTAC TGTGAGGATG GCTTCATGCA 2520 

GCTGCCTGAC GCTGTCCTCA GCAGTCCCCT 2580 

TCGGCCCCCC ATGGTCAAGC TGGTGTGTCC 2640 

GTGTACCAAA ACGTGCCAGA ACTATGACCT 2700 

CTGCCTCTGC CCCCCGGGCA TGGTCCGGCA 2760 

CTGCCTCTGC CCCCCGGGCA TGGTCCGGCA 2760 

TCCCTGCTTC CATCAGGGCA AGGAGTATGC 2820 

CACTTGTGTC TGTCGGGACC GGAAGTGGAA 2880 

CTCCACGATC GGCATGGCCC ACTACCTCAC 2940 

GGACTGCCAG TACGTTCTGG TGCAGGATTA 3000 

CCTAGTGGGG AATAAGGGAT GCAGCCACCC 3060 

CCTGGTGGAG GGAGGAGAGA TTGAGCTGTT 3120 

GAAGGATGAG ACTCACTTTG AGGTGGTGGA 3180 

CAAAGCCCTC TCCGTGGTCT GGGACCGCCA 3240 

ATACCAGGAG AAAGTGTGTG GCCTGTGTGG 3300 

CACCAGCAGC AACCTCCAAG TGGAGGAAGA 3360 

GAGCTCGCAG TOTGCTGACA CCAGAAAAGT 3420 

TAACAACATC ATGAAGCAGA CGATGGTGGA 3480 

CTTCCAGGAC TGCAACAAGC TGGTGGACCC 3540 

CACCTGCTCC TGTGAGTCCA TTGGGGACTG 3600 

TGCCCACGTG TGTGCCCAGC ATGGCAAGGT 3660 

CCAGAGCTGC GAGGAGAGGA ATCTCCGGGA 3720 

CAGCTGTGCA CCTGCCTGTC AAGTCACGTG 3780 

GCAGTGTGTG GAGGGCTGCC ATGCCCACTG 3840 

GCAGACCTGC GTTGACCCTG AAGACTGTCC 3900 

CTCAGGAAAG AAAGTCACCT TGAATCCCAG 3960 

TGATGTTGTC AACCTCACCT GTGAAGCCTG 4020 

CACAGATGCC CCGGTGAGCC CCACCACTCT 4080 

GCACGATTTC TACTGCAGCA GGCTACTGGA 4140 

GCTGTCCGAG GCTGAGTTTG AAGTGCTGAA 4200 

GOGCATCTCC CAGAAGTGGG TCCGCGTGGC 4260 

CTACATCGGG CTCAAGGACC GGAAGCGACC 4320 

GAAGTATGCG GGCAGCCAGG TGGCCTCCAC 4380 

AATCTTCAGC AAGATCGACC GCCCTGAAGC 4440 

CCAGGAGCCC CAACGGATGT CCCGGAACTT 4500 

GAAGGTCATT GTGATCCCGG TGGGCATTGG 4560 

CATCGAGAAG CAGGCCCCTG AGAACAAGGC 4620 

GCAGCAAAGG GACGAGATCG TTAGCTACCT 4680 

TACTCTGCCC CCCCACATGG CACAAGTCAC 4740 

CCTGGGGCCC AAGAGGAACT CCATGGTTCT 4800 

CAAAATTGGT GAAGCCGACT TCAACAGGAG 4860 

GATGGATGTG GGCCAGGACA GCATCCACGT 4920 

CGTGGAGTAC CCCTTCAGCG AGGCACAGTC 4980 

GATCCGCTAC CAGGGCGGCA ACAGGACCAA 5040 

CCACAGCTTC TTGGTCAGCC AGGGTGACCG 5100 

CACCGGAAAT CCTGCCTCTG ATGAGATCAA S160 

CATTGGAGTG GGCCCTAATG CCAACGTGCA 5220 

CCCTATCCTC ATCCAGGACT TTGAGACGCT 5280 

GAGGTGCTGC TCCGGAGAGG GGCTGCAGAT S340 

CCAGCCCCTG GACGTGATCC TTCTCCTGGA 5400 

TGATGAAATG AAGAGTTTCG CCAAGGCTTT 5460 

CACTCAGGTG TCAOTGCTGC AGTATGGAAG 5520 

GGTCCCGGAG AAAGCCCATT TGCTGAGCCT 5580 

CAGCCAAATC GGGGATGCCT TGGGCTTTGC 5640 

TGCCAGGCCG GGAGCCTCAA AGGCGGTGGT 5700 

AGTGGATGCA GCAGCTGATG CCGCCAGGTC 5760 

TGGAGATCGC TACGATGCAG CCCAGCTACG 5830 

CGTGGTGAAG CTCCAGCGAA TCGAAGACCT 5880 

CCTCCACAAA CTGTGCTCTG GATTTGTTAG 5940 

GAGGCCCGGG GACGTCTGGA CCTTGCCAGA 6000 
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CCAGTGCCAC ACCGTGACTT GCCAGCCAGA TGGCCAGACC TTGCTGAAGA GTCATCGGGT 6060 

CAACTQTGAC CGGGGGCTGA OGCCTTOGTG CCCTAACAGC CAGTCCCCTG TTAAAGTGGA 6120 

AGAGACCTGT GGCT GCCGCT GOACCTGCCC CTGCGTGTGC ACAGQCAGCT CCACTCOGCA 6180 

CATQQTGACC TTTGATGGGC AGAATTTCAA GCTGACTGGC AGCTGTTCTT ATGTCCTATT 6240 

5 TCAAAACAAG GAGCAGGAOC TGGAGGTGAT TCTCCATAAT GGTQCCTGCA GCCCTGGAGC 6300 

AAGGCAGGGC TGCATGAAAT CCATCGAGGT GAAGCACAGT OCCCTCTCOG TCGAGCTOCA 6360 

CAGTGACATG GAGGTGACGG TGAATGGGAG ACTGGTCTCT GTTCCTTACG TGGGTGGGAA 6420 

CATGGAAGTC AACGTTTATG GTGCCATCAT GCATGAGOTC AGATTCAATC ACCTTGGTCA 6480 

CATCTTCACA TTCACTCCAC AAAACAATGA GTTCCAACTO CAOCTCAGCC CCAAGACTTT 6S40 

10 TGCTTCAAAO ACGTATGGTC TGTGTGGGAT CTOTGATQAO AACGGAGCCA ATGACTTCAT 6600 

GCTQAGGGAT GGCACAQTCA CCACAGACTQ QAAAACACTT GTTCAGGAAT GGACTGTGCA 6660 

GCGGCCAGGG CAGAOGTGCC AGCCCATCCT GOAGGAGCAG TGTCTTGTCC CCGACAOCTC 6720 

CCACTGCCAG GTCCTCCTCT TACCACTGTT TGCTGAATGG CACAAGGTCC TOGCTCCAGC 6780 

CACATTCTAT GCCATCTGCC AGCAGGACAG TTGCCACCAG GAGCAAGTGT GTGAGGTGAT 6840 

15 CGCCTCTTAT GCCCACCTCT GTCGGACCAA GGGGGTCTGC GTTGACTOOA GGACACCTGA 6900 

TTTCTGTGCT ATOTCATGCC CACCATCTCT GGTCTACAAC CACTGTGAQC ATGOCTGTCC 6960 

CCGGCACTGT GATGGCAAOG TGAGCTCCTG TGGGGACCAT CCCTCCGAAO GCTGTTTCTG 7020 

CCCTCCAGAT AAAGTCATGT TGGAAGGCAG CTGTGTCCCT GAAGAGGCCT GCACTCAGTG 7080 

CATTGGTGAG GATGGAGTCC AGCACCAGTT CCTGGAAGCC TGGGTCCCGG ACCACCAGCC 7140 

20 CTGTCAGATC TGCACATGCC TCAGCGGGCG GAAGGTCAAC TGCACAACGC AGCCCTGCCC 7200 

CACGGCCAAA GCTCCCACGT GTGGCCTGTG TGAAGTAGCC CGCCTCCGCC AGAATGCAGA 7260 

CCAGTGCTGC CCCGAGTATG AGTGTGTGTG TGACCCAGTG AGCTGTGACC TGCCCCCAGT 7320 

GCCTCACTGT GAACGTGGCC TCCAGCCCAC ACTGACCAAC CCTGGCGAGT GCAGACCCAA 7380 

CTTCACCTGC GCCTGCAGGA AGGAGGAGTG CAAAAGAGTG TCCCCACCCT CCTGCCCCCC 7440 

25 GCACOGTTTG CCCACCCTTC GGAAGACCCA GTGCTGTGAT GAGTATGAGT GTGCCTGGAA 7500 

CTGTGTCAAC TCCACAGTGA GCTGTCCCCT TGGGTACTTG GCCTCAACCG CCACCAATGA 7560 

CTGTGGCTGT ACCACAACCA CCTGCCTTCC CGACAAGGTG TGTGTCCACC GAAGCACCAT 7620 

CTACCCTGTG GGCCAGTTCT GGGAGGAGGG CTGCGATOTG TGCACCTGCA CCGACATGGA 7680 

GGATGCCGTG ATGGGCCTCC GCGTGGCCCA GTGCTCCCAG AAGCCCTGTG AGGACAGCTG 7740 

30 TCGGTCGGGC TTCACTTACG TTCTGCATGA AGGCGAGTGC TGTGGAAGGT GCCTGCCATC 7800 

TGCCTGTGAG GTGGTGACTG GCTCACCGCG GGGGGACTCC CAGTCTTCCT GGAAGAGTGT 7860 

OGGCTCCCAG TGGGCCTCCC OGGAGAACCC CTGCCTCATC AATGAGTGTG TCCGAGTGAA 7920 

GGAGGAGGTC TTTATACAAC AAAGGAACGT CTCCTGCCCC CAGCTGGAGG TCCCTGTCTG 7980 

CCCCTGGGGC TTTCAGCTGA GCTGTAAGAC CTCAGCGTGC TGCCCAAGCT GTCGCTGTGA 8040 

35 GCGCATGGAG GCCTGCATGC TCAATGGCAC TGTCATTGGG CCCGGGAAGA CTGTGATGAT 6100 

CGATGTGTGC ACGACCTGCC GCTGCATGGT GCAGGTGGGG GTCATCTCTG GATTCAAGCT 8160 

GGAGTGCAGG AAGACCACCT GCAACCCCTG CCCCCTGGGT TACAAGGAAG AAAATAACAC 8220 

AGGTGAATGT TGTGGGAGAT GTTTGCCTAC GGCTTGCACC ATTCAGCTAA GAGGAGGACA 8280 

GATCATGACA CTGAAGCGTG ATQAGACGCT CCAGGATGGC TGTGATACTC ACTTCTGCAA 8340 

40 GGTCAATGAG AGAGGAGAGT ACTTCTGGGA GAAGAGGOTC ACAGGCTGCC CACCCTTTGA 8400 

TGAACACAAG TGTCTGGCTG AGGGAGGTAA AATTATGAAA ATTCCAGGCA CCTGCTGTGA 8460 

CACATGTGAG GAGCCTGAGT GCAACGACAT CACTGCCAGG CTGCAGTATG TCAAGGTGGG 8520 

AAGCTGTAAG TCTGAAGTAG AGGTGGATAT CCACTACTGC CAGGGCAAAT GTGCCAGCAA 8580 

AGCCATGTAC TCCATTGACA TCAACGATGT GCAGGACCAG TGCTCCTGCT GCTCTCCGAC 8640 

45 ACGGAOGGAG CCCATGCAGG TGGCCCTGCA CTGCACCAAT GGCTCTGTTG TGTACCATGA 8700 

GGTTCTCAAT GCCATGGAGT GCAAATGCTC CCCCAGGAAG TGCAGCAAG T GAO GCTGCTG 8760 

CAGCTGCATG GGTGCCTGCT GCTGCCTGCC TTGGCCTGAT GGCCAGGCCA GAGTGCTGCC 8820 

AGTCCTCTGC ATGTTCTGCT CTTGTGCCCT TCTGAGCCCA CAATAAAGGC TGAGCTCTTA 8880 
TCTTGCTGCA TGTTCTGCTC TTGTGCCCTT CTGAGCCCAC AAT 



50 



Seq ID No: 149 Prqfrejn peqmfflcg: 
Protein Accession <h NP 0OOS43.1 



55 l 11 21 31 41 51 

I I I I I I 

MIPARFAGVL LALALILPGT LCAEGTRGRS STARCSLPGS DFVNTFDGSM YSPAGYCSYL 60 

LAGGCQKRSF SIIGDPQNGK RVSLSVYLGE FFDIHLFVNG TVTQGDQRVS MPYASKGLYL 120 

BTEAGYYKLS GEAYGFVARI DGSGNPQVLL SDRYFNKTCG liCGNFNIPAE DDFMTQBGTL 180 

60 TSDPYDFANS WALSSGEQWC ERASPPSSSC NISSGEMQKO LWEQCQLLKS TSVFARCHPL 240 

VDPEPFVALC EKTLCECAGG LECACPALLE YARTCAQEGM VLYGWTDHSA CSPVCPAGME 300 

YRQCVSPCAR TCQSLHINEM CQKRCVDGCS CPEGQLLDEG LCVBSTECFC VHSGKRYPPG 360 

TSLSRDCNTC ICRNSQWICS NEECPGECLV TGQSHFKSPD HRYFTFSGIC QYLLARDCQD 420 

HSFSIVIETV QCADDRDAVC TRSVTVRLPG LHNSLVKLKH GAGVAMDGQD IQLPLLKGDL 480 

65 RIQHTVTASV RLSYGEDLQM DWDGRORUiV KLSPVYAGKT CGLCGNYNGN QGDOPLTPSG 540 

LAEPRVBDFG NAWKLHGDCQ DLQKQHSDPC ALNPRMTRFS EEACAVLTSP TPEACHRAVS 600 

PLPYIiRNCRY DVCSCSDGRB CLCGALASYA AACAGRGVRV AHREPGRCEL NCPKGQVYLQ 660 

CGTPCNLTCR SLSYPDEECN BACLEGCFCP PGLYMDERGD CVPKAQCPCY YDGE1FQPED 720 

IPSDHHTMCY CEDGFMKCTM SQVPGSLLPD AVLSSPLSHR SKRSLSCRPP MVKLVCPADN 780 

70 LRAEGLECTK TCQNYDLECH SMGCVSGCLC PPGMVRHENR CVALBRCPCF HQGKEYAPGE 840 

TVKIGCNTCV CRDRKWNCTD HVCDATCSTI GMAHYLTFDG LKYLFPGBCQ YVLVQDYCGS 900 

NPGTFRILVG NKGCSHPSVK CKKRVTILVE GGBIBLFDGE VNVKRPMKDE THFEWESGR 960 

YIIUiLGKAL SWWDRHWI SWLKQTYQE KVCGLCGNFD GIQNNDLTSS NLQVEBDPVD 1020 

PGNSWKVSSQ CADTRKVPLD SSPATCHNNI MKQTMVDSSC RILTSDVFQD CNKLVDPBPY 1080 

75 LDVCIYDTCS CESIGDCACP CDTIAAYAHV CAQHQKWTW RTATLCPQSC EBRNLRENGY 1140 

ECEWRYNSCA PACQVTCQHP BPLACPVQCV BGCHAHCPPG KILDELLQTC VDPBDCPVCB 1200 
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10 



15 



20 



25 



VAGRRPASGK 
DISEPPIJIDF 
YHDGSHAYIG 
ALLUASQEP 
SSVDELKQQR 
FVLEGSDKIG 
ILQRVREIRY 
GDIQWPIGV 
SPAPDCSQPL 
IDVPWNWPB 
TDV8VDSVDA 
VTl/JNSFIiHK 
RGLRPSCPNS 
EQDLEVILHN 
NVYGAIMHBV 
GTVTTDWKTL 
AICQQDSCHQ 
DGNVSSCGDH 
CTCLSGRKVN 
ERGLQPTLTN 
STVSCPLGYti 
HQLRVAQCSQ 
WASPBNPCXI 
ACMLNGTVIG 
CGRCLPTACT 
CLAEGGKIMK 
SIDINDVQDQ 



KVTLNPSDPE 
YCSRLIiDLVP 
LKDRKRPSEL 



HCQICHCDW 



DEIVSYLCDL 
EADFNRSKSF 
QGGNRTNTGL 
GPNANVQELE 
DVILLLDGSS 
KAHLLSLVDV 
AADAARSNRV 
LCSOFVRICM 
QSPVKVEETC 
GACSPGARQG 
RPNHLGHIFT 
VQKWTVQRPG 
BQVCBVIASY 
PSEGCFCPPD 
CTTQPCPTAK 
PGECRPNFTC 
ASTATNDCGC 
KPCEDSCRSG 
NECVRVKEEV 
PGKTVMIDVC 
IQLRGGQIMT 
IPGTCCDTCE 
CSCCSPTRTE 



SRIASQVKYA 
VQGUCKKKVI 
APEAPPPTLP 
KSEVTQRMDV 
ALRYIiSDHSP 
RIGWPNAPIL 
SFPASYFDEM 
MQRBGGPSQI 
TVPPIGIGDR 



GCRWTCPCVC 
CMKSIBVKKS 
PTPQNNEPQL 
QTCQPILEEQ 
AHLCRTNGVC 
KVMLBGSCVP 
APTCGLCEVA 
ACRKEECKRV 
TTTTCLPDKV 
PTYVLHEGEC 
PIQQRNVSCP 
TTCRCMVQVG 
LKRDETLQDG 
BPBCNDITAR 
PMQVALHCTN 



NLTCEACOSP 
ABFEVfcKAPV 
GSQVASTSBV 
VIPVGIGPHA 
PHMAQVTVGP 
GQDSIHVTVL 
LVSQGOREQA 
IQDFETLPRE 
KSPAKAFISK 
GDALGPAVRY 
YDAAQLRXLA 
DVWTLPDQCH 
TGSSTRHIVT 
ALSVEliHSDM 
OLSPKTFASK 
CLVPDSSHCQ 
VDWRTPDPCA 
BBACTQCIGE 
RLRQNADQCC 
SPPSCPPHRL 
CVHRBTIYPV 
CGRCLPSACB 
QLEVPVCPSG 
VISQPKLECR 
CDTHPCKVNB 
LQYVKVGSCK 
GSWYHEVLN 



GGLWPPTDA 
VDMMBRLRIS 
IdCYTLFQIFS 
NUCQIRLIEK 
GLLGVSTLGP 
OYSYMVTVBY 
PNLVYMVTGN 
APDLVLQRCC 
ANIGPRLTQV 
LTSEMHGARP 
GPAGDSNWK 
TVTCQPOGQT 
FDGQNFKLTG 
EVTVNGRLVS 
TYGLCGICDB 
VLLLPLPAEC 
MSCPPSLVYN 
DGVQHQFLEA 
PBYECVCDPV 
PTLRKTQCCD 
GQFWEBGCDV 
WTGS PRODS 
PQIiSCKTSAC 
KTTCNPCPLG 
RGBYFWBKRV 
SBVEYDIHYC 
AMECKCSPRK 



PVSPTTLYVE 
QKWVRVAWB 
KIDRPBASRI 
QAPENKAFVL 
KRNSMVLDVA 



PASDEIKRLP 
SGEGLQIPTL 
SVLQYGSITT 
GASKAWILV 
LQRIEDLPTM 
LLKSHRVKCD 
SCSYVLFQNK 
VPYVGGNMEV 
NGANDFMLRD 
HKVLAPATFY 



WVPDHQPCQI 
SCDIiPPVPHC 
BYECACNCVN 
CTCTDMBOAV 
QSSWKflVGSQ 



TGCPPPDEHK 
QGKCASKAMY 
CSK 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



30 Seq ID NO: 150 DHA sequence 

Nucleic Acid Accession #: NM_001508.1 

Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons) 



35 



40 



45 



50 



55 



60 



1 11 21 31 41 

1 I I I I 

ATGGCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA 
CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA 
TTCGTGATGG GOCTTCTGGG GAACAGCGTC ACCATTGGGG TCACCCAGGT 
AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG 
TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGOAA 
ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC 
GCTACGCTGC TGCACGTGCT GACGCTCAGC TTTGAGCGCT ACATCGCCAT 
TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG 
GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA 
GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCACCCO 
CAGCCCGAGA CCTCCAATAT GTCCATCTGT ACCAACCTCT CCAGCCGCTG 
CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC 
ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAGCCAGA AGGGCTCGCT 
AOGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC 
ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT 
ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAGCACGACT GGACGAGGTC 
GCGTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC 
CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA 
TGCCGCCTGT CGCTGCAGCA COCCAACCAC GAGAAGCGCC TGCGCGTACA 
ACCACCGACA GCGCCCGCTT TQTGCAGCGC CCGTTGCTCT TCGCGTCCCG 
TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGOGAGGC 
TCTAAGTCCC AGTCATTGAO TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC 
AATTCTGCTG CAGAGAATGG TTTTCAGGAQ CATGAAGT TT GA 

Seq ID No: 151 Protean, sequence; 
Protein Accession #: NP_001499.1 



51 
I 

CAGTCATGTC 
CCTGATCATC 
GCTGCAGAAG 
CTCGGACATC 
TCCCCTGACC 
CTGCAGCTAC 
CTGTCACOCC 
CTTCOTCTGG 
OTACCCCCTG 
CCACCACGAG 
OACCGTGTTC 
CGTAGCCTTC 
GGCCGGGGGC 
CAGGAGGCAG 
GCCCAACCAG 
CTACTTCCGG 
GGTCATCAAC 
GGTGCTGTGC 
TGCGCACTCC 
GCGCCAGTCC 
CGAGCCCCAG 
GAAACCAGCC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



65 



70 



75 



1 11 21 31 41 51 

I I I I I I 

MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVKGLIjGNSV TIRVTQVIjQK 60 

KGYIiQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY 120 

ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKliilGFVW VTSALVALPL LPAMGTSYPL 180 

VMVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVP QSSIFGAPW YLWLLSVAF 240 

MCWNMMQVLM KSQKGSIAGG TRPPQLRKSE SEESRTARRQ TIIPLRLIW TLAVCWMPNQ 300 

IRRIMAAAKP KHDWTRSYPR AYMXLLPFSE TFFYLSSVTN PLLYTVSSQQ FRRVFVQVLC 360 

CRLSLQHANH EKRLRVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 420 

SKSQSIjSLES lepnsgakpa nsaaengpqe hev 

Seq ID NO: 152 pNfr sequence 
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WO 02/079492 



PCT/US02/04915 



Nucleic Acid Accession 8: none found 

Coding sequence: 3-6S (underlined sequences correspond to start and stop codons) 



1 
I 

TTATTATTTT 
C TTGAA AAGT 
CATOTAATTG 
CACATGTTAT 
CCOGCTCTTT 
TAAATSACAA 
ACATACTACC 
ATACGTTTAA 
GAGAQAAAOA 
AATATTAATA 
GAAGCCAGCT 
CACTATTTTQ 
GTGGTTATAA 
6TAAOCCAAT 
AATCAACTAG 
GTCATOAGTT 
TTGGCCTAGA 
CCCGTTGCCT 
ATGOAAAATT 
ATATTTTGAG 
TGTAATATTT 



11 
I 

GTGTAAACTA 
ATTCCAAGGA 
AAATTCATGC 
CAACCTCGTA 
GGGAATGCTA 
AACACAGCAG 
AGAGCTGCTT 
ATTTGCCCTA 
AAGGAOKAAA 
AATCATAATA 
GTTGGTAGGC 
CCAAAACCAA 
AGGGCATATT 
TTTACTTCTT 
ACAGTGGTTT 
CTTGAATCCC 
GAAGTOGCCA 
TCTGAAAAAC 
AATTTATTAA 
ATAAAATGTT 
AATTATTTTA 



21 
I 

TATTCTGCTT 
ATATTATGAA 
AAGGAAACAA 
ACTTTTGGTG 
CATACCC ATT 
TGTTTTGAGG 
GGAAAATTAA 
GGATTSAGCT 
CAGTOGTAAT 
TGACAAGACC 
ATTAATGAOT 
AGTAATTATA 
TACATAAATT 
TAAAAAGTCT 
GTTAAATTTA 
AGAGAAATAA 
TTTTATCAAC 
AGCAAGTTAT 
TTAATAGCCT 
GAATAAAACC 
TAAGTTTTAT 



31 
I 

ATAGAGAGTC 
AATAGGGCAA 
CTCATAGAAA 
CTTGCTGAAT 
TCTGGTATTT 
GAGAAAGGAC 
AGGCCACTTG 
AACAGCAAAA 
AAAAAAATCC 
CTCACTGAAT 
TTAAAATTGT 
ATACTGTGTC 
CTACFTTATT 
CAATTCAAGC 
GCAGCATACT 
TGCTTAGGAA 
AGGRAAAAAA 
TTCTTTATAT 
ATTATGTGTT 
ATGGATTATA 
AATAAAGTAT 



41 

I 

TCTOAGACTA 
CATGGACTGT 
AGATAAATAT 
CAGTCCATGA 
AAAAAATATC 
CATCATTTAT 
TGGCTTTTTC 
AAAAAAAAAA 
ATCTGTCTTC 
AAGAGTATTT 
TCTCAATTGA 
CTCCTGTAAT 
CCTCAACTTC 
TGGATTAGCC 
TCGTTCCCAT 
CTTCTCTCAA 
AAAATTTTCT 
AATTATCATT 
CTCACTTGCT 
GAGAAAAGTC 
TCCATTTCTT 



AAATTGACAA 


60 


TTAAGATCTC 


120 


GGATGCCCTT 


180 


AAAGCTACAG 


240 


TAGGAGGAGC 


300 


AATGCTCTGT 


360 


CTACCAACTG 


420 


AAAAAAAARA 


480 


TTGCTATGTT 


540 


TCAGTCATCA 


600 


AAAAACATCA 


660 


TTTTTGAGAA 


720 


TTTGATGAAT 


780 


AGCTCAGCAT 


840 


TCTAATTAAA 


900 


TCTGCTTGGC 


960 


CTACTACAAC 


1020 


TTATTATTTT 


1080 


TCTCTAAGTA 


1140 


AAAATATATG 


1200 


TATCTT 





Seq ID Not 153 protein sequence: 
Protein Accession #: none found 



1 11 21 

I I I 

IILCKLYSAY RESLRLKLTT 



31 
I 



51 
I 



Seq ID NO: 154 DNA sequence 
Nucleic Acid Accession #t none found 
Coding sequence: 1-36 (underlined 



sequences correspond to start and stop codons) 



CTGGATQATA 
CTCAAGGTTA 
ATCTTTGAAT 
AAATGAAGAG 
TGTTGTGTAG 
CCCTGTACAA 
TTCCTGGGAG 



11 
I 

TGGAAGAAAT 
AAATAGTTTA 
GGTCCCCTTG 
GAATTCACTA 
TTCTCTGTTC 
CCATACTGCC 
ATATAAAATA 



21 
I 

GGAT0G6TTA 
AGT6CCAGAA 
GTTAAATACT 
ATATGCTACG 
CAGGGCTGCC 
TCTCAACACT 
CATAGGTTTA 



31 



41 



51 
I 



AGGTAAAAGG CTGATCACAG ATGGGTTCCT 60 

GAAAAGGTGG GCACCAGCGA ATTAAGAACC 120 

TAACTTTTGT CATCAGTGTC TGCATTTATG 180 

TGATCTTTTG TTTGTCATGA AAAGAGTTAC 240 

TTTGCTCCAC AAAGCACTGA GAAGCAGTGG 300 

GTGTAATAGG CTAACACCGC CCAGCGAACC 360 
GGCTGGCAAA AAAAAAAAAA AAA 



Seq ID NO i 155 PrQtgjn. sequence: 
Protein Accession #: none found 



LDDMEEMDGL R 



21 
I 



41 

I 



51 
I 



Seq ID NO: 1S6 DNA sequence 

Nucleic Acid Accession #: NH.032961.1 

Coding sequence: 827-3949 (underlined sequences correspond to start and stop codons) 



1 

I 

CAGGCTCAGA 
GGGAAAGAGG 
GCAAGTAACG 
ATAGAGCGAA 
GAAAGTTTTA 
TTAAGCCCTT 
GCATCTCCTT 



11 

I 

GGCTGAAGCA 
CTTGGGAAGA 
GTGGAGA7GA 
AGAGGAAAAA 
AACTGCAGAG 
TTGCATTTGC 
TTCCTTCTCC 



21 
I 

GGAGGAAGGA 
AAACAGCAGA 
GGACAGAGGG 
AATGTCAAGA 
CCGTTCTGTG 
CAGCCGTTGA 
TCTTCCTCTT 



31 
I 

AGGACTGGAA 
AAAGAAACTG 
AACCAAGACT 
AGAACATCCA 
CTTTTCCOGC 
CATrAAGAGG 
CTTCTTCTTC 



41 



51 



GGAAAAAGAG ACAGGTTAGA 60 

CTCATTACAC TTACAGAGAG 120 

CTGAAAGACA AAAAATACAA 180 * 

TCCGGAGAAA TGAAGAGAAT 240 

ACAAAATTAT ATCGCTGATT 300 

CATGTTTAAC GGTGCCAACA 360 

CTCCTCCTCC TCCTCTTTTT 420 
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WO 02/079492 
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CCTCCTCCTC GTTCTCCTCC CATCAGCAAG 
GAAATTTCCT CTTTGGGATT TGCCAOCGCC 
TGTATTATTG TTATTTTATT AATTAGTCAG 
CCTGTCACCC TTCCTGTGCT AAGATTTAAA 
5 AAAA TQAACC AAAAGGAGTA AGATTTTTAA 
CGCACTTTTA TTTOTATTTT TTCAGATTTT 
TGGGTGGCTG ACTGGCTGCQ GGAAGCTACT 
ATTGTTTGCC TTGCTCTGGA TGGTQGAAGG 
OGAGGAGCAQ GAACATGGCA CTTTCGTGOG 

10 TACAAAACTT TCGGCTOGCG GGTTTCAGAC 
CCTCAACCTG GAGACAGGGG TGCTGTACQT 
CAAACAQAGC CCCTCCTGTG TCCTGCACCT 
OTTCCAGGTO GAGATCGAGG TGCTOOACAT 
AGACCTGAOG GTGGAAATCT CTGAGAGCGC 

15 CGCATTCGAC CCAGACOTGG GCACCAACTC 
CTACTTCTCC CTGGACGTGC AGACCCAQGG 
GOAGAAGCCA CTGGACCGAG AGCAGCAAGC 
CGGAGGAGGT GGGGGAGGAG TAGGAGAAGO 
CCCCCAGCAG CAGCGCACCG GCACGGCCCT 

20 CAATGTGCCC OCTTTCGACC AACCCGTCTA 
AGGCACTCTC GTGATCCAGC TCAACGCCAC 
CGTGTACTCC TTCAGCAGCC ACATTTCGCC 
GCGCACTGGC AGACTGGAGG TAAGCGGCGA 
AGTGTACGTG CAAGCCAAGG ACCTGGGCCC 

25 AGTGCGAGTA CTGGATGCTA ATGACAACGC 
AGCGGTGAGT GAGGGCGCGG CGCCCGGCAC 
CGACTCAGAG GAGAATGGGC AGGTGCAGTG 
CAAGTCTTCC TTTAAGAATT ACTACACCAT 
GGGGGACTCC TACACCCTGA CTGTAGTGGC 

30 CAGTAAGTCG ATCCAGGTAC AAGTGTCGGA 
GCCGGTCTAC GACGTGTATG TGACTGAAAA 
GAGCGCCACC GACCGGGATG AOGGCGCCAA 
CCAGATCCAG GGCATGAGCG TCTTCACCTA 
GTACGCCCTG CGCTCCTTCG ACTATGAGCA 

35 CCGGGAOGCT GGCAGCCCCC AGGCGCTGGC 
GGATCAAAAT GACAAOGCCC CTGCCATCGT 
AGOGCX3TGAG GTGCTGCCCC GCTGGGCGGA 
CGTGGACGCC GACGACGGCG AGAACGCCCG 
AATGAACCTC TTTCGCATGG ACTGGCGCAC 

40 GGCCAAGCGC GACCCCCAGC GGCCTTATGA 
GCCGCCCCTT TCCTCCACCG CCACCCTGGT 
CCAGGGCGGG GGCGGGAGCG GAGGCGGAGG 
TGGCGGCGGG GAAACCTCGC TAGACCTCAC 
GTCCTTCATC TTCCTGCTGG CCATGATCGT 

45 GCTCAACATC TATACTTGTC TGGCCAGCGA 
CGGAGGTTCG ACCTGCTGTG GCCGCCAAGC 
AGACATCATG CTGGTGCAGA GCTCCAATGT 
GGAGTCCGGG GGCTTTGGCT CCCACCACCA 
GACCCCTGAG TCCGCCAAGA CCGACCTGAT 

50 TACGGACACT GAGCACAACC CCTGCGGGGC 
TGATATCATC TCCAACGGAA GCATTTTGTC 
CAGCTATCTA GTTGACAGAC CTCGCCGAGT 
AGTAAGCTCT AAGGACAGTG GTCATGGAGA 
CACCAACCGT GCCCAGTCAG CTGGTATOGA 

55 AGCTCTGGGC CACTCAGATC GGTGCTGGAT 
GGCTGCTGAT TATCGCAGCA ATCTGCATGT 
GGTGTTTGAA ACTCCAGAAG CCCAGCCTGG 
AGAGAAGGCC CTTCACAGCA CTCTGGAGAG 
GCGAGCGCCT TACAAACCAC CATATTTGAC 

60 GGACTTACCT GAAGCAGCAT GATTTGCACA 
ACTTCATTAT CTTGGCCATC CAGTTAOTCA 
GTCATCATGG CCAATTATAG GACCTAATTG 
TGTGCAGAAC TGTAGAAACT TTAGAGGCAA 
TGTTTACAGC ACTATCTATC TTTCTCTCTC 

65 ATTCACCACG AGAAGCCAGT CATAAAGATA 
CACTGTTTTA AACTTGACTG TTTTATATTA 
TTCCAACTTT ACAAGAGAAA TTGTGATTAT 
TTGTATTCTG AAGACCCACA AAATATCAAA 
AAGTGTTTAC TGTACTATTT CAAAGCTTCT 

70 TATAATTTTC CTAAAATGTG GTACAACTCA 
CATCATACAA TAAAATAAAA GGTAATTCAG 

TCATTAATAG ttttctccca atttccatat 

AGAAAATGAT GCTCTAAGCT ACAAAATTTT 
AAAGATGTAG CTATTGATGT TATCAGACAG 
75 ACAATCTGCA TAAGTCTGAT TCTATTTCTA 
TTATAAAGAA TCGATAAATT CACCTGTATT 



AAGACAAACC GAGGACAGTC TTGAAATATC 480 

AAGACTGTCG GAATAAAGGA CGCTGACTAT 540 

TGGAAAGATT ACAGATGAGG AAAGGGGACG 600 

AAAAAATGAG GCTGGATTGC GGGAAGCTCT 660 

AGACAGAAAG CCACAGGAGC CCCCACGTAG 720 

TTTTTGTTTC GTGGTGGTGG GGGAGGTGAT 780 

TCCTTTCCTT TTGGAGATGA TTGTGCTATT 840 

AGTCTTTTCC CAGCTTCACT ACACGGTACA 900 

GAATATCGCT GAAGATCTGG GTCTGGACAT 960 

GGTGCCCAAC TCAAGGACCC CTTACTTAGA 1030 

GAAOGAGAAA ATAGACCGCG AACAAATCTG 1080 

GGAGGTCTTT CIGGAGAACC CCCTGGAGCT 1140 

TAATGACAAC CCCCCCTCTT TCCCOGAGCC 1200 

CAOGCCAGGC ACTOGCTTCC CCTTGGAGAG 1260 

CTTGCGCGAC TACGAGATCA CCCCCAACAG 1320 

GGATGGCAAC CGATTCGCTG AGCTGGTGCT 1380 

GGTGCACCGC TACGTGCTGA CCOOGGTGGA 1440 

AGGGGGAGGT GGCGGGGGAG CAGGCCTGCC 1S00 

ACTCACCATC CGAGTGCTGG ACTCCAATGA 1560 

CACTGTGTCC CTACCAGAGA ACTCTCCCCC 1620 

CGACCOGGAC GAGGGCCAGA AOGGTGAGGT 1680 

CCGGGCGOGG GAGCTTTTCG GACTCTCGCC 1740 

GTTGGACTAT GAAGAGAGCC CAGTGTACCA 1800 

CAACGCCGTG CCTGCGCACT GCAAGGTGCT I860 

GCCAGAGATC AGCTTCAGCA CCGTGAAGGA 1920 

TGTGGTGGCC CTTTTCAGCG TGACTGACCG 1980 

CGAGCTACTG GGAGACGTGC CTTTCCGCCT 2040 

CGTTACCGAA GCCCCCCTGG ACCGAGAGGC 2100 

TCGGGACCGG GGCGAGCCTG CGCTCTCCAC 2160 

TGTGAAOGAC AAOGCGCCGC GTTTCAGOCA 2220 

CAACGTGCCT GGCGCCTACA TCTACGCGGT 2280 

CGCCCAGCTT GCCTACTCTA TCCTCGAGTG 2340 

CGTTTCTATC AACTCTGAGA ACGGCTACTT 2400 

GCTGAAGGAC TTCAGTTTTC AGGTGGAAGC 2460 

TGGTAACGCC ACTGTCAACA TCCTCATAGT 2520 

GGCGCCTCTA CCAGGGCGCA ACGGGACTCC 2580 

GCCGGGTTAC CTGCTCACCC GCGTGGCCGC 2640 

GCTCACTTAC AGCATCGTGC GTGGCAACGA 2700 

CGGGGAGCTG CGCACAGCAC GCCGAOTCCC 2760 

GCTGGTGATC GAGGTGCGCG ACCATGGGCA 2820 

GGTTCAGCTG GTGGATGGGG CCGTGGAGCC 2880 

GTCAGGAGAG CACCAGCGCC CCAGTCGCTC 2940 

CCTCATCCTC ATCATCGCGT TGGGCTCGGT 3000 

GCTGGCCGTG CGTTGCCAAA AAGAGAAGAA 3060 

TTCJCTGCCTC TGCTGCTGCT GCTGCGGTGG 3120 

CCGGGCGCGC AAGAAGAAAC TCAGCAAGTC 3180 

ACCCAGTAAC CCGGCCCAGG TGCCGATAGA 3240 

CAACCAGAAT TACTGCTATC AGGTATGCCT 3300 

GTTTCTTAAG CCCTGCAGCC CTTCGCGGAG 3360 

CATCGTCACC GGTTACACCO ACCAGCAGCC 3420 

CAACGAGACT AAACACCAGC GAGCAGAGCT 3480 

TAACAGTTCT GCATTCCAGG AAGCCGACAT 3540 

CAGTGAACAG GGAGATAGTG ATCATGATGC 3600 

TCTCTTCTCC AATTGCACTG AGGAATGTAA 3660 

QCCTTCTTTT GTCCCTTCTG ATGGACGCCA 3720 

TCCTGGCATG GACTCTGTTC CAGACACTGA 3780 

GGCAGAGCGG TCCTTTTCCA CCTTTGGCAA 3840 

GAAGGAGCTG GATGGACTGC TGACTAATAC 3900 

ACGGAAAAGG ATATGCTAGT CAATTCTACA 3960 

AAGTCGACCA ACAAAAGCAT CAACTTTTCA 4020 

TGTGTAACTG AGTATTAGAT TTCGGATGGA 4080 

CTCTCAGCAG GCCTGAGAAA TGAGTTGAAA 4140 

CAGATTTTGC CTCCCCGATC AGTGTGTGCC 4200 

CAAATGTCAC TGAGCCCTTT AGATGTTTAT 4260 

AAGGAAATTT GTGCATTATA AATGCAATAT 4320 

TTTTTGTGTG ATCAAGTGTT CCGCAAGCTA 4380 

QTTCTTTTCA CCTGTGGGTT ATAAAAAATG 4440 

GACATTCTGT AGTTTATACA CCGTGTTGCA 4500 

AAATAAATAT AAAATATATA TATTATATTA 4560 

QTTGGTTTTT AAATGGATGC ATACAGTCCA 4620 

GGTCCCAAAG. ACAAACTTAC TAAGAAAAAA 4680 

CTTACTCAAC CGTGTTTTTC CTTGTTTAAA 4740 

GTCAAAAACT CATATTGAAT TTTCAATGCC 4800 

AGCACTGACT ATGTACTATC AAACTATCTA 4860 

TGACTTTGAA TTTAGAATCA CTTAAAGCTT 4920 

TGTTGTTAGA AAAAAACTGG GTGTCTGTAC 4980 
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A TTT T QTO OT GTAAAATATG TAATTGAAGA TTACTATTTT AAGAAGTCAT CAGTCATATC 5040 

ACTCACACAQ AATTTTATTT TACATAOTTT TGTGACTTAA TTACACATQA ATATAAAATC 5100 

TATAATTCTA TATGAATATA TAGAGATATA OAAACATCTG AACTGGTAAA GAATAACTAT 5160 

AAAATATGAA AGCTCTAAAT TTAAAATAAA TTTAGAGATA GAATCATGGT ACATTATTGT 5220 

TTCAGTATTC CATGTAAAAA TTTTATAGCT TAAATGTAGT CAGTGTTTGA TTAATGAAAA 5280 

AATTCTTCAT GAGTCAGCCT TCAAAAGTTA AGCTTGCCTT TTACTTTTAT GTCAACAATA 5340 
TTAATTATTA AATTTAGTAA GACGCAAAAA AAAAAAAAAA AAAA 

Seq ID No: 157 Protein sequence* 
Protein Acceosion fh NP_116586.1 



1 11 21 31 41 51 

I I I I I I 

NXVLLLFALL WMVBGVFSQL HYTVQEBQEH GTFVGNIABD LGLDITKLSA ROFQTVPNSR 60 

TPYLDLNLET GVLYVNBKID REQICKQSPS CVLHLEVPLE NPLELFQVEI EVLDINDNPP 120 

SPPEPDI/TVB ISESATPGTR PPLESAPDPD VGTNSLRDYE ITPNSYFSLD VQTQGDGNRF 180 

ABLVLBKPLD REQQAVHRYV LTAVDGGGGG GVGEGGGGGG GAGLPPQQQR TGTALLTIRV 240 

LDSNDNVPAP DQPVYTVSLP ENSPPGTLVI QUJATDPDBG QNGBWYSFS 8HISPRAREL 300 

PGL8PRTGRL BVSGRLDYEE SPVYQVYVQA KDLGPNAVPA HCKVLVRVLD ANDNAPEISP 360 

STVKEAVSEG AAPGTWALF SVTDRDSEEN GQVQCELLOD VPPRLKSSFK NYYTIVTEAP 420 

LDREAGDSYT LTWARDRGE PALSTSKSIQ VQVSDVNDNA PRFSQPVYDV YVTENNVPGA 480 

YIYAVSATDR DEGANAQLAY SILECQIQGM SVFTYVSINS ENGYLYALRS FDYEQLKDFS 540 

FQVBARDAGS PQALAGNATV NILIVDQNDN APAIVAPLPG RNGTPAREVL PRSABPGYLL 600 

TRVAAVDADD GBNARLTYSI VRGNEMNLFR MDWRTGELRT ARRVPAKRDP QRPYELVIEV 660 

RDKGQPPLSS TATLWQLVD GAVEPQGGGG SGGGGSGEHQ RPSRSGGGET SLDLTLILII 720 

ALGSVSPIPL LAMIVIAVRC QKEKKLNIYT CLASDCCLCC CCCGGGGSTC CGRQARARKK 780 

KliSKSDIKLV QSSNVPSNPA QVPIEESGGP GSHHHNQNYC YQVCLTPESA KTDLMFLKPC 840 

SPSRSTDTEH NPCGAIVTGY TDQQPDIISN GSILSNETKH QRAELSYLVD RPRRVNSSAF 900 

QEADIVSSKD SGHGOSEQGD SDHDATNRAQ SAGKDLFSNC TEBCKALGHS DRCWMPSPVP 960 

SDGRQAADYR SNLHVPGMDS VPDTEVFETP EAQPGAERSF STFGKEKALH STLERKBLDG 1020 
LLTNTRAPYK PPYUTRKRIC 

Seq ID NO: 159 DNA sequence 

Nucleic Acid Accession #: NM_022159.1 

Coding sequence: 70-1890 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

i I I 1 I I 

GTGAAATTTA AACTCCAGTC CTGTGGCGAA AATGCTAATT GCACTAACAC AGAAGGAAGT 60 

TATTATTG TA TGT GTGTACC TGGCTTCAGA TCCAGCAGTA ACCAAGACAG GTTTATCACT ' 120 

AATGATGGAA CCGTCTGTAT AGAAAATGTG AATGCAAACT GCCATTTAGA TAATGTCTGT 180 

ATAGCTGCAA ATATTAATAA AACTTTAACA AAAATCAGAT CCATAAAAGA ACCTGTGGCT 240 

TTGCTACAAG AAGTCTATAG AAATTCTGTG ACAGATCTTT CACCAACAGA TATAATTACA 300 

TATATAGAAA TATTAGCTGA ATCATCTTCA TTACTAGGTT ACAAGAACAA CACTATCTCA 360 

GCCAAGGACA CCCTTTCTAA CTCAACTCTT ACTGAATTTG TAAAAACCGT GAATAATTTT 420 

GTTCAAAGGG ATACATTTGT AGTTTGGGAC AAGTTATCTG TGAATCATAG GAGAACACAT 480 

CTTACAAAAC TCATGCACAC TGTTGAACAA GCTACTTTAA GGATATCCCA G AGCTTC CAA 540 

AAGACCACAG AGTTTGATAC AAATTCAACG GATATAGCTC TCAAAGTTTT CTTTTTTGAT 600 

TCATATAACA TGAAACATAT TCATCCTCAT ATGAATATGG ATGGAGACTA CATAAATATA 660 

TTTCCAAAGA GAAAAGCTGC ATATGATTCA AATGGCAATG TTGCAGTTGC ATTTTTATAT 720 

TATAAGAGTA TTGGTCCTTT GCTTTCATCA TCTGACAACT TCTTATTGAA ACCTCAAAAT 780 

TATGATAATT CTGAAGAGGA GGAAAGAGTC ATATCTTCAG TAATTTCAGT CTCAATGAGC 840 

TCAAACCCAC CCACATTATA TGAACTTGAA AAAATAACAT TTACATTAAG TCATCGAAAG 900 

GTCACAGATA GGTATAGGAG TCTATGTGCA TTTTGGAATT ACTCACCTGA TACCATGAAT 960 

GGCAGCTGGT CTTCAGAGGG CTGrGAGCTG ACATACTCAA ATGAGACCCA CACCTCATGC 1020 

CGCTGTAATC ACCTGACACA TTTTGCAATT TTGATGTCCT CTGGTCCTTC CATTGGTATT 1080 

AAAGATTATA ATATTCTTAC AAGGATCACT CAACTAGGAA TAATTATTTC ACTGATTTGT 1140 

CTTGCCATAT GCATTTTTAC CTTCTGGTTC TTCAGTGAAA TTCAAAGCAC CAGGACAACA 1200 

ATTCACAAAA ATCTTTGCTG TAGCCTATTT CTTGCTGAAC TTGTTTTTCT TGTTGGGATC 1260 

AATACAAATA CTAATAAGCT CTTCTGTTCA ATCATTGCCG GACTGCTACA CTACTTCTTT 1320 

TTAGCTGCTT TTGCATGGAT GTGCATTGAA GGCATACATC TCTATCTCAT TGTTGTGGGT 1380 

GTCATCTACA ACAAGGGATT TTTGCACAAG AATTTTTATA TCTTTGGCTA TCTAAGCCCA 1440 

GCCGTGGTAG TTGGATTTTC GGCAGCACTA GGATACAGAT ATTATGGCAC AACCAAAGTA IS 00 

TGTTGGCTTA GCACCGAAAA CAACTFTATT TGGAGTTTTA TAGGACCAGC ATGCCTAATC 1560 

ATTCTTGTTA ATCTCTTGGC TTTTGGAGTC ATCATATACA AAGTTTTTCG TCACACTGCA 1620 

GGGTTGAAAC CAGAAGTTAG TTGCTTTGAG AACATAAGGT CTTGTGCAAG AGGAGCCCTC 1680 

GCTCTTCTGT TCCTTCTCGG CACCACCTGG ATCTTTGGGG TTCTCCATGT TGTCCACGCA 1740 

TCAGTGGTTA CAGCTTACCT CTTCACAGTC AGCAATGCTT TCCAGGGGAT GTTCATTTTT 1800 

TTATTCCTGT OTOTTTTA TC TAGAAAGATT CAAGAAGAAT ATTACAGATT GTTCAAAAAT 1860 

GTCCCCTGTT GTTTTGGATG TTTAAGGTAA ACATAGAGAA TGGTGGATAA TTACAACTGC 1920 

ACAAAAATAA AAATTCCAAG CTGTGGATGA CCAATGTATA AAAATGACTC ATCAAATTAT 1980 

CCAATTATTA ACTACTAGAC AAAAAGTATT TTAAATCAGT TTTTCTGTTT ATGCTATAGG 2040 

AACTGTAGAT AATAAGGTAA AATTATGTAT CATATAGATA TACTATGTTT TTCTATGTQA 2100 

AATAGTTCTG TCAAAAATAG TATTGCAGAT ATTTGGAAAG TAATTGGTTT CTCAGGAGTG 2160 
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ATATCACTGC ACCCAAGGAA AGATTTTCTT TCTAACACGA GAAQTATATQ AATGTCCTGA 2220 

AQGAAACCAC TGGCTTGATA TTTCTGTGAC TCGTGTTGCC TTTGAAACTA GTCCCCTACC 2280 

ACCTOQGTAA TGAGCTCCAT TACAGAAAGT GGAACATAAG AGAATGAAGQ GGCAGAATAT 2340 

CAAACAOTQA AAAQOGAATG ATAAGATGTA TTTTGAATGA ACTGTTTTTT CTGTAGACTA 2400 

GCTGAGAAAT TCTTGACATA AAATAAAGAA TTGAAGAAAC ACATTTTACC ATTTTGTGAA 2460 

TTGTTCTGAA CTTAAATGTC CACTAAAACA ACTTAGACTT CTGTTTGCTA AATCTOTTTC 2520 
TTTTTCTAAT ATTCTAAAA 

Seq ID No: 1S9 gypteifl sequence j 
Protein Accession #: NPJ>71442.1 



1 11 21 31 41 51 

I I ] I I I 

MCVPQFRSSS NQDRFITNDG TVCIENVNAN CKLDNVCIAA NINKTLTKIR SIKEPVALLQ 60 

EVYRNSVTDL SPTDIITYIE ILAESSSLDQ YKNNTISAKD TLSNSTLTEP VKTVNNFVQR 120 

DTFWWDKLS VNHRRTHLTK LMHTVEQATL RIBQSFQKTT EPDTNSTDIA LKVPPFDSYN 180 

MKHIHPHMNM DGDYINIPPK RKAAYDSNGN VAVAFLYYKS IGPLLSSSDN FLLKPQNYDN 240 

SBEEERVISS VISVSMSSNP PTLYELEKIT FTLSHRJCVTD RYRSLCAFWN YSPDTMNGSW 300 

8SEGCELTYS NETHTSCRCN HLTHPAILMS SGPSIGIKDY NZLTRITQLG IIISLICLAI 360 

CIFTPWPPSE IQSTRTTIHK NLCCSLFLAE LVPLVGINTN TNKLPCSIIA GLLHYPFLAA 420 

FAWMCIEGIH LYLIWGVIY NKGFLHKNPY IFGYLSPAW VGFSAALGYR YYGTTKVCWL 480 

STENNFIWSF IGPACLIILV NLLAPGVIIY KVPRHTAGLK PBVSCPEN1R SCARGALALL 540 

FLLGTTWIFG VIiHWKASW TAYLPTVSNA FQGMFIFLPL CVLSRKIQEH YYRLFKNVPC 600 
CFGCLR 



Seq ID KO: 160 pffl p«?qMem;e 

Nucleic Acid Accession #: none found 

Coding sequence: 1-216 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I 1 

TGTCTGCTTA TGCGGTGGCT CGCTGCTCAG AACAGGATGG CAGAGATGAG CACCACCATC 60 

AAAAACTCAA GGACCAGTGC TGTGGGTCCA GTCATCTGTT TCATGGAATT CACCAGTCTG 120 

GTATCTTCAA AATCCAOAAG GATGATGGCA GATGGCAGGA AGGAGGAAGA GGGTAATCTG 180 

GAAGAGTTTC CTGACCTACT CTGCTGCTGT GATTAAACAA CCACCAGGAA ATTTTGATGA 240 

CACTGTTCTC CTGAGCTCCT CCCTTTCCTC GGGGAAGAAA AGCATTGAAA CTACAAAAAT 300 

AAAGTGTTAT TTGGCTGGAG TGAGGTCTCA TGTCTGCTTA TGCGGTGGCT CGCTGCTCAG 360 

AACAGGGAAC CATTGGAGAT ACTCATTACT CTTTGAAGGC TTACAGTGGA ATGAATTCAA 420 

ATACGACTTA TTTGAGGAAT TGAAGTTGAC TTTATGGAGC TGATAAGAAT CTTCTTGGAG 480 

AAAAAAAGAC TGGTACTTCT GAATTAACCA AAATCACAGT ATTCTGAAGA TGATTCTACA 540 

AAGCCTGCTG TTTCTACAAA GGCTGCTGAT GATTTCTACA AAGCCTGCTG TAGTGTTGCT 600 

GTGGCCTCTG CTTAAAAAAG TAGAAAACAC ATTGATGCAG CATGTTCACC CCAACCTCCC 660 

TGCCTAAAGG CTCAOGGACC ATCTTGGAAG AGGAAGGCGC GTGAGATTGT AAGAGCCGAA 720 

TTAGGGGGAT GGAGTGTGGA GAATAAGGAC ACTTCATCTT GGATGCTCAC CTGCCAAATT 780 

GACTTCTGAT GAAAGCCAGC TCCAGAAATG TGCCTACAGT TACTACTTTC ACCTAAACCC 840 

TGCCCTTAGT CAAATCCTTC TCTTCTTCTA AGCAATCAAC TTCAATTCCT TGTATAACCC 900 

ACAGTATAAA AGGGCTTTTA TACCATTCTA TCCTATTGCA TGTAAGCCTT GGGTCTGGGA 960 

GGTAACAGTG TGGGATTCCA CCATCTCATC TCCCTGCCAC CCAAACATGC CTGCTCTTCT 1020 
TTAAGCAATA TTAAATGTTT GTACTTCA 

Seq ID No: 161 protein sequence; 
Protein Accession 8: none found 



1 11 21 31 41 51 

I I I I 1 I 

CLLMRWIAAQ NRMAEMSTTI KNSRTSAVGP VTCFMBFTSL VSSKSRRHMA DGRKBEEGNL 60 
EEFPDLLCCC D 

Seq ID NO: 162 Dflft seoyence 

Nucleic Acid Accession ft: none found 

Coding sequence: 1-159 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 I.I I I I 

GAGACCCTCC AGAGGCAGGG CCCAGGATTG AAGAGGGAAG CCCTGCTCCA CACGTGTTCA 60 

TCAGGAAGGA CCCACAGACT GCTGCTCCTG GAGGCCTCTC GGTTTATGGA TGTGTOTTTG 130 

TTCCATAAAC CCTCAGAGGG TCACCTGGAG ACCCGCTAAA ATGCAGGTTC TTGGGCCACA 180 

TCCTAGACCT TCTGACCGAC CCAGGGAGTG GGGCCCAGGA AGCTGCATTT GACAGATATC 240 

CCCGTGTGAT CATCATGCAC ACAGGAGTGA GAGAACCAGT GTTCTCCCCG GGCAGAAGGG 300 

AAGCTCGTGT GCAGGACACC TCACACCTCC TTTCCCATTC CCCTGCCAGG CTCTCCCTGC 360 

TGACATTGTT TTTGCGGGAG AGCTGTGAAT TCTGAAGATT AGGTTGCTTC TCACCCCAAG 420 
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CTCCAGAAGT CCAGGCTGAG CCAAACCAAG CTTCAAGTTG TGCCTGGACT TGGAGAACCA 480 

GGAGGTGAGG GGACTGACTA CTTGAAGATC ACATGGAGGA GGAQTCTGAT CCAGGCCCAG 540 

GCACCAAGGA AAGGCCATGC AAGGACACAG GGAGAAGGGC AGCTGTCTGT AAGCCAGAAA 600 

GAGCCTTCAC TAGAAACCAA ATCAGCCAGA ACCTTCATCT TOGACTTTCC AGCCTTCAGA 660 
QATGTGAAAA AATAAATTTC TGTTOATTAA CCTAAAAAA 



Seq ID No i 163 Protein sequence i 
Protein Accession 8: none found 



1 11 21 31 41 51 

I I I I ) I 

ETLQRQGPOL KREAIiLHTCS SGRTHRLLLL EASRFHDVCL FHKPSEGHLB TR 

Seq ID NOs 164 PNA sequence , 

Nucleic Acid Accession 8: NM_020241.1 

Coding sequence: 4-1557 (underlined sequences correspond to start and stop codons) 



GC CATG CAGA 
CTGGGGGG.CG 
GACTACCTQA 
GAAGGTGCTG 
GGGQACAGGG 
TACCAGAGGA 
GGCAAACAGG 
ACGCTCTTTG 
ACCCTGCAGC 
CACGCCAATG 
CTAGCCATTG 
AAACATQACT 
CATOTCTACT 
GTGTCCCGCG 
AAGCAGTGGA 
TTCTACTTCA 
QTCCTGGCCG 
GACCTGACAC 
TCCATCTGGA 
GCCCCCGGGA 
AAGACCCACC 
CGGACCCTGA 
GGCAACCAGA 
CGGCCCAATG 
AGGGTGTGTG 
CGACGCTGGG 
CCCCCCACTC 
AGGGCCTGCC 
CGGCGAAGGT 
GCCACCCGTC 
G 



11 
1 

CCCCQCGAGC 
CCCACGGCCT 
ACCACTATCC 
ACGACCTCAA 
ACAACCTCTA 
AGCTGACCTG 
AGGGCGAGTG 
TGTGCGGTTC 
CCGTCGGAGA 
TTGCCCTCTT 
ATGCTGTCAT 
CCAAGTGGTT 
TCTTCTTCCG 
TGGCCCGAGT 

ACGTGCTGCA 
TTTTTTCCAC 
AGGTGGCAGC 
CGCCGGTGCC 
TGCAGTACAA 
CTCTGATGGA 
TGAGGCACCA 
CCGTTGTCTT 
CCAGCACCTC 
TCCACGAGCG 
GCTTCCAGAA 
TGCAGAGGGA 
CGGAAGTCAC 
GGGTGGGGCC 
CCCTTGTGAC 



21 
I 

GTCCCCTCCC 
CTTTCCTGAG 
CGTGTTTGTG 
CATCCAGCGA 
CCGOGTAGAG 
GAGATCTAAC 
TCGAAACTTC 
CAACGCCTTC 
CAACATCAGC 
CTCTGACGGG 
CTACCGCAGC 
CAAAGAGCCT 
GGAGATTGCG 
GTGCAAGAAC 
GAAGGCGCGG 
GGCTGTCACG 
GCCCAGCAAC 
TGTGTTTGAA 
GGAGGATCA8 
TGCCTCCAGC 
CGAAGCGGTO 
GCTGACTCGA 
CCTGGGTTCT 
AGGGACGTCT 
ACGATCGTGG 
GGCCCGGGGG 
AGCGGGGACA 
ATCGGCAGCA 
CCTCTGTAAA 
CTCCCCCCTC 



31 
I 

CGCCCGGCCC 
GAGCCGCCGC 



GTCCTGCGGG 
TTGGAGCCCC 
CCCAGCGACA 
GTAAAGGTGC 
AACCCGGTGT 
GGTATGGCCC 
ATGCTCTTCA 



TACTTTGTCC 
ATGGAGTTTA 
GACGTGGGAG 
CTCAACTGCT 
GGOGTGGTCA 
AGCATCCCTG 
GGCCGCTTCC 
GTGCCTCGAC 
GCCTTGCCGG 
CCCTCGCTGG 
GTGGCTGTGG 
GAGQCGGGGA 
GGGCGTGTGT 
TGGCCCCAGC 
CCTCCGAGGT 
ATGCCGGGGT 
GCTGTCTAAA 
TACGGCCCCA 
TGACCTCCAG 



41 
I 

TCCTGCTTCT 
CGCTTAGCGT 
CCGGAOGCCT 
TCAACAGGAC 
CCACGTCCAC 
TAAACGTGTG 
TGCTCCTTOG 
GCGCCAACTA 
GCTGCCCGTA 
CAGCTACTGT 
GGCCCACCCT 
ATGOGGTGGA 
ACTACCTGGA 
GCTCCCCCOG 
CTGTACCCGG 
GCCTCGGGGG 
GCTCGGCTGT 
GAGAGCAGAA 
CCCGGCCCGG 
ATGACATCCT 
GCCATGCGCC 
ACGTGGGAGC 
CGGTCCTCAA 
GTCAAGTGGG 
GGCCTGGGCG 
GCCGGTTAGG 
TTCAGGCAGG 
GGGCTTGGGG 
GGGTGGTGAG 
CTGACCATGC 



SI 
I 

GCTGCTGCTA 
GGCCCCCAGG 
GACCCCCGCA 
GCTGTTCATT 
GGAGCTGOGG 
TOGGATGAAG 
GGACGAGTCC 
CAGCATAGAC 
CGACCCCAAG 
TACCGACTTC 
GOGCACCGTG 
GTGGGGCAGC 
GAAGGTGGTG 
CGTGCTGGAG 
AGACTCCCAT 
CCGGCCCGTG 
CTGCGCCTTT 
GTCCCCCGAG 
GTGCTGCGCA 
CAACTTTGTC 
CTGGATCCTG 
CGGCCCCTGG 
GTTCCTCGTC 
CCACGCGTGC 
TTGGCTGAGC 
AGTTTGAACC 
AOACACGAGG 



AGAGTCCCAT 
ATGCCACGTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 



Seq ID No; 165 Protean sequence; 
Protein Accession #: NP 064626.1 



MQTPRASPPR 
GADDLNTQRV 
KQEGECRNFV 
ANVALFSDGH 
VYFFFREIAM 
YFNVLQAVTG 
IWTPVPEDQV 
TLMRHQLTRV 
VCVHERRSWW 



11 
1 

PAT.T.TjTfT.T.T.T. 

LRVNRTLPIG 
KVUiLRDEST 
ItPTATVTDFL 
BFNYLEKWV 
WSLGGRPW 
PRPRPGCCAA 
AVDVGAGPWG 
PQRPGRWLSR 



21 
I 

GGAHGLFPBE 
DRDNLYRVEL 
LFVCGSNAFN 
AIDAVTYRSL 
8RVARVCKND 
LAVFSTPSNS 
PGMQYNASSA 
NQTWFLGSB 
RHGFQKARGP 



31 

I 

PPPLSVAPRD 
EPPTSTELRY 
PVCANYSIDT 
GDRPTLRTVK 
VGGSPRVLEK 
IPGSAVCAPD 
LPDDILNFVK 
AGTVLKFLVR 
PRCRLGV 



41 
I 

YLNHYPVFVG 
QRKLTWRSNP 
LQPVGDNISG 
HDSKWFKBPY 
QWTSFLKARL 
LTQVAAVFEG 
THPLMDBAVP 
PNASTSGTSG 



51 
I 

SGPGRLTPAE 
SDINVCRMKQ 
MARCPYDPKH 
FVHAVBWGSH 
NCBVPGDSHF 
RFREQKSPES 
SLGHAPWILR 
RVCQVGKACR 



60 
120 
180 
240 
300 
360 
420 
480 



Seq ID NO: 166 pNA. sequence, 

Nucleic Acid Accession #: NH_032108.1 

Coding sequence! 39-270S (underlined sequences correspond to start and stop codons) 



51 



1 11 21 31 41 

I I I I I 1 

TCCGAGGCGT CACCTCCTCC TGTCGCCTGG CCCTCGC CAT Q CAGACCCCG CGAGCGTCCC 60 

CTCCCCGCCC GGCCCTGCTG CTTCTGCTGC TGCTACTGGG GGGCGCCCAC GGCCTCTTTC 120 
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CTGAGGACCC OCCGCCGCTT AGCGTGGCCC CCAGGGACTA CCTGAACCAC TATCCCOTCT 180 

TTGTGGGCAG CGGGCCCGGA CGCCTGACCC CCGCAGAAGG TGCTGACGAC CTCAACATCC 240 

AGCGAGTCCT GCGGGTCAAC AGGACGCTGT TCATTGGGGA CAGGGACAAC CTCTACOGOG 300 

TAGAGCTGGA GCCCCCCAOG TCCACGGAGC TGCGGTACCA GAGGAAGCTG ACCTGGAGAT 360 

5 CTAACCCCAG CGACATAAAC GTGTGTCGOA TGAAGGGCAA ACAGGAGGGC GAGTGTCGAA 430 

ACTTCGTAAA GGTGCTGCTC CTTOGGGAGG AGTCCACGCT CTTTGTGTGC GGTTCCAACG 480 

CCTTCAACCC GGTGTGCGCC AACTACAGCA TAGACACCCT GCAGCCCOTC GGAGACAACA 540 

TCAGCGGTAT GGCCCGCTGC CCGTACGACC CCAAGCAOGC CAATGTTGCC CTCTTCTCTG 600 

AOGGGATGCT CTTCACAGCT ACTGTTACCG ACTTCCTAGC CATTGATGCT GTCATCTACC 660 

10 GCAGCCTCGG GGACAGGCCC ACCCTGCGCA CCGTGAAACA TGACTCCAAG TGGTTCAAAG 720 ' 

AGCCTTACTT TGTCCATGOG GTGGAGTGGG GCAGCCATGT CTACTTCTTC TTCOGGGAGA 780 

TTGCGATGGA GTTTAACTAC CTGGAGAAGG TGGTGGTGTC CCGCQTGGCC OGAGTGTGCA 840 

AGAACGACGT GGGAGGCTCC CCCCGCGTGC TGGAGAAGCA GTGGAOGTCC TTCCTGAAGG 900 

CGCGGCTCAA CTGCTCTGTA CCCGGAGACT CCCATTTCTA CTTCAACGTG CTGCAGGCTG 960 

15 TCACGGGCGT GGTCAGCCTC GGGGGCCGGC CCGTGGTCCT GGCCGTTTTT TCCACGCCCA 1020 

GCAACAGCAT CCCTGGCTCG GCTGTCTGCG CCTTTGACCT GACACAGGTG GCAGCTGTGT 1080 

TTGAAGGCCG CTTCCGAGAG CAGAAGTCCC CCGAGTCCAT CTGGACGCCG GTGCCGGAGG 1140 

ATCAGGTGCC TCGACCCCGG CCCGGGTGCT GCGCAGCCCC CGGGATGCAG TACAATGCCT 1200 

CCAGCGCCTT GCCGGATGAC ATCCTCAACT TT3TCAAGAC CCACCCTCTQ ATGGACGAGG 1260 

20 CGGTGCCCTC GCTGGGCCAT GCGCCCTGGA TCCTGCGGAC CCTGATGAGG CACCAGCTGA 1320 

CTCGAGTGGC TGTGGACGTG GGAGCCGGCC CCTGGGGCAA CCAOACOGTT GTCTTCCTGG 1380 

GTTCTGAOGC GGGGACGGTC CTCAAGTTCC TCGTCCGGCC CAATGCCAGC ACCTCAGGGA 1440 

CGTCTGGGCT CAGTGTCTTC CTGGAGGAGT TTGAGACCTA CCGGCCGGAC AGGTGTGGAC 1500 

GGCCCGGCGG TGGCGAGACA GGGCAGCGGC TGCTGAGCTT GGAGCTGGAC GCAGCTTOGG 1560 

25 GGGGCCTGCT GGCTGCCTTC CCCCGCTGCG TGGTCCGAGT GCCTOTGGCT CGCTGCCAGC 1620 

AGTACTCGGG GTGTATGAAG AACTGTATCG GCAGTCAGGA CCCCTACTGC GGGTGGGCCC 1680 

CCGACGGCTC CTGCATCTTC CTCAGCCCGG GCACCAGAGC CGOCTTTGAG CAGGAOGTGT 1740 

CCGGGGCCAG CACCTCAGGC TTAGGGGACT GCACAGGACT CCTGOGGGCC AGCCTCTCCG 1800 

AGGACCGCGC GGGGCTGGTG TCGGTGAACC TGCTGGTAAC GTCGTCGGTG GCGGCCTTCG 1860 

30 TGGTGGGAGC CGTGGTGTCC GGCTTCAGCQ TGGGCTGGTT CGTGGGCCTC CGTGAGCGGC 1920 

GGGAGCTGGC CCGGCGCAAG GACAAGGAGG CCATCCTGGC GCACGGGGCG GGCGAGGCGG 1980 

TGCTGAGOGT CAGCCGCCTG GGCGAGCGCA GGGCGCAGGG TCCOGGGGGC CGGGGCGGAG 2040 

GCGGTGGOGG TGGCGCCGGG GTTCCCCCGG AGGCCCTGCT GGCOCCCCTG ATGCAGAACG 2100 

GCTGGGCCAA GGCCACGCTG CTGCAGGGCG GGCCCCAOGA CCTGGACTOG GGGCTGCTGC 2160 

35 CCACGCCCGA GCAGACGCCG CTGCCGCAGA AGCGCCTGCC CACTCCGCAC CCGCACCCCC 2220 

ACGCCCTGGG CCCCCGCGCC TGGGACCACG GCCACCCCCT GCTOCCGGCC TCCGCTTCAT 2280 

CCTCCCTCCT GCTGCTOGCG CCCGCCCGGQ CCCCCGAGCA GCCCCCCGCG CCTGGGGAGC 2340 

CGACCCCCGA OGGCCGCCTC TATGCTGCCC GGCCCGGCGG CGCCTCCCAC GGCGACTTCC 2400 

CGCTCACCCC CCACGCCAGC CCGGACCGCC GGCGGGTGGT GTCCGCGCCC ACGGGCCCCT 2460 

40 TGGACCCAGC CTCAGCCGCC GATGGCCTCC CGCGGCCCTQ GAQCCCGCCC CCGACGGGCA 2520 

GCCTGAGGAG GCCACTGGGC CCCCACGCCC CTCCGGCCGC CACCCTGCGC CGCACCCACA 2580 

CGTTCAACAG CGGCGAQGCC CGGCCTGGGG ACCGCCACCG CGGCTGCCAC GCCCGGCCGG 2640 

GCACAQACTT GGCCCACCTC CTCCCCTATQ GGGGGGCGGA CAGGACTGCG CCCCCCGTGC 2700 

CCTAGGCCGG GGGCCCCCCG ATGCCTTGGC AGTGCCAGCC ACGGGAACCA GGAGCGAGAG 2760 

45 ACGGTGCCAG AACGCCGGGG COCGGGGCAA CTCCGAGTGG GTGCTCAAGT CCCCCCOGCG 2820 

ACCCACCCGC GGAGTGGGGG GCCCCCTCCG CCACAAGGAA GCACAACCAG CTCGCCCTCC 2880 

CCCTACCCGG GGCCGCAGGA CGCTGAGACG GTTTGGGGGT GGGTGGGCGG GAGGACTTTG 2940 

CTATGGATTT GAGGTTGACC TTATGCGCGT AGGTTTTGGT TTTTTTTGCA GTTTTGGTTT 3000 

CTTTTGCGGT TTTCTAACCA ATTGCACAAC TCCGTTCTCG GGGTGGCGGC AGGCAGGGGA 3060 

50 GGCTTGGACG CCGGTGGGGA ATGGGGGGCC ACAGCTGCAG ACCTAAGCCC TCCCCCACCC 3120 

CTGGAAAGGT CCCTCCCCAA CCCAGGCCCC TGGCGTGTGT GGGTGTGCGT GCGTGTGCGT 3180 

GCCGTGTTCG TGTGCAAGGG GCCGGGGAGG TGGGCOTGTG TQTGCGTGCC AGCGAAGGCT 3240 

GCTGTGGGCG TGTGTGTCAA GTGGGCCACG CGTGCAGGGT OTGTGTCCAC GAGCGACGAT 3300 

CGTGGTGGCC CCAGCGGCCT GGGCGTTGGC TGAGCCGACG GTOGGGCTTC CAGAAGGCCC 3360 

55 GGGGGTCTCC GAGGTGCCGO TTAGGAGTTT GAACCCCCCC CACTCTGCAG AGGGAAGCGG 3420 

GGACAATGCC GGGGTTTCAG GCAGGAGACA CGAGGAGGGC CTGCCCGGAA GTCACATCGG 3480 
CAGCAGCTGT CTAAAGGGCT TGGGGGCCTG GGGGGCGGCG AAAG 



60 Seq ID No: 167 Protein gequeqce { 
Protein Accession #i NP_115484.1 

1 11 21 31 41 51 

_ I I I I I I 

65 KQTPRASPPR PALTiTiTiTiLLL GGAHGLFPEO PPPLSVAPRD YLNHYPVFVG SQPGRLTPAE 60 

GADDLNIQRV LRVNRTLPIG DRDNLYRVBL EPPTSTELRY QRKLTWRSHP SDINVCRMKG 120 

KQEGECRNFV KVLLLRDBST LFVCGSNAFN PVCANYSIDT LQPVGDNISG NARCPYDPKH 180 

AKVALFSDGM LPTATVTDFL AIDAVIYRSL GDRPTLRTVK HDSKWFKEPY PVHAVBWGSH 240 

VYFFPREIAM EFNYTiEKWV SRVARVCKND VGGSPRVLEK QWTSPLKARL NCSVPGDSHF 300 

70 YFNVLQAVTG WSLGGRPW LAVFSTPSNS IPGSAVCAFD LTQVAAVFBG RFREQKSPES 360 

IWTPVPEDQV PRPRPGCCAA PGMQYNASSA LPDDILHPVK THPLMDBAVP SLGHAPWILR 420 

TLMRHQLTRV AVDVGAGPWG NQTWPLGSE AGTVLKFLVR PHASTSGTSG LSVFLBEFBT 480 

YRPDRCGRPQ GGBTGQRLLS LELDAASGGL IAAFPRCWR VPVARCQQYS GCHKNCIGSQ 540 

OPYCGWAPDG SCIFLSPGTR AAFEQDVSGA STSGLGDCTG LLRASL8EDR AGLV8VNLLV 600 

75 TSSVAAFWG AWSGFSVGW FVGLRBRRBL ARRKDKEAIL AKGAGEAVLS VSRLGBRRAQ 660 

GPGGRGGGGG GGAGVPPBAL LAPLMQNGWA KATLLQGGPH DLDSGLLPTP EQTPLPQKRL 720 
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PTPHPHPHAL GPRAWDHGHP IiLPASASSSL LLLAPARAPB QPPAPGEPTP DGRLYAARPG 780 
RASHGDFPLT PHASPDRRRV VSAPTGPLDP ASAADOLPRP HSPPPTOSLR RPLGPHAPPA B40 
ATLRRTHTFN SGBARPGDRH RGCHARPGTD LAHLLPYGGA DRTAPPVP 



Seq ID NO: 168 DNA sequence 
Nucleic Acid Accession #: AW205664 

Coding sequence: 1-135 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 SI 

I I I I I I 

2SSCAC0AGG AQAACAGGGG CCTCTGCCTC AGTTTGCCCG GGAGCCAGCC AGGGCCCATC 60 

CTAATTTGGA GCACAGTCTT CCOGGTGCCT AGACATGCCA AGGCCCCTCC CACGTGGTAC 120 

ACCCTCTCCG TTTAGTACCT GACCACCTGT TTCAAAACGC AGGTGTTTCT GGTTTAGAAA 180 

CTTGGAAGGC GGAATGTGTT TTCGTGTCTT CTAGGAAGGG TCTGCTGAGG ACCAGACCAC 240 

GTAAGCCTGA GTGGATCCTG ACTCAGCTGC AGCCCTTACC TGCCTCGTGC TGATGATCTA 300 

TGCATGGCGT TATOTAGATC ACGTGCGGCA GAGACAGCCA CTGTCCTGTG TGCGGGTTTT 360 
TAAAACAGCT GCCCTGGATG AAAOGGAATA AACCAGTGAT GCTAAAAAAA AAAAAAAAAA 



Seq ID No: 169 Protein sequence: 
Protein Accession #: AW205664 



1 11 • 21 31 41 51 

I I I I \ I 

RKEENRGLCL SLPGSQPGPI LIWSTVFPVP RHAKAPPTWY TLSV 



Seq ID NO: 170 DNA sequence 
Nucleic Acid Accession #: AB033100 

Coding sequence: 32-2623 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 SI 

I I 1 I I I 

AGGTCTGGGG TCCTGAGGCT GCTQGCAGAC TATGGGTACA ACGGCCAGCA CAGCCCAGCA 60 

GACGGTCTCG GCAGGCACCC CATTTGAGGG CCTACAGGGC AGTGGCACGA TGGACAGTCG 120 

GCACTCCGTC AGCATCCACT CCTTCCAGAG CACTAGCTTG CATAACAGCA AGGCCAAGTC 180 

CATCATCCCC AACAAGGTGG CCCCTGTTGT GATCACGTAC AACTGCAAGG AGGAGTTCCA 240 

GATCCATGAT GAGCTGCTCA AGGCTCATTA CACGTTGGGC CGGCTCTCGG ACAACACCCC 300 

TGAGCACTAC CTGGTGCAAG GAGCTCAGGC CTTACCCCAG GGCCGCTACT TCCTGGTGCG 360 

GGATGTCACT GAGAAGATGG ATGTGCTGGG CACCGTGGGA AGCTGTGGGG CCCCCAACTT 420 

CCGGCAGGTG CAGGGTGGGC TCACTGTGTT CGGCATGGGA CAGCCCAGCC TCTTAGGGTT 480 

CAGGCGGGTC CTCCAGAAAC TCCAGAAGGA CGGACATAGG GAGTGTGTCA TCTTCTGTGT 540 

GCGGGAGGAA MCTGTGCTTT TCCTGCGTGC AGATGAGGAC TTTGTGTCCT ACACACCTCG 600 

AGACAAGCAG AACCTTCATG AGAACCTCCA GGGCCTTGGA CCCGGGGTCC GGGTGGAGAG 660 

CCTGGAGCTG GCCATCCGGA AAGAGATCCA CGACTTTGCC CAGCTGAGCG AGAACACATA 720 

CCATGTGTAC CATAACACCG AGGACCTOTG GGGGGAGCCC CATGCTGTGG CCATCCATGG 780 

TGAGGACGAC TTGCATGTGA CGGAGGAGGT GTACAAGCGG CCCCTCTTCC TGCAGCCCAC 840 

CTACAGGTAC CACCGCCTGC CCCTGCCCGA GCAAGGGAOT CCCCTGGAGG CCCAGTTGGA 900 

CGCCTTTGTC AGTGTTCTCC GGGAGACCCC CAGCCTGCTG CAGCTCCGTG ATGCCCACGG 960 

GCCTCCCCCA GCCCTCGTCT TCAGCTGCCA GATGGGCGTG GGCAGGACCA ACCTGGGCAT 1020 

GGTCCTCGGC ACCCTCATCC TQCTTCACCG CAGTGGGACC ACCTCCCAGC CAGAGGCTGC 1080 

CCCCACGCAG GCCAAGCCCC TGCCTATGGA GCAGTTCCAG GTGATCCAGA GCTTTCTCOG 1140 

CATGGTGCCC CAGGGAAGGA GGATGGTGGA AGAGGTGGAC AGAGCCATCA CTGCCTGTGC 1200 

CGAGTTGCAT GACCTGAAAG AAGTGGTCTT GGAAAACCAG AAGAAGTTAG AAGGTATCCG 1260 

ACCGGAGAGC CCAGCCCAGG GAAGCGGCAG CCGACACAGC GTCTGGCAGA GGGCGCTGTG 1320 

GAGCCTGGAG CGATACTTCT ACCTGATCCT GTTTAACTAC TACCTTCATG AGCAGTACCC 1380 

GCTGGCCTTT GCCCTCAGTT TCAGCCGCTG GCTGTGTGCC CACCCTGAGC TGTACCGCCT 1440 

GCCCGTGACG CTGAGCTCAG CAGGCCCTGT GGCTCCGAGG GACCTCATCG CCAGGGGCTC 1500 

CCTACGGGAG GACGATCTGG TCTCCCCGGA CGCGCTCAGC ACTGTCAGAG AGATGGATOT IS 60 

GGCCAACTTC CGGCGGGTGC CCCGCATGCC CATCTACGGC ACGGCCCAGC CCAGCGCCAA 1620 

GGCCCTGGGG AGCATCCTGG CCTACCTGAC GGACGCCAAQ AGGAGGCTGC GGAAGGTTGT 1680 

CTGGGTGAGC CTTCGGGAGG AGGCCGTGTT GGAGTGTGAC GGGCACACCT ACAGCCTGCG 1740 

GTGGCCTGGG CCCCCT3TK3G CTCCTGACCA GCTGGAGACC CTGGAGGCCC AGCTGAAGGC 1800 

CCATCTAAGC GAGCCTCCCC CAGGCAAGGA GGGCCCCCTG ACCTACAGGT TCCAGACCTG 1860 

CCTTACCATG CAGGAGGTCT TCAGCCAGCA CCGCAGGGCC TGTCCTGGCC TCACCTACCA 1920 

CCGCATCCCC ATGCCGGACT TCTGTGCCCC CCGAGAGGAG GACTTTGACC AGCTGCTGGA 1980 

GGCCCTGCGG GCCGCCCTCT CCAAGGACCC AGGCACTGGC TTCGTGTTCA GCTGCCTCAG 2040 

CGGCCAGGGC CGTACCACAA CTGCGATGGT GGTGGCTGTC CTGGCCTTCT GGCACATCCA 2100 

AGGCTTCCCC GAGGTGGOTG AGGAGGAGCT CGTGAGTGTG CCTGATGCCA AGTTCACTAA 2160 

GGGTGAATTT CAGGTAGTAA TGAAGGTGGT GCAGCTGCTA CCCGATGGGC ACCGTGTGAA 2220 

GAAGGAGGTG GAOGCAGCGC TGGACACTGT CAGCGAGACC ATGACGCCCA TGCACTACCA 2280 

CCTGCGGGAG ATCATCATCT GCACCTACCG CCAGGCGAAG GCAGCGAAAG AGGCGCAGGA 2340 

AATGCGGAGG CTGCAGCTGC GGAGCCTGCA GTACTTGGAG CGCTATGTCT GCCTGATTCT 2400 
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CTTCAACGCG 
GATGCAGGAG 
CGAGCTGGAG 
GAGCTGCAGC 
CCCCCCACCC 
GGCCCTGAGO 
TTGGGAGCCT 
CAAACCACCA 
GTGCACACTG 
CCCCCAOTTG 
TGOCAGCCCC 
CTQCTCTGCC 
GCCTGAGGTG 
ACCCAGTTTT 
CCTACCCAGC 
AGGGAGCCCC 
AGGCCTGCTC 
GCTGCAGGTC 
CCTGCTGCAG 
TGGACAGTGA 
GGCCATCCTT 
CTGCTGAGTT 
CAGGTGGCAG 
GTGCTTGCTG 
GGGACCAGSA 
GATGTGTCAT 
CAGTATTGGA 
GTTCACTACC 
CTTTGCAGAC 
GTGTTCTCAT 
CTCTCTGCCT 
CRGGGGTGTT 
GCATTAAAAA 



TACCTCCACC 



AGCGGGGAGG 
CTCGAGCCCT 
ACAGGGCCCC 
GGTGCTGGCC 
TTTTAGAAAG 
AGGTGTGTGG 
CTGTGTGTAC 
CCAAACACTG 
TGGCACAGAG 
ATTGCCGCTC 
GGTGGAGQGG 
CTGGACTCTC 
CTGGTGGGGC 
TCTCATGGGG 
CACTTGTCTG 
CCCCGGCATC 
CAGGAGCCCC 
GGTGTGCAAG 
GCTGAGCATC 
AGAGGCTGCT 
AGAAGTGCCA 
AAACCCAGGA 
AAGCCTGTCT 
TGGTCATGAT 
AAATATTTGA 
TTTTCAGGTT 
AAGGTCTAGA 
GGTTGGCTCT 
CAGTTTCCCC 
GTGAGGATTC 
CAGCTAAATG 



TOGAGAAGGC 
AGGCTGGCAT 
ACCAGCCCTT 
CTGCCCCOGA 
AOGCAGGCCT 
TTGAAATGAT 
AACTTTTTAT 
CTGACCTCCA 
CTTGCAGACA 
TGGATCTCTC 
CAGACCXGGC 
CCCTTCTTGC 
ACAGTGTTCT 
ATGCCCCCAT 
TGGCAGGATG 
AGGAAAGAGC 
GGAACCTGGG 
TCTCTCTGTC 
AAGGAGTGCT 
GGTGCACTGA 
TTTGAGOCTG 
GGGATCCACT 
TGTTTGCGTT 
GCTGAACAGT 
TTGGTTAGGC 
ATTTGAAAAG 
CCCCCTTGGC 
TATTGTTTTT 
TGCGGAGTCA 
GACTTTCAGC 
ATCTGTAAAA 
ATTTGTGATT 
TG 



CGACTCCTGG 
CXACGAGATC 
CTCCAGGCTG 
GGACTTGCTG 
GGGGTGTCTG 
TCCCCCACTT 
AGGACAGGGA 
GGGAGGAGCA 
GGCCGGCGTT 
TGTCCTCTTC 
CACTGGTAGC 
TGCCCAAGCA 
GGATAGATCT 
CTCCGACCTG 
GTGGAGGTTT 
TTCCAGGGGG 
CAGGAGGCAC 
COGGCAGCCC 
AGCTGAGGGT 
GGGTGGTGGG 
CCTTCCGQTG 
GTTTCCACAC 
GAGCCTTGCA 
GAGGAGGCTG 
TOGTGTACTT 
GGGAGGAGGC 
TGAATTCTTT 
ATTTTTGCAT 
GAGATGGGAC 
TGTGTTGGGA 
TGGGAGAATA 

rm - riTirr 



CAGAGGCCCT 
CTTAAOGAGC 
CGCTACCGGT 
TAGGGGGCCT 
AGGTGCTCTT 

cctggagaga 
gacagcacag 
ctcactggag 
cagcctccaa 

TCCCCTCTCT 
TCCCCACTTC 
CTGCCCTCGG 
ATTATGTGAA 
GGAGACTTCA 
CTCAAGGAGC 
CGAAOGCAGC 
AGAGGAAGCC 
AGGATGGCCT 
GOTTGCTGGG 
AGGGGATCAC 
GGAGCAGAAA 
AGCGGGAAGG 
GCTCTTCCAG 
TCCACCTTGC 
CTGCAGGAAA 
CGAAGTTGTT 
TGCAGAACTA 
GAATTAAGAC 
TGAATGGGGA 
CCACTGGCTG 
ATACTTGCCT 
TTTTTGTACA 



TCAGCACCTG 
TGGGCTTCCC 
GGCAGGAGCA 
TACTCCCTGT 
GGCTGGGAGC 
CTGAGCGGAG 
CCATCCCTTG 
TGCTCACAAG 
GGGGCTCACT 
CAGATTGGCC 
CTTACTCCTG 
GCGTCTGGCA 
AGGCAGCTTC 
GGAATGACAA 
TGGAGACTTC 
ACAGAGGAAG 
AAGGCCTGGA 
GGTGCCCCCA 
GTGGTCCTCA 
CTGGGTTCCA 
AGGCCAGACC 
CTGCTGGGAA 
CTGGGGACTG 
TTGGCTCACT 



CCCATTTATC 
CTGTGTGTCT 
GTTTTAATTT 
GGGATCCTTT 
ATCACATCAC 
ACCTACCTCA 
GAGCTTTTAA 



2460 
2520 
2580 
2640 
2700 
2760 
2820 
2860 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 



Seq ID No: 171 Pro^jn aequence: 
Protein Accession #: BAA86588.1 



I 

KGTTASTAQQ 
ITYNCKEBFQ 
TVGSCGAPNP 
EDFVSYTPRD 
BPKAVAIHGE 
LLQfcRDAHGP 
FQVIQSFIiRM 
HSVWQRALWS 
PRDLIARGSL 
AKRRLRJCWW 
PLTYRFQTCL 
TGFVFSCLSG 
LLPDGHRVKK 
LKRYVCLILF 
RLRYRWQEQS 



11 
I 

TVSAGTPPEG 
IHDEXlLKAHY 
RQVQGGIjTVF 
KQNLHENLQG 
DDLHVTEEVY 
PPAfcVFSCQM 
VPQGRRMVEE 
LERYFYLILF 
REDD iiVS PDA 
VSLREEAVLE 
TMQBVFSQHR 
QGRTTTAMW 
BVDAALDTVS 
KAYLHLEKAD 
CSLEPSAPED 



21 
I 

LQGSGTKDSR 
TLGRLSDNTP 
GMGQPSLLGF 
LGPGVRVESL 
KRPLFLGPTY 
GVGRTNI/2MV 
VDRAITACAE 
NYYLHEQYPL 
LSTVREMDVA 
CDGHTYSLRW 
RACPGLTYHR 
AVLAFWHIQG 
ETMTPMHYHL 
SWQRPFSTWM 
LL 



31 

I 

HSVSIHSFQS 
EHYLVQGAQA 
RRVLQKLQKD 
ELAIRKEIHD 
RYHRLPLPEQ 
LGTLILLHRS 
LHDLKEWLE 
AFALSFSRWL 
NFRRVPRMPI 
PGPPVAPDQD 
1PMPDFCAPR 
FPEVGEEELV 
REIIICTYRQ 
QEVASKAGIY 



41 

I 

TSLHNSKAXS 
LPQGRYFLVR 
GHRECVIFCV 
FAQLSENTYU 
GSPLEAQLDA 
GTTSQPEAAP 
NQKKLEGIRP 
CAHPELYRLP 
YGTAQPSAKA 
ETLEAQLKAH 
EEDFDQLLEA 
SVPDAKFTKG 
AKAAKEAQEM 
EILNELGFPE 



51 
I 

IIPNKVAPW 
DVTEKMDVLG 
REEVLFLRAD 
VYHNTEDLWG 
FVSVLRETPS 
TOAKPLPMEQ 
BSPAQGSGSR 
VTLSSAGPVA 
LGSILAYLTD 
LSEPPPGKEG 
LRAALSKDPG 
EFQWMKWQ 
RRLQLiRSLQY 
LESGEDQPFS 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 



Seq ID NO: 172 DNA sequence 

Nucleic Acid Accession 8: AK021806.1 

Coding sequence: 1-645 (underlined sequences correspond to start and stop codons) 



1 11 21 

I I I 

ACTG TGCTTT TCCTGCGTGC AGATGAGGAC 
AACCTTCATG AGAACCTCCA GGGCCTTGGA 
GCCATCCGGA AAGAGATCCA CGACTTTGCC 
CATAACACCG AGGACCTGTG GGGGOAGCCC 
TTGCATGTGA CGGAGGAGGT GTACAAGCGG 
CACCGCCTGC CCCTGCCCGA GCAAGGGAGT 
AGTGTTCTCC GGGAOACCCC CAGCCTGCTG 
GCCCTCGTCT TCAGCTGCCA GATGGGCGTG 
ACCCTCATCC TGCTTCACCG CAGTGGGACC 
GCCAAGCCCC TGCCTATGGA GCAGTTCCAG 
CAGGGAAGGA GGATGGTGGA AGAGGTGGAT 
AGTTTTCTGG ACTCTCATGC CCCCATCTCC 
CCCAGCCTGG TGGGGCTGGC AGGATGGTGG 
AGCCCCTCTC ATGGGGAGGA AAGAGCTTCC 



31 41 51 

f I I 

TTTGTGTCCT ACACACCTCG AGACAAGCAG 60 

CCCGGGGTCC GGGTCGAGAG CCTGGAGCTG 120 

CAGCTGAGCG AGAACACATA CCATGTGTAC 180 

CATGCTGTGG CCATCCATGG TGAGGACGAC 240 

CCCCTCTTCC TGCAGCCCAC CTACAGGTAC 300 

CCCCTGGAGG CCCAGTTGGA CGCCTTTGTC 360 

CAGCTCCGTG ATGCCCACGG GCCTCCCCCA 420 

GGCAGGACCA ACCTGGGCAT GGTCCTGGGC 480 

ACCTCCCAGC CAGAGGCTGC CCCCACGCAG 540 

GTGATCCAGA GCTTTCTCOG CATGGTGCCC 600 

AGATCTATTA TQTGAAAGGC AGCTTCACCC 660 

GACCTGGGAG ACTTCAGGAA TGACAACCTA 720 

AGGTTTCTCA AGGAGCTGGA GACTTCAGGG 780 

AGGQGGCQAA COCAGCACAG AGGAAGAGGC 840 
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CTGCTCCACT TGTCTGGGAA CCTGGGCAGG AGGCACAGAG GAAGCCAAGG CCT0GA6CTG 900 

CAGGTCCCCC GGCATCTCTC TCTOTCOOGG CAGCCCAGGA TCGCCTGGTG CCCCCACCTG 960 

CTGCAOCAGG AGCCCCAAGG AGTGCTAGCT GAGGGTGCTT GCTGGGGTGG TCCTCATGGA 1020 

CAGTGAGGTG TGCAAGGGTG CACTGAGGGT GGTGQGAGGG GATCACCTGG GTTCCAGGCC 1080 

ATCCTTGCTG AGCATCTTTG AGCCTGCCTT COGGTGGGAG CAGAAAAGGC CAGACCCTGC 1140 

TGAGTTAQAG GCTGCTGGGA TCCACTOTTT CCACACAGCG GOAAGGCTGC TGGGAACAGG 1200 

TGGCAGAGAA OTGCCATOTT TGCGTTGAGC CTTGCAGCTC TTCCAGCTGG GGACTGGTGC 1260 

TTGCTGAAAC CCAGGAGCTG AACAGTGAGG AGGCTOTCCA CCTTGCTTGG CTCACTGGGA 1320 

CCAGGAAAGC CTGTCTTTGG TTAGGCTOGT GTACTTCTGC AGGAAAAAAA AAAAAGGATG 1380 

TGTCATTGGT CATGATATTT GAAAAGGGGA GGAGGCCGAA GTTGTTCCCA TTTATCCAGT 1440 

ATTGGAAAAT ATTTGACCCC CTTGGCTGAA TTCTTTTGCA GAACTACTGT GTGTCTGTTC 1500 

ACTACCTTTT CAGGTTTATT GTTTTTATTT TTGCATGAAT TAAGACGTTT TAATTTCTTT 1560 

GCAGACAAGG TCTAGATGCG GAGTCAGAGA TGGGACTGAA TGGGGAGGGA TCCTTTQTGT 1620 

TCTCATGGTT GGCTCTGACT TTCAGCTGTG TTGGGACCAC TGGCTGATCA CATCACCTCT 1680 

CTGCCTCAGT TTCCCCATCT GTAAAATGGG AGAATAATAC TTGCCTACCT ACCTCACGGG 1740 

GGTGTTGTGA GGATTCATTT GTGATTTTTT riTTTTTTTT TGTACAGAGC TTTTAAGCAT 1800 
TAAAAACAGC TAAATGTG 



Seq ID No: 173 Pggtfijq Sequence » 
Protein Accession 8> AK021806.1 



1 11 21 31 41 51 

I I I I I I 

TVLPLRADED FVflVTPRDKQ NLHENLQGLG PGVRVESLEL AIRKEIHDFA QLSENTYHVY 60 
HNTBDLWGBP HAVAIHGEDD LHVTEBVYKR PLFLQPTYRY HRLPLPEQGS PLEAQLDAFV 120 
SVLRBTPSLL QLRDAHGPPP ALVFSCQMGV GRTNLGMVLG TLILLHRSGT TSQPEAAPTQ 180 
AKPLPMSQFQ VIQSFLRMVP QGRRMVEBVD RSIM 



Seq ID NO: 174 DNA sequence , 

Nucleic Acid Accession #: NM_016S80.2 

Coding sequence: 1212-4766 (underlined sequences correspond to start and stop codons) 



1 11 21 , 31 41 51 

I I I I I I 

GGGAAGCGGG AGGAGAGCCA CACGGTCAAG TTGCACAGGT TCTTGCAGCT TCTGGAATCA 60 

AGACCATGGG CACCCTCATA AGTCAGTGTG GGCAGGGACT GCCCCAGGGC CAATCCAAGA 120 

TCCAGAGGTA GCCATAGGGT GTGACAAGTT GTGCAGATTA CAACACTCAC CCCTTGCAAT 180 

AACGTCACTG CCTGTGACTC GGGGCCAGGC CCAGGCCAAA GCCCTTCCTA CATCATTTCG 240 

TTTAATCCTC ACAGTTTCCT GCTGAAAGGG CTACTATTCT TACTCCCATC CCCACTCTAC 300 

AGATGAGGTA ATGGAGGCCC AGGAAAGTTA AGTGACTTQT CCCAGATGAC ACCGCTGGTA 360 

AGTTGCAAAG TCAGAATTTG AACTCAGGCA GTTTACCTCT GATGGCTGCT CTGTTAATCA 420 

CAGCTOCTTT CCAGTGAGAC AAAAACGGGT GATCAGGGCA GAGTCAAGAC AGAGAGGTAA 480 

ACAAGATTGG GAAAAAGACA GGAATGAGAG GGGAACAATG GGGGAAAAGA TAGGAACAAA 540 

GAGAGTTGGG GAAGGGGAGA GAAACAGGAA ACATGACTTG CCCGGGAGGG GCATCAGTCC 600 

ACGTGCAAGC AGGTGGAGGC TCAAGTTTTC TGCTCACTTG GTGATGCAGA GGCTCCCTTT 660 

CCCTCAGCAG CCGCCTTGCT GCGTGGACAG CAGCTTCCCA TCTGGCCTGT CCCCGGAGCC 720 

CCGGCCTCAT CCTCCTCAGC GGCAGGCCAC TTAGCTTCAC AGGAAATGCT CTTTCTCTAA 780 

TTGGCATTGA AACTCACAGC CCTCCCTTTT CCTGTAGGTG GGGTTTCCAT AGGAAAAAGC 840 

TGCTTCTCTG TTTCCCCAGC CTAGCAACTG TTTGGCAGTC AGAGTCCCAC ATCCTGCTCA 900 

ACTGGGTCAG GTCCCTCTTA GACCAGCTCT TGTCCATCAT TTGCTGAAGT GGACCAACTA 960 

GTTCCCCAGT AGGGGGTCTC CCCTGGCAAT TCTTGATCGG CGTTTGGACA TCTCAGATCG 1020 

CTTCCAATGA AGATGGCCTT GCCTTGGGGT CCTGCTTOTT TCATAATCAT CTAACTATGG 1080 

GACAAGGTTG TGCCGGCAGC TCTGGGGGAA GGAGCACGGG GCTGATCAAG CCATCCAGGA 1140 

AACACTGGAG GACTTGTCCA GCCTTGAAAG AACTCTAOTG GTTTCTGAAT CTAGCCCACT 1200 

TGGCGGTAAG CATSATGCAA CTTCTGCAAC TTCTGCTGGG GCTTTTGGGG CCAGGTGGCT 1260 

ACTTATTTCT TTTAGGGGAT TGTCAGGAGG TGACCACTCT CACGGTGAAA TACCAAGTGT 1320 

CAGAGGAAGT GCCATCTGGT ACAGTGATCG GGAAGCTGTC CCAGGAACTG GGCCGGGAGG 1380 

AGAGGCGGAG GCAAGCTGGG GCTGCCTTCC AGGTQTTGCA GCTGCCTCAG GCGCTCCCCA 1440 

TTCAGGTGOA CTCTGAGGAA GGCTTGCTCA GCACAGGCAG GCGGCTGGAT CGAGAGCAGC 1SO0 

TGTGGCGACA GTGGGATCCC TGCCTGOTTT CCTTTGATGT GCTTGCCACA GGGGATTTGG 1S60 

CTCTGATCCA TGTGGAGATC CAAGTGCTGG' ACATCAATGA CCACCAGCCA CGGTTTCCCA 1620 

AAGGCGAGCA GGAGCTGGAA ATCTCTGAGA GCGCCTCTCT GOGAACCCGG ATCCCCCTGG 1660 

ACAGAGCTCT TGACCCAGAC ACAGGCCCTA ACACCCTGCA CACCTACACT CTGTCTCCCA 1740 

GTGAGCACTT TGCCTTGGAT GTCATTGTGG GCCCTGATGA GACCAAACAT GCAGAACTCA 1800 

TAGTGGTGAA GGAGCTGGAC AGGGAAATCC ATTCATTTTT TGATCTGGTG TTAACTGCCT 1860 

ATGACAATGG GAACCCCCCC AAGTCAGGTA CCAGCTTGGT CAAGGTCAAC GTCTTGGACT 1920 

CCAATGACAA TAGCCCTGCG TTTGCTGAGA GTTCACTGGC ACTGGAAATC CAAGAAGATG 1980 

CTGCACCTGG TACGCTTCTC ATAAAACTGA CCGCCACAGA CCCTGACCAA GGCCCCAATG 204 0 

GGGAGGTGGA GTTCTTCCTC AGTAAGCACA TGCCTCCAGA GGTGCTGGAC ACCTTCAGTA 2100 

TTGATGCCAA GACAGGCCAO GTCATTCTGC GTCGACCTCT AGACTATGAA AAGAACCCTG 2160 

CCTACGAGGT GGATGTTCAG GCAAGGGACC TGGGTCCCAA TCCTATCCCA GCCCATTGCA 2220 
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AAGTTCTCAT CAAGGTTCTG GATGTCAATG ACAACATCCC AAGCATCCAC GTCACATGGG 2280 

CCTCCCAGCC ATCACTOGTG TCAGAAGCTC TTCCCAAGGA CAGTTTTATT GCTCTTGTCA 2340 

TGGCAGATGA CTTGGATTCA GGACACAATG GTTTGGTCCA CTGCTGGCTO AGCCAAGAGC 2400 

TGGGCCACTT CAGGCTGAAA AGAACTAATG GCAACACATA CATOTTGCTA ACCAATGCCA 2460 

5 CACTGGACAG AGAGCAGTGG CCCAAATATA CCCTCACTCT GTTAGCCCAA GACCAAGGAC 2520 

TCCAGCCCTT ATCAGCCAAG AAACA6CTCA GCATTCAGAT CAGTGACATC AACGACAATG 2580 

CACCTGTQTT TGAGAAAAGC AGGTATGAAG TCTCCACGCG GGAAAACAAC TTACCCTCTC 2640 

TTCACCTCAT TACCATCAAG GCTCATGATG CAGACTTGGQ CATTAATGGA AAAGTCTCAT 2700 

ACCGCATCCA GGACTCCCCA GTTGCTCACT TAGTAGCTAT TGACTCCAAC ACAGGAGAGG 2760 

10 TCACTGCTCA GAGGTCACTG AACTATGAAG AGATGGCCGG CTTTGAGTTC CAGGTGATCG 2820 

CAGAGGACAG CGGGCAACCC ATGCTTGCAT CCAGTGTCTC TGTGTGGGTC AGCCTCTTGG 2880 

ATGCCAATGA TAATGCCCCA GAGGTGGTCC AGCCTGTGCT CAGCGATGGA AAAGCCAGCC 2940 

TCTCCGTGCT TGTGAATGCC TCCACAGGCC ACCTGCTGGT GCCCATCGAG ACTCCCAATG 3000 

GCTTGGGCCC AGCGGGCACT GACACACCTC CACTGGCCAC TCACAGCTCC CGGCCATTCC 3060 

15 TTTTGACAAC CATTGTGGCA AGAGATGCAG ACTCGGGGGC AAATGGAGAG CCCCTCTACA 3120 

GCATCCGCAG TGGAAATGAA GCCCACCTCT TCATCCTCAA CCCTCATACG GGGCAGCTGT 3180 

TCGTCAATGT CACCAATGCC AGCAGCCTCA TTGGGAGTGA GTGGGAGCTG GAGATAGTAG 3240 

TAGAGGACCA GGGAAGCCCC CCCTTACAGA CCCGAGCCCT GTTGAGGGTC ATGTTTGTCA 3300 

CCAGTGTGGA CCACCTGAGG GACTCAGCCC GCAAGCCTGG GGCCTTGAGC ATGTCGATGC 3360 

20 TGACGGTGAT CTGCCTGGCT GTACTGTTGG GCATCTTOGG GTTGATCCTG GCTTTGTTCA 3420 

TGTCCATCTG CCGGACAGAA AAGAAGGACA ACAGGGCCTA CAACTGTCGG GAGGCCGAGT 3480 

CCACCTACCG CCAGCAGCCC AAGAGGCCCC AGAAACACAT TCAGAAGGCA GACATCCACC 3540 

TCGTGCCTGT GCTCAGGGGT CAGGCAGGTG AGCCTTGTGA AGTCGGGCAG TCCCACAAAG 3600 

ATGTGGACAA GGAGGCGATG ATGGAAGCAG GCTGGGACCC CTGCCTGCAG GCCCCCTTCC 3660 

25 ACCTCACCCC GACCCTGTAC AGGACGCTGC GTAATCAAGG CAACCAGGGA GCACCGGCGG 3720 

AGAGCCGAGA GGTGCTGCAA GACACGGTCA ACCTCCTTTT CAACCATCCC AGGCAGAGGA 3780 

ATGCCTCCOG GGAGAAOCTG AACCTTCCCG AGCCCCAGCC TGCCACAGGC CAGCCACGTT 3840 

CCAGGCCTCT GAAGGTTGCA GGCAGCCCCA CAGGGAGGCT GGCTGGAGAC CAGGGCAGTG 3900 

AGGAAGCCCC ACAGAGGCCA CCAGCCTCCT CTGCAACCCT GAGACGGCAG CGACATCTCA 3960 

30 ATGGCAAAGT GTCCCCTGAG AAAGAATCAG GGCCCCGTCA GATCCTGCGG AGCCTGGTCC 4020 

GGCTGTCTGT GGCTGCCTTC GCCGAGCGGA ACCCCGTGGA GGAGCTCACT GTGGATTCTC 4080 

CTCCTGTTCA GCAAATCTCC CAGCTGCTGT GCTTGCTGCA TCAGGGCCAA TTCCAGCCCA 4140 

AACCAAACCA CCGAGGAAAT AAGTACTTGG CCAAGCCAGG AGGCAGCAGG AGTGCAATCC 4200 

CAGACACAGA TGGCCCAAGT GCAAGGGCTG GAGGCCAGAC AGACCCAGAA CAGGAGGAAG 4260 

35 GGCCTTTGGA TCCTGAAGAG GACCTCTCTG TGAAGCAACT GCTAGAAGAA GAGCTGTCAA 4320 

GTCTGCTGGA CCCCAGCACA GGTCTGGCCC TGGACCGGCT GAGCGCCCCT GACCCGGCCT 4380 

GGATGGCGAG ACTCTCTTTG CCCCTCACCA CCAACTACCG TGACAATGTG ATCTCCCCGG 4440 

ATGCTGCAGC CAOGGAGGAG CCAAGGACCT TCCAGACGTT CGGCAAGGCA GAGGCACCAG 4500 

AGCTGAGCCC AACAGGCACG AGGCTGGCCA GCACCTTTGT CTCGGAGATG AGCTCACTGC 4560 

40 TGGAGATGCT GCTGGAACAG CGCTCCAGCA TGCCCGTGGA GGCCGCCTCC GAGGCGCTGC 4620 

GGCGGCTCTC GGTCTGCGGG AGGACCCTCA GTTTAGACTT GGCCACCAGT GCAGCCTCAG 4680 

GCATGAAAGT GCAAGGGGAC CCAGGTGOAA AGACGGGGAC TGAGGGCAAG AGCAGAGGCA 4740 

GCAGCAGCAG CAGCAGGTGC CTQTGAACAT ACCTCAGACG CCTCTGGATC CAAGAACCAG 4800 

GGGCCTGAGG ATCTGTGGAC AAGAGCTGGT TTCTAAAATC TTGTAACTCA CTAGCTAGCG 4860 

45 GCGGCCTGAG AACTTTAGGG TGACTGATGC TACCCCCACA GAGGAGGCAA GAGCCCCAGG 4920 

ACTAACAGCT GACTGACCAA AGCAGCCCCT TGTAAGCAGC TCTGAGTCTT TTGGAGGACA 4980 

GGGACGGTTT GTGGCTGAGA TAAGTGTTTC CTGGCAAAAC ATATGTGGAG CACAAAGGGT 5040 

CAGTCCTCTG GCAGAACAGA TGCCACGGAG TATCACAGGC AGGAAAGGGT GGCCTTCTTG " 5100 

GGTAGCAGGA GTCAGGGGGC TGTACCCTGG GGGTGCCAGG AAATGCTCTC TGACCTATCA 5160 

50 ATAAAGGAAA AGCAGTGATT CAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
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Seq ID No: 175 Protein aeauence; 
Protein Accession #: NP 057664.1 



11 21 31 41 51 



MMQLLQLLLG LLGPGGYLFIi LGDCQEVTTL TVKYQVSBSV PSGTVJGKLS QELGREERRR 60 

QAGAAFQVLQ LPQALPIQVD SEEGIiLSTGR RLDREQLCRQ WDPCLVSFDV LATGDLALIH 120 

60 VEIQVLDIND HQPRFPKGEQ ELBISBSASL RTRIPLDRAL DPDTGPNTLH TYTLSPSEHF 180 

ALDVIVGPDE TKHAELIWK ELDREIHSFF DLVLTAYDNG NPPKSGTSLV KVNVLDSNDN 240 

SPAFABSSUV LEIQEDAAPG TLLIKLTATD PDQGPNGEVE FFLSKHMPPE VLDTFSIDAK 300 

TGQVILRRPL DYBKNPAYEV DVQARDLOPN PIPAHCKVLI KVLDVNDNIP 8IHVTWASQP 360 

SLVSEALPKD SPIALVMADD LDSGHNGLVH CWLSQBLGHP RLKRTNGNTY MLLTMATLDR 420 

65 EQWPKYTLTL LAQDQGLQPL 8AKKQLSIQI SDINDNAPVF EKSRYEVSTR EKNLP3LHLI 480 

TIKAKDAOLG INGKVSYRIQ DSPVAHLVAJ DSNTGEVTAQ RSLNYEEMAG FEFQVIAEDS 540 

GQPMLASSVS VHVSUiDAND NAPBWQPVL SDGKAS LSVL VNASTGHLLV PIETPNGLGP 600 

AGTDTPPLAT HSSRPPLLTT IVARDAOSGA NGEPLYSIRS GNEAHLFILN PHTGQLFVNV 660 

TNA5SLIGSE NELBIWEDQ GSPPLQTRAL LRVMFVTSVD HLRDSARKPG ALSMSKLTVT 720 

70 CLAVLLGIFG LILALFM3IC RTEKKDNRAY NCREAESTYR QQPKRPQKHI QKADIHLVPV 780 

LRGQAGEPCB VGQSHKDVDK EAMMEAGWDP CLQAPFHLTP TLYRTLRMQG NQGAPABSRE 840 

VLQDTVNLLF NHPRQRKASR ENLNLPBPQP ATGQPRSRPL KVAGSPTGRL AGDQGSEEAP 900 

QRPPASSATL RRQRHLNGKV SPBKESGPRQ ILRSLVRLSV AAFAERNPVB ELTVDSPPVQ 960 

QI SQLLShhH QGQFQPKPNH RGNKYLAKPG GSRSAIPDTD GPSARAGGQT DPEQEBGPLD 1020 

75 PEEDLSVKQL LBEELSSLLD PSTGLALDRL SAPDPAWMAR LSLPLTTNYR DNVISPDAAA 1080 

TEEPRTPQTF GKAEAPELSP TGTRLASTFV SEMSSLLEML LEQRSSMPVE AASBALRRLS 1140 
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VCGRTLSLDL ATSAASGMKV QGDPGGKTGT EQKSRGSSSS SRCL 

Seq ID NO: 176 DNA sequence 
Wuclaic Acid Accession # : AL109712 . 1 

Coding sequence: 2-128 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GAG2CTCTTT GGGCCAGCCG GGCTGCTGCA GACAGACAGG AAOCACGCCT GACGCTCCTC 60 

TACCCTCGGG CAGCACAGCG GGGCTGGGAC TCACTCTAOC TTGCCCAGCA ACTTGCTTTC 120 

CTGTGTGAAC TCTOGCAGOC TGCCCTCTCT GTGCAAAGCT GCCACTGGGG CCTGCTCAGG 180 

GTGGCCTGGA ACTTGGAOGT GGGCAGTCAG GGCCTAGGAT GGGCCTGTGT CACCAGGGCA 240 

TOTGCCCTTG GGCCAGTEAC TTCCTCTCAG AGCCTTGGGC TCCTCCTCTG AGGATGGGGC 300 

TTGTTOGTOT GAAATGAGQT GAGCATOTTG AGTTGGGGAG CAGCAGGACA CGCACCTGCA 360 

GGCAGCCGCC CTGGCCACGC TCCCTCCCTA CCTTCCGAGT CCTGGGACAG ACACAGTAGA 420 

GCACAGCGGG CCAGCCTGCT CTCTTCTCTG TCTACTTTTT GCAGAAGAGT CAACAGATAC 480 

AACAGGCCCA GGGAGGTGCC CCTGGOGGCC CCAGTCCCCA TCACTCCAAG GGGCAGTCCT S40 

GCAAGTGACA AGGTGGGCCC AATCCCTGTG GAACAGGTCT CTGAGGACCA CAGAGTGGGG 600 

CCCCAGGGAA AGCTGGGAGC CGAGCTAGAG GCAGGCAGCA AGTAAGGGCA AAGCTGTGCC 660 

CCTGCCCGGA AGACCTTCCT GCCCCCAGAA CCCGACCCTC CGCAGATAGC CCTCCCTGGG 720 

CAGCAGCCCC CCAGCTTCCA AGGCCCGTGC CTCACCAGAC GCCATGCTCT CACGGACTTG 780 

TTTGCTGCTC TGTACCCTGC AGATCTGCCC CAGAGGAGCA GGTGAAAAGC CGCGCCTGCC 840 

GAGGTGCTGT GGCGGTGGAG TTTTGGGCAG AGGAGTGGGG GGAAGAGTTT CTCACTTTTA 900 

AGATTCTCCA AATCCAAGAT GAAGTCATGC TGTGCTTTGG AATGGTAGAT GCTCATTTAT 960 
GTAAAATCAT AATAAATGTT ACACAAACTG TTAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID No: 177 protein sequence^ 
Protein Accession it AL109712.1 

1 11 21 31 41 51 

I I I I I I 

VSLGQPGCCR QTGSTPDAPL PSGSTAGLGL TLACPATCFP V 

Seq ID NO: 178 pNA sequence. 
Nucleic Acid Accession #: none found 

Coding sequence: 3-107 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I 1 I I I 

AATGGAGCAC TCCAAAGAAC GATTTGACCA ATAGCATTTC TTCTCTGGGG GTTGTATTTC 60 

AAAGCATGCA ACTCTCCAGG GAACCAGAAC TAAATTGCTT AAAATGAAGT CATTCCTCAG 120 

ATTAACTTCC TCAGATAAAG TGTCAGCGGT CTGCAGAAAC GAAGAAGACA AAACTGAGAT 180 

TATCACTCAT AATTCTCTTA CTTACTATGT CAGTGAAACA ATGAGTTTGC ATTTTTGCAA 240 

TCCTAGAACA TTCTTCATTA GCCCTGGGTC ATGACCTCTT CCAGTTAATT CTCTTTCACA 300 

CCTTTAGGAA AGATTTAAGA TGAACCTTCA ATAGGATATT AACATAACTC ATAGCCAATA 360 

CCACAGCTGC CTTTCAAATT AATOAGGTTA ATTGTTCTCC AGCAAACATG AGTTTGTCTT 420 

TGGCATTTTA AATGCTTCCC ATTGATCTGA CATTTTGCTG TTTCAAGTTT TAAAGGGCTC 480 

AAATCAAAGA CTATTGATAA CTGAGCAAAG AGCGAAGATC CAGAAATACG AAAACATTGT 540 

crrrrmTr ccatgaaaaa caatcatagc c t t ttgaatt caatcgaagt ttctacatta 600 

GCCATCTAAG ACTTATTTAA TTATTTCTGT TCTCAGTCAA GCTAATTCAA GTGAATGAAC ' 660 

AGTATTGACT TTTAAAATCT TTTTTAAATT TTTTTAAATC TTTAGTTTAT TAAGTTTGTA 720 

GAAAAGCTCT GGGGCCATGA CCACTTACGT AAATGTTTCA GTTTAAAAAC AAAAGATTCA 780 

GGCCTCTAAT TTGAGCCAAA TCCAGGTGAT CTTGTTTGAA ATTTTTGATG AATTTGAAAA 840 

GATGAAAGTG GAACTTTTAA CATTCATGTT CCCCAAATTT TTCACTGGGA AGGGATGCTA 900 

ATTGCCTACT TAAGATATAA GTTCAAGAAT AACATTTTCA TAGAAAATTC AGAAAACTGC 960 

TTGACACAGC AGTGACATAG TTAGATGTGG CTCAGATGCC TTCCAAACCT GAGGGTCCCC 1020 

AAAGATTTCT TTACCAGTTG TTTTTAACTA TGAATCTTAA TCTTGTTCAT TCCCCTGCCA 1080 
. AAACAAATTT AAAAG 

Seq ID No: 179 protein peguepce: 
Protein Accession #t none found 

1 11 21 31 41 51 

I i I I I I 

WSTPKNDLTN SISSLGWFQ SMQLS REPEL NCLK 

Seq ID NO: 180 pNft sequence 
Nucleic Acid Accession #: none found 

Coding sequence: 2-176 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 
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CCGgGTGGGG CCTCGGGATG CAGGCGCCGG TGCCCGGGCC CCTGGGCCTQ CTGGACCCOG 60 

CAGAAGGGCT TTCGAGGAGG AAGAAGACGT CGCTCTGGTT TGTGGGGTCT CTGCTGCTGG 120 

TGTCCGTCCT CATAGTCACC GTCGOGCTGG CTGCATCAGC AGGACGGAGA ATGTGACCGT 180 

TGGGGGCTAC TACCCAOGGA TCATTCTCGG CTTTGGATCT TTCTTAGGAA TTATTGGCAT 240 

CAACTTGGTC GAGAATAGAA GGCAAATGCT GGTGGCAGCG ATCGTGTTTA TCAOTTTTGG 300 

CGTGGTGGCC GCCTTCTGCT GCGCCATCGT GGACGGCGTA TTTGCAGCAC AGCACATTGA 360 

ACCGAGGCCC CTCACCACGG GAAGATGCCA GTTTTACTCC AGTGGGGTGG GGTACTTGTA 420 

CGATGTCTAC CAGACAGAGG TGAGCAGGAG CACTGAGATT CATGTGGGTT TTGCTCAGCT 480 

AACCCCGCCG ACCCCACGCG GTTTTCCCTG CACATAGGCG TGGTCTGAAT ATTTGGATTC 540 

TAATAGTTCC TGGGGGTCAC CCCTGCAGCT GGTGAACCGT TGATGCCCCC TGTGTAAGGG 600 

ACCTTGACAT TTCGATGTGC TGTATTTCAC TCTGGAGTCA GAGTTCTGGA CTTGCTTCAT 660 

TAAATCACAA CAGTCTCAGA AAACAACCGC ACCACCCCGC AATCCCACCA AAGGGGCGCG 720 
CCGTCCCTAA GAGTTATCCC 

Seq ID No: 181 Protein sequence i 
Protein Accession #: none found 

I 11 21 31 41 51 

II I III 
RVGPRDAGAG ARAPGPAGPR RRAFEBEEDV ALVGGVSAAG VRPHSHRRAG CISRTENV 

Seq ID NO: 182 pNA sequence 

Nucleic Acid Accession #: AX001579.1 

Coding sequence: 1150-2637 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 SI 
1111 J I 

TTTTCTCTGC TTTTCGCTAC CCCGGTCACT CTCATTTCTC TCCCCTATTC CTTGTCTCTT 60 

CCCCCATCCC CCTTTCTCCT GTCCTCCCCC TGCCTCTACA GTGGTTCTCC COGCTGAGCT 120 

GCCACCAGCT GCTGGGCCCC GGGCTGCTGC GGCTGGGCCG CCTATGGCTG CGGTCCCCCT 180 

CCCATACAGC CCCGGCCCCT GGTCTCTGGC TGTCAGGGTT TGGCCTCCTT CGTGGTGACC 240 

ACCTCTTCCT GTGCTCAGCG CCGGGCCCAO GCCCCCCAGC CCCTGAGGAC ATGGTGCATC 300 

TGCGGCGGCT ACAGGAGATC AGTGTGGTTT CTGCAGCTGA CACCCCAGAT AAGAAAGAGC 360 

ATTTGGTCCT GGTGOAGACA GGAAGGACCC TGTATCTGCA AGGAGAGGGC CGGCTGGACT 420 

TCACGGCATG GAACGCAGCC ATTGGGGGCG CGGCTGGTGG GGGCGGCACA GGGCTGCAGG 480 

AGCAGCAGAT GAGCCGGGGT QACATCCCCA TCATCGTGGA TGCCTGCATC AGTTTTGTTA 540 

CCCAGCATGG GCTCCGGCTG GAAGGTGTAT ACCGGAAAGG GGGOGCTCGT GCCCGCAGCC 600 

TGAGACTCCT GGCTGAGTTC CGTCGGGATG CCCGGTCGGT GAAGCTCCGA CCAGGGGAGC 660 

ACTTTGTGGA GGATGTCACT GACACACTCA AACGCTTCTT TCGTGAGCTC GATGACCCTG 720 

TGACCTCTGC ACGGTTGCTG CCTCGCTGGA GGGAGGCTGC TGGTATTCCT AAGATCCCTG 780 

AGAGCCAAGG CCCAACCAGG ATCTCTGCCT TCCCCCACCA GAATCCATGG TTTGGCAGCC 840 

CTCCGCCCCA TCACTTCCCA CCCTGGGGGA TCATCCAGAG ACTTGGCTCA GGGGGAGGTG 900 

GGAAGGGGGC AGAGACACAT CCATCCTGCA TTTGTGCCTA AAAATCCCTC CCTCTGTACC 960 

AGCTGCCACT CTTTCTTCCC GGGTCCTCCC CAACCCTCCT CCATTCCATC CCCAGAGCTG 1020 

CCCCAGAAGA ATCAGCGCCT GGAGAAATAT AAAGATGTGA TTGGCTGCCT GCCGCGGGTC 1080 

AACCGCOGCA CACTGGCCAC CCTCATTGGG CATCTCTATC GGGTGCAGAA ATGTGCGGCT 1140 

• CTAAACCA GA TG TGCACGCG GAACTTGGCT CTGCTGTTTG CACCCAGCGT GTTCCAGACG 1200 

GATGGGCGAG GGGAGCACGA GGTGCGAGTG CTGCAAGAGC TCATTGATGG CTACATCTCT 1260 

GTCTTTGATA TCGATTCTGA CCAGGTAGCT CAGATTGACT TGGAGGTCAG TCTTATCACC 1320 

ACCTGGAAGG ACGTGCAGCT GTCTCAGGCT GGAGACCTCA TCATGGAAGT TTATATAGAG 1380 

CAGCAGCTCC CAGACAACTG TGTCACCCTG AAGGTGTCCC CAACCCTGAC TGCTGAGGAG 1440 

CTGACTAACC AGGTACTGGA GATGCGGGGG ACAGCAGCTG GGATGGACTT GTGGGTGACT 1500 

TTTOAGATTC GCGAGCATGG GGAGCTGGAG CGGCCACTGC ATCCCAAGGA AAAGGTCTTA 1S60 

GAGCAGGCTT TACAATGGTG CCAGCTCCCA GAGCCCTGCT CAGCTTCCCT GCTCTTGAAA 1620 

AAAGTCCCCC TGGCCCAAGC TGGCTGCCTC TTCACAGOTA TCCGACGTGA GAGCCCACGG 1680 

GTGGGGCTGT TGCGOTGTCG TGAGGAGCCA CCTCGCTTGC TGGGAAGCCG CTTCCAGGAG 1740 

AGGTTCTTTC TGCTGCGTGG CCGCTGCCTG CTGCTGCTCA AGGAGAAGAA AAGCTCTAAA 1800 

CCAGAACGGG AGTGGCCTTT GGAAGGTGCC AAGGTCTACC TGGGAATCCG CAAGAAGTTA 1860 

AAGCCCCCAA CACCGTQGGG CTTCACATTG ATACTAGAGA AGATGCACCT CTACTTGTCC 1920 

TGCACTGACG AGGATGAAAT GTGGGATTGG ACCACCAGCA TCCTTAAAGC CCAGCACGAT 1980 

GACCAGCAGC CAGTGGTCTT ACGACGCCAT TCCTCCTCTG ACCTTGCCCG TCAGAAGTTT 2040 

GGCACTATGC CTTTGCTGCC TATCCGTGGG GATGACAGTG GAGCCACCCT CCTCTCTGCC 2100 

AATCAGACCC TGOGGCGACT ACACAACCGG AGGACCCTGT CCATGTTCTT TCCAATGAAG 2160 

TCATCCCAGG GGTCTGTGGA GGAGCAAGAG GAGCTGGAGG AGCCTGTGTA CGAGGAGCCA 2220 

GTGTATGAGG AAGTAGGGGC CTTCCCTGAG TTGATCCAGG ACACTTCTAC CTCCTTCTCC 2280 

ACCACACOGG AGTGGACAGT GAAGCCAGAG AACCCCCTCA CCAGCCAGAA GTCATTGGAT 2340 

CAACCCTTTC TCTCCAAGTC AAGCACCCTT GGCCAGGAGG AGAGGCCACC TGAGCCCCCT 2400 

CCAGGCCCCC CTTCAAAGAG CAGTCCCCAG GCACGGGGGT CCCTAGAGGA ACAGCTGCTC 2460 

CAGGAGCTCA GCAGCCTCAT CCTGAGGAAA GGAGAGACCA CTGCAGGCCT GGGAAGTCCT 2520 

TCCCAGCCAT CCAGCCCCCA ATCCCCCAGC CCCACTGGCC TTCCAACACA GACACCTGGC 2580 

TTCCCCACCC AACCCCCATG CACTTCCAGT CCACCCTCCA GCCAGCCCCT CACATSA.CCC 2640 

TAGGACCAGC AGTCTGAGAG GGTAGGTACC AGAAGACCCA GAAACTCTTA TCGTGGCACT 2700 

GTTGCAGCTT CCTCTGCCCT GGCTGGAAAG ACTCCAGAAT CCAGTGTGGT GCTGTGGAAG 2760 

GAGCACTGGA CTAAAGGCTT CAGTGGCTGC GTGTCCCAGG ACAGGTCATG GCCCCTCTCT 2820 

GGGCCCAGCC CATTTATCTA TACCATGAGG TAACTGAAGT AAGGAGAGCA GTGAATGTCA 2880 
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AACTGTGTTT CTTAGAGCCA TAAGCCCCAC ATATTATCCC TGAACAAGGG CAGCTCCTGC 2940 

TTTATATATT TGATACGTAG GOOTTCCATG AGAGATTTTG GGTTTTAAAG GAATGGTTTT 3000 

ACTGCATTAA AGAAAAAAAA TGCTTTGGAA ACCAQAGGCC TGGGTGATGT TAAAGTCTAT 3060 

CCTOTCCCAC TTCCTACATT CTOGGACTAC CGTCAAGCCT GGAGTAGGGA GAQCGAGTTT 3120 
GGGAGCTGGG ACTCGGGGAG TCAAAAATAG ATGAGTAATT GTCAATAAAC CTGGGAACC 



Seq ID No: 183 protein sequence; 
Protein Accession 8: AK001S79.1 



1 11 21 

i 1 I 

MSLTHSNASF VSSWTLPLHG CCLAGGRLLV 
SHPGGSSRDL AQGEVGRGQR HIHPAFVPKN 

rlbkykdvig clprvnrrtl atlighlyrv 
KSVRVLQBXjI dgyisvpdid sdqvaqidle 
NCVTUCVSPT LTAEBLTNQV LEMRGTAAGM 
WCQLPBPCSA SLLLKKVPIA QAGCLFTGIR 
RGRCLLLLKB KKSSKPERBW PLEGAKVYLG 
EMWDWTTSIL KAQHDDQQPV VLRRHSSSDL 
RLHNRRTLSM FFPMKSSQGS VEEQEELEEP 
TVKPSNPLTS QKSLDQPFLS KSSTLGQEER 
LILRKGETTA GLGSPSQPSS PQSPSPTGLP 



31 41 SI 

I I I 

FLRSLRAKAQ PGSLPSPTRI HGLAALRPIT 60 

PSLCTSCHSP FPGPPQPSSI PSPEIiPQKNQ 120 

QKCAALNQMC TRULALLFAP SVFOTDGRGB 180 

VSLITTWKDV QLSQAGDLIM BVYIEQQLPD 240 

DLWVTFEIRH HGSLERPLHP KEKVLEOALQ 300 

RESPRVGLLR CRKEPPRItLG SRFQBRPPUj 360 

IRKKLKPPTP WGFTLILEKM HLYLSCTDED 420 

ARQKFGTMPL LPIRGDDSGA TLLSANQTLR 480 

VYBBPVYEBV GAPPELIQDT STSPSTTREW 540 

PPEPPPGPPS KSSPQARGSL SEQIiLQELSS 600 
TQTPGFPTQP PCTSSPPSSQ PLT 



Seq ID NO: 184 DNft sequence 

Nucleic Acid Accession #s none found 

Coding sequence: 1-81 (underlined sequences correspond to start and stop codons) 



l 11 21 

I I I 

GTAGAGTTAG TGTCAATGTG CTTAGAATAT 
GTATTAAGCT TAAAAAGI TA ATTCAGTTTA 
AATCTCAACA TAAGAAGTCA AAATGTAATG 
CTTTCTCTAT AATTTCATCA QTATGTCCTC 
AACCCTAACT CTGCTAATTA TAAGCTAQGC 
CACTCCACCA QCTTTGTTAT CTGTAAAATG 
ACTOAAGATG AGAGAACATG ATATGTGTAA 
AACATGTAAT GAATGTAGTA ATAGTAATTA 
TGTTCAGCTG TAACAGAATA CCCAAAATAA 
AAGTTTTGTT ACGAATTCAG ACAATCCAGG 
ACAAAGGCAT CTTTCCTGAT TTCTGCCAGT 
GGATGQCAGT TCCAGCCCTG GTTACGCCCA 
TGTGCCTCTT CACTTTAATC ATAGCTCCCA 
CCATTAGCTA ATGCTTGCTT ACATGGTCAC 
GTCATGTGCT CAATTAATAT CCAAGTGTCC 
TTGCTTGGTT GCATTCTTCT TAGCATAAGC 
ACTTGGATGC CTCAGTTGTC CTTTCATTTA 
TTCTTTTGTC ATCAGCACCA TCACTACCAC 
CAGGATGGTT GCAACAACCA CCATAGGGAC 
AGTAAGCTTT TGAAAATGTA GGTCAGATCA 
TGGCTTTTCA TATTACTCAA AAGAAAACCT 
GCTTATTCTT CCTCTTACTT TATCTCTGTA 
CACCTCCTTG CTGCTTGTCC TATACTCCTA 
GCACTTAAAA TAGAAAAAAA AAAAAAAAAA 
TGCAGGTTAG AAGTCATGGA GCTGGGATCT 
CACCGTTCTA TTCAACCCCA TTGCCTAGAG 
GACACAGTCA OCTCTTTCTG AGAAAAGGCA 
TTTTGCAGAA GAAAAC 



31 41 51 

I I I 

ACCAAATTCA TAAACATTTT CTCTAAAAAA 60 

AGGAATATAA ACCAAATTAT TTTATATTTG 120 

CTGCCAGATA ACAATATCAA AGGTATTTTT 180 

TCCCTTTTCT CCTATTTGTC AAATTTTAGC 240 

AAGTAATCTT GGACAAGTTA TTTGACCTCT 300 

ATGATAATAC CAACACCTTC TTCTTGGGGT 360 

AGTGCCTTCC ACAATACCCA GAACATAGCA 420 

TTTTATTTTC TTTTGATTCA GTTGGGACTA 480 

CAGTTTTAAA CAAATTAAAG TTTTGTTGTG 540 

GCTTTTATAG ATGCACCAGO ATCAGCAGGT 600 

CTCAATGCAT GGGTTGCAAT CCAGAGTCCA 660 

TATTAGCACA CAGAAAGAAA GAGAAAGGGA 720 

CTAGATGCAC CCACTACTTC TGCTGATACT 780 

ACTTAGTTTC CAGAGAGACA TGTCTGGACA 840 

AATTACTGAG AAAAAAAGAA ACTAGCACCT 900 

CACATTCTTT TTATGAAGTT GTCCTCAGTT 960 

GAAATGCTCC TTGGACATCC TGAA TCTGAC 1020 

TGCCTTCTTC AAAGCCACCA CGTTCTGTCC 1080 

TTTTTGCTTC TACTTCCACA CAATAGCCAG 1140 

TGTCTCTCTC TTCTCTTCAA AACCCTCCGA 1200 

AAAACTTTOC TGTGAGATCT ATGTGACCCG 1260 

TTGCTCTTCC TCACTCTACT CCAGCCATCC 1320 

AAAGAAGTTC AGTCTTCCCT TATGATATTT 1380 

AGCTCAGAGA GGCTGAGTTG TCCAAGGTCA 1440 

AAATCCATGT CAGTCTGACT ATGAGTTCTG 1500 

GTGCTTGATT GCTCAATAAT AGATTCCATG 1560 

GCTCAGCATT TCCATGAGAT CCGCACATCC 1620 



Seq ID Noj 185 Protein sequence; 
Protein Accession #: none found 



1 11 21 31 41 51 

I I I I I I 

VELVSMCLBY TKPINIPSKK VLSLKS 



Seq ID NO: 186 DNA sequence 

Nucleic Acid Accession #t NM_002203.2 

Coding sequence: 43-3588 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CTGCAAACCC AGCGCAACTA CGGTCCCCCO GTCAGACCCA GGATGGGGCC AGAACGGACA 60 
GGGGCCGCGC CGCTGCCGCT GCTGCTGGTG TTAGCGCTCA GTCAAGGCAT TTTAAATTGT 120 



266 



WO 02/079492 



PCT/US02/04915 



TGTTTOGCCT ACAATGTTGG TCTCCCAGAA 
CAGTTTGGGT ATGCAGTGCA GCAGTTTATA 
TCACCCTGGA GTGOCTTTCC TGAGAACCGA 
CTATCCACTG CCACATGTGA AAAACTAAAT 
5 ACTGAGATGA AAACCAACAT GAGCCTCGGC 
GGTTTTCTCA CATGTGGTCC TCTGTGGGCA 
GGTGTGTGTT CTGACATCAG TCCTGATTTT 
CAGCCCTGCC CTTCCCTCAT AGATGTTGTG 
CCTTGGGATG CAGTAAAGAA TTTTTTQGAA 

10 ACAAAGACAC AGOTGOOGTT AATTCAGTAT 
AACACATATA AAACCAAAGA AQAAATQATT 
GGGGACCTCA CAAACACATT OGGAGCAATT 
GCTTCTGGTG GQCOAOGAAO TGCTACGAAA 
CATGATGGTT CAATGTTGAA AGCTGTGATT 

15 TTTGGCATAG CAGTTCTTGO GTACTTAAAC 
AAAGAAATAA AAGCGATCGC TAGTATTCCA 
GAAGCAGCTC TACTAGAAAA GGCTGGGACA 
ACTGTTCAAG GAGGAGACAA CTTTCAGATG 
TACTCTTCTC AAAATGATAT TCTGATGCTG 

20 ACCATTGTCC AGAAQACATC TCATGGCCAT 
ATTCTGCAGG ACAGAAATCA CAGTTCATAT 
GGAGAAAGCA CTCACTTTGT TGCTGGTGCT 
CTATATAGTG TGAATGAGAA TGGCAATATC 
ATTGGCTCCT ATTTTGGTAG TGTGCTGTGT 

25 GACGTGCTCT TGGTAGGTGC ACCAATGTAC 
GTCTACCTGT TTACTATCAA AAAGGGCATT 
GAGGGCATTG AAAACACTCG ATTTGGTTCA 
GATGGCTTTA ATGATGTGAT TGTTGGTTCA 
TACATTTACA ATGGTCATCA GGGCACTATC 

30 TCCGATGGAG CCTTTAGGAG CCATCTCCAG 
GATTTAAATG GGGATTCCAT CACCGATGTG 
CTCTGGTCAC AAAGTATTGC TGATGTAGCT 
ACTTTGGTCA ACAAGAATGC TCAGATAATT 
CCTACTAAGC AAAACAATCA AGTGGCCATT 

35 TTTTCATCCA GAGTAACCTC CAGGGGGTTA 
AAGAATATGG TAGTAAATCA AGCACAGAGT 
CCCTCTGATG TTGTCAACTC TTTGGATTTG 
ACTAGCCCTG CCCTTGAAGC CTATTCTGAG 
AAAGACTGTO GTGAGGATGG ACTTTGCATT 

40 CCAGCTGCTC AAGAACAACC CTTTATTGTC 
GTAACACTGA AAAATAAAAG GGAAAGTGCA 
GAAAACTTGT TTTTTGCATC ATTCTCCCTA 
GTGGCTGCAT CTCAGAAGTC TGTTGCCTGC 
CAACAGGTGA CTTTTACTAT TAACTTTGAC 

45 TCTCTCAGTT TCCAAGCCTT AAGTGAAAGC 
AACCTCAAAA TTCCTCTCCT GTATGATGCT 
AATTTTTATG AAATCTCTTC GGATGGGAAT 
GTTGGTCCAA AATTCATCTT CTCCCTGAAG 
GCAACTGTAA TCATCCACAT CCCTCAGTAT 

50 ACTGGGGTGC AAACAGACAA GGCTGGTGAC 
AAAATAGGAC AAACATCTTC TTCTGTATCT 
GAATTGAACT GCAGAACTGC TTCCTGTAGT 
ATGAAAGGAG AATACTTTGT TAATGTGACT 
TCAACGTTCC AGACAGTACA GCTAACGGCA 

55 ATATATGTGA TTGAAGATAA CACTGTTACG 
AAAGCCGAAG TACCAACAGG AGTTATAATA 
TTAGCTCTGG TTGCAATTTT ATGGAAGCTC 
ACCAAAAATC CAGATGAGAT TGATGAGACC 
ACCTGCAGTG GGAACCGGCA GCATCCCAGC 

60 TTTTTAAATC CCATATTTTT TTTATCATGT 
AAAACTGCAG GTCAGTTTGG ATGAAGAAAT 
GTAGGGAAAT AATAGGGAAA ATACCTATTT 
AACTGGCTGG CCCAGAGTTT ACATTCTAAT 
CAAGCATGAC AACTTTTAAA GAAAAATATG 

65 TCTCTTTAAA ATATTTGTCT TTAAACAGCA 
GTACTTCCAC TTGTGTATAT TTTAATGAAT 
ACAGGTTTTT TCAATTTATG CTGCTCATCC 
TAATTTTATT TATAAACTAG GTAAAATTTG 
CTTCCACACC CCATCTTGCT CTAATGATCA 

70 TACCTCCTAT ATGTCCATTT AAGTTAGGAG 
TTTTGTTTAA AACTCAGAAT ATAACATTTA 
. TGTGCCAGAG GAAGGAAAAG GAGGAAATTT 
CTTCTAGGAT TTGTTTGGCT GACTGGCAGT 
TTCTTTGGCA ACCTTCCTCC TCCCTTACTG 

75 TATTATAGAA GCCCTCTACA GCCTGACTTT 
TTACCCCTCA TCCAAAGTTC CCACTCCTTC 



GCAAAAATAT TTTCOSGTCC TTCAAGTGAA 180 

AATCCAAAAG GCAACTGGTT AJCTGGTTGGT 240 

ATGGGAGATG TGTATAAATG TCCTGTTGAC 300 

TTGCAAACTT CAACAAGCAT TCCAAATGTT 360 

TTGATCCTCA CCAGGAACAT GGGAACTGGA 420 

CAGCAATGTG GGAATCAGTA TTACACAACG 480 

CAGCXCTCAG CCAGCTTCTC ACCTGCAACT 540 

OTTOTST GTQ ATGAATCAAA TAGTATTTAT 600 

AAATTTGTAC AAGGCCTTGA TATAGGCCCC 660 

GCCAATAATC CAAGAGTTGT GTTTAACTTG 720 

GTAGCAACAT CCCAGACATC CCAATATGGT 780 

CAATATGCAA GAAAATATGC CTATTCAGCA 840 

GTAATGGTAG TTGTAACTGA CGGTGAATCA 900 

GATCAATGCA ACCATGACAA TATACTGAGG 960 

AGAAACGCCC TTGATACTAA AAATTTAATA 1020 

ACAGAAAGAT ACTTTTTCAA TGTGTCTGAT 1080 

TTAGGAGAAC AAATTTTCAG CATTGAAGGT 1140 

GAAATGTCAC AAGTGGGATT CAGTGCAGAT 1200 

GGTGCAGTGG GAGCTTTTGG CTGGAGTGGG 1260 

TTGATCTTTC CTAAACAAGC CTTTGACCAA 1320 

TTAGGTTACT CTGTQGCTGC AATTTCTACT 1380 

CCTCGGGCAA ATTATACCGG CCAGATAGTG 1440 

ACGGTTATTC AGGCTCACCG AGGTGACCAG 1500 

TCAGTTGATG TGGATAAAGA CACCATTACA 1560 

ATGAGTGACC TAAAGAAAGA GGAAGGAAGA 1620 

TTGGGTCAGC ACCAATTTCT TGAAGGCCCC 1680 

GCAATTGCAG CTCTTTCAGA CATCAACATG 1740 

CCACTAGAAA ATCAGAATTC TGGAGCTGTA 1800 

CGCACAAAGT ATTCCCAGAA AATCTTGGGA 1860 

TACTTTGGGA GGTCCTTGGA TGGCTATGGA 1920 

TCTATTGGTG CCTTTGGACA A3TGGTTCAA 1980 

ATAGAAGCTT CATTCACACC AGAAAAAATC 2040 

CTCAAACTCT GCTTCAGTGC AAAGTTCAGA 2100 

GTATATAACA TCACACTTGA TGCAGATGGA 2160 

TTTAAAGAAA ACAATGAAAG GTGCCTGCAG 2220 

TGCCCCGAGC ACATCATTTA TATACAGGAG 2280 

OGTGTGGACA TCAGTCTGGA AAACCCTGGC 2340 

ACTGCCAAGG TCTTCAGTAT TCCTTTCCAC 2400 

TCTGATCTAG TCCTAGATGT CCGACAAATA 2460 

AGCAACCAAA ACAAAAGGTT AACATTTTCA 2520 

TACAACACTG GAATTGTTGT TGATTTTTCA 2580 

CCGQTTGATG GGACAGAAGT AACATGCCAG 2640 

GATGTAGGCT ACCCTGCTTT AAAGAGAGAA 2700 

TTCAATCTTC AAAACCTTCA GAATCAGGCG 2760 

CAAGAAGAAA ACAAGGCTGA TAATTTGGTC 2820 

GAAATTCACT TAACAAGATC TACCAACATA 2880 

GTTCCTTCAA TCGTGCACAG TTTTGAAGAT 2940 

GTAACAACAG GAAGTGTTCC AGTAAGCATG 3000 

ACCAAAGAAA AGAACCCACT GATGTACCTA 3060 

ATCAGTTGTA ATGCAGATAT CAATCCACTG 3120 

TTCAAAAGTG AAAATTTCAG GCACACCAAA 3180 

AATGTTACCT GCTGGTTGAA AGACGTTCAC 3240 

ACCAGAATTT GGAACGGGAC TTTCGCATCA 3300 

GCTGCAGAAA TCAACACCTA TAACCCTGAG 3360 

ATTCCCCTGA TGATAATGAA ACCTGATGAG 3420 

GGAAGTATAA TTGCTGGAAT C CTTTTQ CT0 3480 

GGCTTCTTCA AAAGAAAATA TGAAAAGATG 3540 

ACAGAGCTCA GTAGCTGAAC CAGCAGACCT 3600 

CAGGGTTTGC TGTTTGCGTG CATGGATTTC 3660 

CGTAGGTAAA CTAACCTGGT ATTTTAAGAG 3720 

TGTGGGGGGT GGGGGAGGTG CGGGGGGCAG 3780 

TATATGATGG GGGAAAAAAA GTAATCTTTA 3840 

TTGCATTGTG TCAGAAACAT GAAATGCTTC 3900 

ATACTCTCAG ATTTTAAGGG GGAAAACTGT 3960 

ACTACAGAAG TGGAAGTGCT TGATATGTAA 4020 

ATTGATGTTA ACAAGAGGGG AAAACAAAAC 4060 

AAAGTTGCCA CAGATGATAC TTCCAAGTGA 4140 

TTGTTGGTTC CTTTTATACC ACGGCTGCCC 4200 

AAACATGCTT GAATAACTGA GCTTAGAGTA 4260 

AGGGGGCGAT ATAGAGACTA AGGCACAAAA 4320 

TGTAAAATCC CATCIGCTAG AAGCCCATCC 4380 

CCTTTCTCTT TTAGGAGGCA CAACAGTTCT 4440 

AACCTAGTGA ATTTTTGAAA GATGAGTAAT 4500 

AACCACTCTC CCACCTCCTG GTGGTACCAT 4560 

CTCTCCAGCG GTCCAAAGTT ATCCCCTCCT 4620 

AGGACAGCTG CTCTGCATTA GATATTAGGG 4680 
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GGGAAAGTCA TCTOTTTAAT TTACACACTT GCATGAATTA CTGTATATAA ACTCCTTAAC 4740 

TTCAGGGAGC TATTTTCATT TAGTGCTAAA CAAGTAAGAA AAATAAGCTA GAGTGAATTT 4800 

CTAAATGTTQ GAATOTTATG GGATGTAAAC AATGTAAAGT AAAACACTCT CAGGATTTCA 4860 

CCAGAAGTTA CAGATGAGGC ACTGGAAACC ACCACCAAAT TAGCAGGTOC ACCTTCTGTO 4920 

5 GCTGTCTTGT TTCTGAAGTA CTTTTTCTTC CACAAGAOTG AATTTGACCT AGGCAAGTTT 4980 

GTTCAAAAGG TAGATCCTGA OATGATTTGQ TCAGATTGGQ ATAAGGCCCA OCAATCTGCA 5040 

TTTTAACAAG CACCCCAGTC ACTAGGATGC AGATGGACCA CACTTTGAGA AACACCACCC 5100 

ATTTCTACTT TTTGCACCTT ATTTTCTCTG TTCCTGAGCC CCCACATTCT CTAGGAGAAA 5160 

CTTAGATTAA AATTCACAGA CACTACATAT CTAAAGCTTT GACAAGTCCT TGACCTCTAT 5220 

10 AAACTTCAGA GTCCTCATTA TAAAATGGGA AGACTGAGCT GGAGTTCAGC AGTGATGCTT 5280 

TTTAGTTTTA AAAGTCTATG ATCTQATCTG GACTTCCTAT AATACAAATA CACAATCCTC 5340 
CAAGAATTTG ACTTGGAAAA G 

Seq ID NOi 187 Proteia sequence: 
15 Protein Accession if: HPJ>02194.1 

1 11 21 31 41 51 

I 1 I I I I 

MGPERTGAAP LPLLLVLALS QGILNCGLAY NVGLPBAKIP SGPSSEQFGY AVQQFINFKG 60 

20 NWLLVGSPW8 GFPENRMGOV YKCPVDLSTA TCEKLNLQTS TSIPNVTSMK TNMSLGLILT 120 

• RNMGTGGPLT CGPLWAQQCO NQYYTTQVCS DISPDFQLSA SPSPATQPCP SLIDWWCD 180 

ESNSIYPWDA VKNFLEKFVQ GLDIGPTKTQ VGLIQYANNP RWFNLNTYK TKKKMIVATS 240 

QTSQYGGDLT KTFGAIQYAR KYAYSAASGG RRSATKVMW VTDGBSHDGS MLKAVIDOCN 300 

HDNILRFGIA VLGYLNRNAL DTKNLIKEIK AIASIPTERY FFNVSDEAAL LEXAGTLGEQ 360 

25 IFSIBGTVQG GDNFQMKMSQ VGFSADYSSQ NDILJKLGAVG AFGWSGTIVQ KTSHGHLIFP 420 

KQAFDQILQD RNHSSYLGYS VAAISTGEST HFVAGAPRAU YTGQXVLYSV NBNGNIT7IQ 480 

AHRGDQIGSY FGSVLCSVDV DKDTITDVLL VGAPKYMSDL KKEEGRVYLF TIKKGILGQH 540 

QPLEGPEGIE NTRFGBAIAA LSDINMDGFN DVIVGSPLEN QNSGAVYTYN GHQGTIRTKY 600 

SQKILGSDOA FRSHLQYFGR SLDGYGDLNG DSITDVSIGA FGGWQLWSQ SIADVAIKAS 660 

30 FTPEKITLVN KNAQIILKLC FSAKFRPTKQ NNQVAIVYNI TLDADGFSSR VTSRGLFKKN 720 

NERCLQKNMV VNQAQSCPBH IIYIQEPSDV VNSLOLRVDI 8LENPGTSPA LEAYSSTAKV 780 

FSIPFHKDCG EDGLCISDLV LDVRQIPAAQ BQPFIVSNQN KRLTFSVTLK NKRESAYNTG 840 

IWDFSBNLF FASFSLPVDG TEVTCGVAAS QKSVACDVGY PALKREQQVT FTINFDFNLQ 900 

NLQNQASLSF QAI*SESQEEN KADMLVNLKI PLLYDAEIHL TRSTNINFYE ISSDGNVPSI 960 

35 VHSFEDVGPK PIFSLKVTTG SVPVSMATVI IHIPQYTKEK NPLMYLTGVQ TDKAGDISCN 1020 

ADINPLKIGQ TSSSVSPKSE NPRHTKEUIC RTASCSNVTC WLKDVHMKGE YFVNVTTRIW 1080 

NGTFASSTFQ TVQLTAAAEI NTYNPBIYVI EDNTVTIPLM IMKPDEKAEV PTGVIIGSII 1140 
AGILLLLALV AILWKLGFFK RKYEKMTKNP DSIDBTTELS S 

40 



Seq ID NO: 188 
Nucleic Acid Accession #: NM_00221G.l 

Coding sequence: 42-3188 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

45 | | | | | | 

GGCTACCGCT CCCGGCTTGG CGTCCOGCGC GCACTTCGGC GATGGCTTTT COGCCGOGGC 60 

GACGGCTGCG CCTCGGTCCC CGCGGCCTCC CGCTTCTTCT CTCGGGACTC CTGCTACCTC 120 

TGTGCCGCGC CTTCAACCTA GACGTGGACA GTCCTGCCGA GTACTCTGGC CCCGAGGGAA 180 

GTTACTTCGG CTTCGCCGTG GATTTCTTCG TGCCCAGCGC GTCTTCCOGG ATGTTTCTTC 240 

50 TCGTGGGAGC TCCCAAAGCA AACACCACCC AGCCTGGGAT TGTGGAAGGA GGGCAGGTCC 300 

TCAAATGTOA CTGGTCTTCT ACCCGCCGGT GCCAGCCAAT TGAATTTGAT GCAACAGGCA 360 

ATAGAGATTA TGCCAAGGAT GATCCATTGG AATTTAAGTC CCATCAGTGG TTTGGAGCAT 420 

CTGTGAGGTC GAAACAGGAT AAAATTTTGG CCTGTGCCCC ATTGTACCAT TGGAGAACTG 480 

AGATGAAACA GGAGCGAGAG CCTGTTGGAA CATGCTTTCT TCAAGATGGA ACAAAGACTG 540 

55 TTGAGTATGC TCCATGTAGA TCACAAGATA TTGATGCTGA TGGACAGGGA TTTTGTCAAG 600 

GAGGATTCAG CATTGATTTT ACTAAAGCTG ACAGAGTACT TCTTGGTGGT CCTGGTAGCT 660 

TTTATTGGCA AGGTCAGCTT ATTTCGGATC AAGTGGCAGA AATCGTATCT AAATACGACC 720 

CCAATGTTTA CAGCATCAAG TATAATAACC AATTAGCAAC TCGGACTGCA CAAGCTATTT 780 

TTGATGACAG CTATTTGGGT TATTCTGTGG CTGTCGGAGA TTTCAATGGT GATGGCATAG 840 

60 ATGACTTTGT TTCAGGAGTT CCAAGAGCAG CAAGGACTTT GGGAATGGTT TATATTTATG 900 

ATGGGAAGAA CATGTCCTCC TTATACAATT TTACTGGCGA GCAGATGGCT GCATATTTCG 960 

GATTTTCTGT AGCTGCCACT GACATTAATG GAGATGATTA TGCAGATGTG TTTATTGGAG 1020 

CACCTCTCTT CATGGATCGT GGCTCTGATG GCAAACTCCA AGAGGTGGGG CAGGTCTCAG 1080 

TGTCTCTACA GAGAGCTTCA GGAGACTTCC AGACGACAAA GCTGAATGGA TTTGAGGTCT 1140 

65 TTGCACGGTT TGGCAGTGCC ATAGCTCCTT TGGGAGATCT GGACCAGGAT GGTTTCAATG 1200 

ATATTGCAAT TGCTGCTCCA TATGGGGGTG AAGATAAAAA AGGAATTGTT TATATCTTCA 1260 

ATGGAAGATC AACAGGCTTG AACGCAGTCC CATCTCAAAT CCTTGAAGGG CAGTGGGCTG 1320 

CTCGAAGCAT GCCACCAAGC TTTGGCTATT CAATGAAAGG AGCCACAGAT ATAGACAAAA 1380 

ATGGATATCC AGACTTAATT GTAGGAGCTT TTGGTGTAGA TCGAGCTATC TTATACAGGG 1440 

70 CCAGACCAGT TATCACTGTA AATGCTGGTC TTGAAGTGTA CCCTAGCATT TTAAATCAAG 1500 

ACAATAAAAC CTGCTCACTG CCTGGAACAG CTCTCAAAGT TTCCTGTTTT AATGTTAGGT 1560 

TCTGCTTAAA GGCAGATGGC AAAGGAGTAC TTCCCAGGAA ACTTAATTTC CAGGTGGAAC 1620 

TTCTTTTGGA TAAACTCAAG CAAAAGGGAG CAATTCGACG AGCACTGTTT CTCTACAGCA 1680 

GGTCCCCAAG TCACTCCAAG AACATGACTA TTTCAAGGGG GGGACTGATG CAGTGTGAGG 1740 

75 AATTGATAGC GTATCTGCGG GATGAATCTG AATTTAGAGA CAAACTCACT CCAATTACTA 1800 

TTTTTATGGA ATATCGGTTG GATTATAGAA CAGCTGCTGA TACAACAGGC TTGCAACCCA I860 
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TTCTTAACCA GTTCACGCCT GCTAACATTA GTCGACAGGC TCACATTCTA CTTGACTGTG 1920 

GTGAAGACAA TGTCTGTAAA CCCAAGCTGO AAGTTTCTGT AOATAOTGAT CAAAAGAAGA 1960 

TCTATATTCG GGATGACAAC CCTCTOACAT TGATTGTTAA GGCTCAGAAT CAAGGAGAAG 2040 

GTGCCTACGA AGCTGAGCTC ATCGTTTCCA TTCCACTGCA QGCTGATTTC ATCGGGGTTO 2100 

5 TCCGAAACAA TGAAGCCTTA aCAAOACTTT CCTGTGCATT TAAGACAGAA AACCAAACTC 2160 

GCCAGGTGGT ATGTGACCTT GGAAACCCAA TGAAGGCTGG AACTCAACTC TTAOCTQOTC 2220 

TTOGTTTCAQ TGTQCACCAO CAOTCAGAGA TGGATACTTC TGTGAAATTT QACTTACAAA 2280 

TCCAAAGCTC AAATCIATTT GACAAAGTAA GCCCAGTTOT ATCTCACAAA GTTGATCTTO 2340 

CTQTTTTAGC TGCAGTTGAG ATAAGAGGAG TCTCGAGTCC TGATCATATC TTTCTTCOGA 2400 

10 TTCCAAACTG GGAGCACAAG GAGAACCCTG AGACTGAAGA AGATGTTGGG CCAGTTGTTC 2460 

AGCACATCTA TGAGCTGAGA AACAATGGTC CAAGTTCATT CAGCAAGGCA ATGCTCCATC 2520 

TTCAGTGGCC TTACAAATAT AATAATAACA CTCTGTTOTA TATCCTTCAT TATGATATTG 2 580 

ATGGACCAAT GAACTGCACT TCAGATATGG AGATCAACCC TTTGAQAATT AAGATCTCAT 2640 

CTTTGCAAAC AACTGAAAAG AATGACACGG TTGCOGGGCA AGGTGAOCGG GACCATCTCA 2700 

15 TCACTAAGCG GGATCTTGCC CTCAGTGAAG GAGATATTCA CACTTTGGGT TGTGGAGTTG 2760 

CTCAGTGCTT GAAGATTGTC TGCCAAGTTG GGAGATTAGA CAGAGGAAAG AGTGCAATCT 2820 

TGTACGTAAA OTCATTACTG TGGACTGAGA CTTTTATGAA TAAAGAAAAT CAGAATCATT 2880 

CCTATTCTCT GAAGTCGTCT GCTTCATTTA ATGTCATAGA GTTTCCTTAT AAGAATCTTC 2940 

CAATTGAGGA TATCACCAAC TCCACATTGG TTACCACTAA TOTCACCTGG GGCATTCAGC 3000 

20 CAGCGCCCAT GCCTGTGOCT GTGTGGGTGA TCATTTTAGC AGTTCTAGCA GGATTGTTGC 3060 

TACTGGCTGT TTTGGTATTT GTAATGTACA GQATGGGCTT TTTTAAACGG GTCCOGCCAC 3120 

CTCAAGAAGA ACAAGAAAGG GAGCAGCTTC AACCTCATGA AAATGGTGAA GGAAACTCAO 3180 

AAACTTAACT GCAGTTTTTA AGTTATGCTA CATCTTGAOC CACTAGAATT AGCAACTTTA 3240 

TTATAGATTT AAACTTTCTT CATGAGGAGT AAAAATCCAA GGCTTTACTG CTGATAGTGC 3300 

25 TAATTGGCAT TAACCACAAA ATGAGAATTA TATTTGTCAA CCTTCTCCTT ATAAATAAGT 3360 

TCAGACATAC ATTTAATAAC ATAGGGTGAC TTOTGTTTTT AGGTATTTAA ATAAT AAAA T 3420 

TTCAAGGGAT AGTTTTTATT CAATGTATAT AAGACAGGTA GTGCCTGATT TACTACTTTA 3480 

TATAAAATAG TACCTCCTTC AGTTACTGTT TCTGATTTAA TGTACGGAAC TTTATTTGTT 3540 

QTTGTTGTTG TTGTTGTTGT TGTTGTTTTA AAGCAGTCCA AATTTGGACC TTAGCAATCA 3600 

30 TGTCTTTTGT ATAGGTACTT AATGTTAATA CATATTACAC TACAGTTTAC TTTTCAGAAT 3660 

ACTAAAGACT TTATAACTGC ATGAACTTGG ATTTTTTTAA TCACTCATAT GGTAGAATTT 3720 

TATAAACACA TACATGATAC CATCCAAATT CTTGCTTTTA ATAACAAAGG TACAATATTT 3780 

TGTTTTAGTA TGAAAATCTG GTAGATCCTA TTACACTTCT GTTTATATTA AATCCACAAT 3840 

ATTTTATTAC ATTTTTAACT TGTATAAATT TTAGGTCAAA TCCTTCAAGC CAACCTATAC 3900 

35 TAAAAATTAG TTCCATAATC ACAAATGGCT CnTTGTGTA ATTGTTTAAT TTCACCTGAA 3960 

TATCATAATG CTTAAAGCCA TATGGAGTTG GAAATTATTT CCAAAGCATA TTTATTCCAT 4020 

TGTTTTAGTC TGGCTATTTA CAGTATAAAA AAAGCATTTT ATTAAAATAC TGTGTAQTTC 4080 

TTTGAGATAG TTGCTTATGC ATATAGTAAG TATTACATTC TTAGAGTAGA GCAGAGTTTT 4140 

TAGTTAGTAT TAATTTATTT TCCTCCATTC ATGTACTTTT CCTTATATTT CCAAAACTGT 4200 

40 TACTGAGAAT GGGTCAAGAT CAGTGAGAAA TCTTTACAGT TGACAGGAAC CTGGACCCCT 4260 

TACCCCAACT TTATGAGTAA TGCTTGGAAT AAAAAACTCT TAAGGCAACT CACTGATTTA 4320 

CTTCTAGCAA TAGCATGATG TTACAGGAAT ATTACCTCTO TTTAAGCAAG GTAATGTGTA 4380 

AAATCAGTCT C5GGCTOTCAG AATAACTTCT AAAAGGTATT TTTATAAGCA GTTCAAGTTA 4440 

CTGAAAACCT TTTAAACCTT TCTGAAGTTC GTTAGTATAA ATTACTTTTC TAGGATTATT 4500 

45 AATAAAAGCC ACATAGGTGG CAAGTTGTAG TTTTATATGG CTCTGTAGAG TGGTGAACCT 4560 

TCTAGAGGAA TATATGATTT ATTCACAGTT CCTCAAGGCC TGGGGATGAT GATCAGTTAT 4620 

ACCTATTTTT GTGCAATTAC ATCATGTTGT ACATTAGAAA TGGAGAGTTT AATAGCTCTT 4660 

TAACTGCTGT CCTCATTAGG TAATGATAAA TATTTCCCTT AAATAATTGA CTATTTTGCT 4740 

GTGTTTTAAA AATGATTGAA ATTTATCTTG CCATATCTCA TAATTTCATG CACAAGTTGA 4800 

50 CTGAGCTAAT CTTGAGAATA TATTCGTAAA AIAGGAGCAC ATTTAGTTGA GGTATACAAG 4860 

GTAGGACTCT AGACAAAACC TTCTATTTTA GCTTTAGTGA ATTTCAAAAG TAATGGGTCT 4920 

TGGAGTATAG ATTTTTATTA GTAGCTTGAA AGAGCTTAAT CATATGCAGT AAGTATTTTT 4980 

ATTACCAATA AATTTAAAAT TTTTTAAGAA AAATATTTTT ATCCTAGGGC CAAGTGTTGC 5040 

CTGCCACCAA TCAGTAAGTT AGTCTATAAC AAATTTTACC CTAACAGTTT TACCACCTAG 5100 

55 CAACAGTCAT TTCTGAAAAT ATGTTGGATA GAAAGTCACT CTTTGGCAAA AGTGTTAGAA 5160 

TTTGCTTTTG TGCCATCTAT TCCTTTTATG GCATCTATCT TGAAAGTAAT CTTGTATTGG 5220 

AGATTGAAAG ATGCTGTAAT TTAGAAATTA ACATGATATC TTAAATTACC TTTATGAAAT 5280 

ATAGTTTTGT ATAATAGCAT AGATTTTCCT TCAAAAAATG AACATTTATA TATCTACAAA 5340 

AATATGGAGA AGAGCAATTT GAAAQCCTAC TTTCTGAAGA AAATGGTGOG ATTTTTTTTT 5400 

60 ATCATGATTA AATATCAAAA AATTGCCCTA TGAAAACTTT AAATCTCTAA AAC ATTTGAA S460 

ATACTACCAT ATTTGTGATT TATTGAGAAT AAAAATCCAT TTTGAAATGT AAAATTTTTA 5520 

TGATCTGATT CAGTTTTAAG AAAACATGAA TGAACTAGAA GATATTAAAA ACATTTGACA 5580 

TTGGTAAGAA ATATTGATAC TGATATTGAT TTTTATATAG GTATTTATTT CAGAATTGAT 5640 

ATTTTGAGAA AAATACATGT GAGTCATTTT TTCTGTTTCT CTTTTCTCTT AACGATTATC 5700 

65 ACTGTAATTC TGAATCT 

Seq ID KO: 189 Protein semience: 
Protein Accession #j NPJ>02201.1 

70 1 11 21 31 41 51 

I I I I I I 

MAFPPRRRLR LGPRGLPLLL SGLLLPbCPA PNLDVDSPAB YSGPEGSYPG PAVDPFVPSA 60 

SSRMPLLVGA PKANTTQPGI VBGGQVLKCD WSSTRRCQPI EPDATGNRDY AKDDPLBFKS 120 

HQWFGASVRS KQDKI LACAP LYHWRTEMKQ EREPVGTCFL QDGTKTVEYA PCRSQDIDAD 180 

75 GQGFCQGGFS IDFTKADRVL LGGPGSFYWQ GQLISDQVAE IVSKYDPNVY SIKYNNQLAT 240 

RTAQAIFDDS YLGYSVAVGD FNGDGIDDFV 8GVPRAARTL GMVYIYDGKN MSSLYNFTGE 300 



269 



WO 02/079492 



PCT/US02/04915 



AATDINGDDY ADVFIGAPLF MDRGSDGKLQ EVGQVSV8&Q RASGDFQTTK 360 

GSAIAPLGDti DQDOPNDIAI AAPYGGEDKX GIVYIFNGRS TGLNAVPSQI 420 

PPSFGYSMKG ATDIDKNGYP DIJVGAFGVD RAILYRARPV ITVNAGIBVY 460 

CSLPGTALKV SCFNVRFCLK ADGKGVLPRX LNFQVBLLLD KLKQKGAIRR 540 

HSKNMTISRG GLMQCEBLIA YLRDESBFRD KLTPITIFME YRLDYRTAAD €00 

FTPANISRQA HXLLDOGEDN VCKFKLBVSV DSDQKKIYIG DDNPLTLIVK 660 

ABLIVSIPLQ ADPIGWKNN EALARLSCAF KTSNQTRQW CDLGNFMKAG 720 

VHQQSEMDTS VKFDLQIQSS NLFDKVSPW SHKVDLAVLA AVEIRGVSSP 760 

EHKKNPBTEE DVQPWQHZY ELRNNGPSSP SKAMLHLQWP YKYNNNTLLY 640 

NCTSDMEIKP LRIKISSLQT TEKNDTVAGQ GBRDHLXTKR DLALSEGDM 500 

KIVCQVORLD ROKSAIIiYVK SIiLWTETFHN KENQNHSYSL KSSASPNVTE 960 

ITNSTLVTTN VTWGIQPAPM PVPVWVIILA VLAOLLLLAV LVFVMYRKQP 1020 
QEREQLQPHE NGEGNSET 

Seq ID NOi 190 pNA, sequence 
Nucleic Acid Accession fti NM_004864 

Coding sequence! 26-9S2 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CGGAACGAGQ GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60 

TCAGATGCTC CTGGTGTTGC TGGTGCTCTC GTGGCTGCCG CATGGGGGCG CCCTGTCTCT 120 

GGCCGAGGCG AGCCGCGCAA GTTTCCCGGG ACCCTCAGAG TTGCACTCOG AAGACTCCAG 180 

ATTCCGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGCGGG CCAACCAGAG 240 

CTGGGAAGAT TCGAACACCG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 300 

AGTGOGGCTG GGATCCGGCG GCCACCTGCA CCTGOGTATC TCTOGGGCCG CCCTTCCCGA 360 

GGGGCTCCCC GAGGCCTCCC GCCTTCACCG GGCTCTGTTC CGGCTGTCCC CGACGGCGTC 420 

AAGGTOGTGG GACGTGACAC GACCGCTGCG GOGTCAGCTC AGCCTTGCAA GACCCCAAGC 480 

GCCCGOGCTG CACCTGCGAC TGTCGCCGCC GCCGTOGCAG TCGGACCAAC TGCTGGCAGA 540 

ATCTTCGTCC GCACGGCCCC AGCTGOAGTT GCACTTGCGG COGCAAGCCG CCAGGGGGCG 600 

CCGCAGAGCG CGTGCGCGCA AOGGGGAOGA CTGTCCGCTC GGGCCCGGGC GTTGCTGCCG 660 

TCTGCACACG GTCCGCGCGT GGCTGGAAGA CCTGGGCTGG GCCGATTGGG TGCTGTCGGC 720 

ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGCGTGCCCG AGCCAGTTCC GGGOGGCAAA 780 

CATGCAOGOG CAGATCAAGA CGAGCCTGCA COGCCTGAAG CCCGACACGG AGCCAGOGCC 840 

CTGCTGCGTG CCOGCCAGCT ACAATCCCAT GGTGCTCATT CAAAAGACCG ACACOGGGGT 900 

GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTCC CACTGCAT AT GAG CAGTCCT 960 

GGTCCTTCCA CTGTGCACCT GCGCGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TCGGGGGCTG GTCTGATGGA 1140 

ACTGTCTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 
AAAA 

Seq ID BO: 191 Proton sequence: 
Protein Accession #: NP_0048S5 

1 11 21 31 41 51 

I I I I I I 

NPGQBLRTVN GSQMLLVLLV LSWLPHGGAL SLABASRASP PGPSBLHSED SRFRELRKRY 60 

EDLLTRLRAN QSWBDSNTDL VPAPAVRILT PEVRI/3SGGH LHLRISRAAL PBGLPEASRL 120 

HRAIiFRLSPT ASRSWDVTRP LRRQLSLARP QAPALHLRIiS PPPSQSDQLL AESSSARPQL 180 

ELKLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 240 

IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYK PMVLIQKTDT GVSLQTYDDL 300 
LAKDCHCI 

Seq ID NO: 192 PNA sequence 

Nucleic Acid Accession #: XM_061731.1 

Coding sequence: 1-567 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

ATGAGAAAAQ GAAATGAGGG AOAQAACACA GAAGAGGGCA GGCTTGCTCA GCTTGCTCAA 60 

AGAAAGTTTC TCAAAGAAGA TGGCATTACA TTGCACATCT CTCTGTGTCT CTCTATTGCT 120 

GTAAAAGAAC CTTTCTCTCT GATTGGACTT GACACACAGA AGGATCTCAG TAAAOATTTG 180 

CTGTTGTTGA TGTCCACAGA CACTGGCAAG GACAGGTTTA CCAACATACT GCTGTCACAC 240 

TCCCCTCCAA TQTGCACCAA ATCACGTAAA AATGGGGATA ATGACTCCCC TGCCTTCACA 300 

TGGGGTGGCA AAGACACCAG GAGCAATACT GATCTTCCTA TCAGAGACCC TGGGGGCAAG 360 

AGTCTTTCAC TCACCAAACA TTCCCACAAG CCTGTCCCTG AGCATCAGTG TGACCAGAGA 420 

GAGGTCTTCC AGCCACTTTC AGAGCCAGGT GTAGAAGCAG AGATGGAAGT GTTCGCTGAT 460 

GCTGGATGGT GGATTTATCA GAGCTGTCAG GTTCCTTCCT CAACCCTTGC AAGAAAGAAG 540 
ATGGTTTATT CTAAAGAAAC TQAGTGA 

Seq ID NO: 193 Prgfeein sequence \ 
Protein Accession #: XPJJ61731.1 

1 11 21 31 41 51 

I I I I I I 



QMAAYPGFSV 
lilGPEVPARF 
LEGQWAARSM 
PSILNQDNKT 
ALPLYSRSPS 
TTGfcQPILNQ 
AQNQGEGAYB 
TQLLAGLRPS 
DHIPLPIPNW 
ILHYDIDGPM 
TLGCGVAQCIi 
PPYKNLPIED 
FKRVRPPQKE 
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KRKGNEGBNT EEGRIAQLAQ RKFLKEDGIT LHISIiCLSIA VKBPFSLIGL DTQKDLSKDL 60 

U^MSTDTGK DRPTNILLSH SPPMCTK8RK NGDNDSPAFT HGGKDTRSNT DLPIRDPGOK 120 

8LSLTKHSHK PVPKHQCDQR BVFQPLSBPO VEABMBWAD AGWWIYQSCQ VPSSTLARKK 180 
^ MVYSKBTB 

Seq ID NO: 194 DMA. Sequence 

Nucleic Acid Accession #: NM_O0S415.2 

Coding sequence: 371-2410 (underlined sequences correspond to start and stop codons) 

10 1 11 21 31 41 51 

I I I I I I 

GAGCTGTCCC CGGTGCGGCC GACCOGGGCC GTGC0GT6TQ CCCGTGGCTC CAQCCGCTGC 60 

CGCCTCGATC TCCTCGTCTC CCOCTCCGCC CTCCCTTTTC CCTGQATGAA CTTGCGTCCT 120 

TTCTCTTCTC CGCCATOGAA TTCTGCTCOO TGCTTTTAGC CCTCCTOAGC CAAAGAAACC 180 

15 CCAGACAACA GATGCCCATA GGCAGCGTAT AOCAGTAACT CCCCAGCTOQ GTTTCTOTGC 240 

CGTAGTTTAC AQTATTTAAT TTTATATAAT ATATATTATT TATTATAGCA TTTTTGATAC 300 

CTCATATTCT GTTTACACAT CTTOAAAQGC GCTCAGTAGT TCTCTTACTA AACAACCACT 360 

ACTCCAGAGA ATGGCAACGC TOATTACCAG TACTACAGCT GCTACCOCCQ CTTCTGGTCC 420* 

TTTGGTGGAC TACCTATGGA TGCTCATCCT GGGCTTCATT ATTGCATTTG TCTTGGCATT 480 

20 CTCCGTGOGA 6CCAATGAT0 TAGCAAATTC TTTTGGTACA OCTQTGGOCT CAGGTGTAQT 540 

GACCCTQAAG CAAGCCTOCA TCCTAGCTAG CATCTTTQAA ACAGTGOGCT CTOTCTTACT 600 

GGGQGCCAAA OTGAGCGAAA CCATCCGGAA GGGCTTGATT GACGTGGAQA TOTACAACTC 660 

GACTCAAGGQ CTACTGATGG CCGQCTCAGT CAOTGCTATO TTTQGTTCTG CTGTCTGGCA 720 

ACTCGTGGCT TCGTTTTTGA AGCTCCCTAT TTCTGGAACC CATTSTATTX3 TTGGTGCAAC 780 

25 TATTGGTTTC TCCCTCOTOG CAAAGGOGCA GGAGGGTGTC AAGTGGTCTG AACTOATAAA 840 

AATTGTGATG TCTTGCTTCG TOTCCCCACT GCTTTCTGGA ATTATGTCTG GAATTTTATT 900 

CTTCCTGGTT CGTGCATTCA TCCTCCATAA GGCAGATCCA GTTCCTAATG GTTTGCGAGC 960 

TTTGCCAGTT TTCTATGCCT GCACAGTTGG AATAAACCTC TTTTCCATCA TGTATACTGG 1020 

AGCACCGTTG CTGGGCTTTG ACAAACTTCC TCTGTGGGGT ACCATCCTCA TCTCGGTGGG 1080 

30 ATGTGCAGTT TTCTGTGCCC TTATCGTCTG GTTCTTTGTA TGTCCCAGGA TGAAGAGAAA 1140 

AATTGAACGA GAAATAAAGT GTAGTCCTTC TGAAAGCCCC TTAATGGAAA AAAAGAATAG 1200 

CTTGAAAGAA GACCATGAAG AAACAAAGTT GTCTGTTGGT GATATTGAAA ACAAGCATCC 1260 

TGTTTCTGAG GTAGGGCCT3 CCACTGTGCC CCTCCAGGCT GTGGTGGAGG AGAGAACAGT 1320 

CTCATTCAAA CTTGGAGATT TGGAGGAAGC TCCAGAGAGA GAGAGGCTTC CCAGOGTGGA 1380 

35 CTTGAAAGAG GAAACCAGCA TAGATAGCAC CGTGAATGGT GCAGTGCAGT TGCCTAATGG 1440 

GAACCTTGTC CAGTTCAGTC AAGCCGTCAG CAACCAAATA AACTCCAGTG GCGACTCCCA 1500 

GTATCACACC GTGCATAAGG ATTCCGGCCT GTACAAAGAG CTACTCCATA AATTACATCT 1560 

TGCCAAGGTG GGAGATTGCA TGGGAGACTC CGGTGACAAA CCCTTAAGGC GCAATAATAG 1620 

CTATACTTCC TATACCATGG CAATATGTGG CATGCCTCTG GATTCATTCC GTGCCAAAGA 1680 

40 AGGTGAACAG AAGGGCGAAG AAATGGAGAA GCTGACATGG CCTAATGCAG ACTCCAAQAA 1740 

GCGAATTCGA ATGGACAGTT ACACCAOTTA CTGCAATGCT GTGTCTGACC TTCACTCAGC 1800 

ATCTGAGATA GACATGAOTG TCAAGGCAGC GATGGGTCTA GGTGACAGAA AAGGAAGTAA 1860 

TGGCTCTCTA GAAGAATGGT ATGACCAGGA TAAGCCTGAA GTCTCTCTCC TCTTCCAGTT 1920 

CCTGCAGATC CTTACAGCCT GCTTTGGGTC ATTCGCCCAT GGTGGCAATG ACGTAAGCAA 1980 

45 TGCCATTGGG CCTCTGGTTG CTTTATATTT GGTTTATGAC ACAGGAGATG TTTCTTCAAA 2040 

AGTGGCAACA CCAATATGGC TTCTACTCTA TGGTGGTGTT GGTATCTGTG TTGGTCTGTG 2100 

GGTTTGGGGA AGAAGAGTTA TCCAGACCAT GOGGAAGGAT CTGACACOGA TCACACCCTC 2160 

TAGTGGCTTC AGTATTGAAC TGOCATCT3C CCTCACTGTG GTGATTGCAT CAAATATTGG 2220 

CCTTCCCATC AGTACAACAC ATTGTAAAGT GGGCTCTGTT GTGTCTGTTG GCTGGCTCCG 2280 

50 GTCCAAGAAG GCTQTTGACT GGCGTCTCTT TCGTAACATT TTTATGGOCT GGTTTGTCAC 2340 

AGTCCCCATT TCTGGAGTTA TCAOTGCTGC CATCATGGCA ATCTTCAGAT ATGTCATCCT 2400 

CAGAATGISA AGCTGTTTGA GATTAAAATT TGTGTCAATG TTTGGGACCA TCTTAGGTAT 2460 

TCCTGCTCCC CTGAAGAATG ATTACAGTGT TAACAGAAGA CTGACAAGAG TCTTTTTATT 2520 

TGGGAGCAGA GGAGGGAAGT GTTACTTGTG CTATAACTGC TTTTGTGCTA AATATGAATT 2580 

55 GTCTCAAAAT TAGCTGTGTA AAATAGCCCG GGTTCCACTG GCTCCTGCTO AGGTCCCCTT 2640 

TCCTTCTGGG CTGTGAATTC CTGTACATAT TTCTCTACTT TTTGTATCAG GCTTCAATTC 2700 

CATTATOTTT TAATGTTGTC TCTGAAGATG ACTTOTGATT TTTTTTTCTT TTTTTTAAAC 2760 

CATGAAGAGC CGTTTGACAG AGCATGCTCT GCGTTGTTGG TTTCACCAGC TTCTGCCCTC 2820 

ACATGCACAG GGATTTAACA ACAAAAATAT AACTACAACT TCCCTTGTAG TCTCTTATAT 2880 

60 AAGTAGAGTC CTTGGTACTC TGCCCTCCTG TCAGTAGTGG CAGGATCTAT TGGCATATTC 2940 

GGGAGCTTCT TAGAGGGATG AGGTTCTTTG AACACAGTGA AAATTTAAAT TAGTAACTTT 3000 

TTTGCAAGCA GTTTATTOAC TGTTATTGCT AAGAAGAAGT AAGAAAGAAA AAGCCTGTTG 3060 

GCAATCTTGG TTATTTCTTT AAGATTTCTG GCAGTGTGGG ATGGATGAAT GAAGTGGAAT 3120 

GTGAACTTTG GGCAAGTTAA ATGGGACAGC CTTCCATGTT CATTTGTCTA CCTCTTAACT 3180 

65 GAATAAAAAA GCCTACAGTT TTTAGAAAAA ACCCGAATTC 

Seq ID NO; 19S Profcej-Tl sequence t 
Protein Accession tf: NP_005406.2 

70 1 11 21 

I I I 

KATLITSTTA ATAASGPLVD YLWMLILGFI 
QACILASIPB TVGSVLLGAK VSBTIRKGLI 
SPLKLPISGT HCIVGATIGF SLVAKGQEGV 
75 . RAPILHKADP VPNGLRALPV FYACTVGINL 
FCALIVWPFV CPRMKRKIER BIKCSPSESP 
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31 41 51 

I I I 

IAFVLAFSVG ANDVANSPGT AVGSGWTLK 60 

DVEMYNSTQG LLMAGSVSAM FGSAVWQLVA 120 

KWSELIKIVM SWPV3PLLSG IMSGILFPLV 180 

PSIKYTGAPL LGPDKLPLWG TILISVGCAV 240 

LMEKKNSIiKB DH8ETKLSVG DIENKHPVSB 300 



WO 02/079492 



PCMJS02/04915 



VGPATVPLQA WBEKTVSFK LGDLBEAPBR BRLPSVDLKB ETSIDSTVKG AVQLPNGNLV 360 

QPSQAVSNQI N8SGHSQYHT VHKDSGLYKE IdtfKLHLAKV GDCMGDSGDK PLRRKNSYTS 420 

YTNAICGHPL DSFRAKEGEQ KGEEMBKLTW PKADSKKRIR KDSYTSYCNA VSDLHSASEI 480 

DMSVKAAKGL GDRKGSNGSL EEWYDQDKPE VSLLFQPLQI LTACFGSFAH GGNDVSNAIO 540 

PLVALYLVYD TGDVSSKVAT PIWLLLYGGV GICVGLWVWG RRVTQTMGKD LTPITPSSGF 600 

SIB1ASALTV VIASNIGIiPI STTHCKVGSV VSVGWLRSKK AVDWRLPRNI FMAWFVTVPI 660 
SGVISAAXMA IPRYVILiRM 



Seq ZD NO: 196 PNA aggtiCBlgfl 

Nucleic Acid Accession »t NM_0 00 020.1 

Coding sequence! 283-1794 {underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

AGGAAACGGT TTATTAGGAG GGAQTGGTGG AQCTGGGCCA GGCAGGAAGA CGCTGGAATA 60 

AGAAACATTT TTCCTCCAGC CCCCATCCCA GTCCCGGGAG GCTGCCGCGC CAGCTGCGCC 120 

GAGCGAGCCC CTCCCCGGCT CCAGCCCGGT CCGGGGCCGC GCCGGACCCC AGCCCGCCGT 180 

CCAGCGCTGG CGGTGCAACT GCGGCCGCGC GGTGGAGGGG AGGTGGCCCC GGTCCGCCGA 240 

AGGCTAGCGC CCCGCCACCC GCAGAGCGGG CCCAGAGGGA CCATGACCTT GGGCTCCCCC 300 

AGQAAAGGCC TTCTGATGCT GCTGATGGCC TTGGTGACCC AGGGAGACCC TGTGAAGCCG 360 

TCTCGGGGCC CGCTGGTGAC CTGCACGTGT OAGAGCCCAC ATTOCAAGGG GCCTACCTGC 420 

CGQGGGGCCT GGTGCACAGT AGTGCTGGTG CGGGAGGAGG GGAGGCACCC CCAGGAACAT 480 

CGGGGCTGCG GGAACTTGCA CAGGGAGCTC TGCAGGGGGC GCCCCACCGA GTTCGTCAAC 540 

CACTACTGCT GCGACAGCCA CCTCTGCAAC CACAACGTGT CCCTGGTGCT GGAGGCCACC 600 

CAACCTCCTT CGGAGCAGCC GGGAACAGAT GGCCAGCTGG CCCTGATCCT GGGCCCCGTG 660 

CTGGCCTTGC TGGCCCTGGT GGCCCTGGGT GTCCTGGGCC TGTGGCATGT CCGACGOAGG 720 

CAGGAGAAGC AGCGTGGCCT GCACAGCGAG CTGGGAGAGT CCAGTCTCAT CCTGAAAGCA 780 

TCTGAGCAGG GCGACACGAT GTTGGGGGAC CTCCTGGACA GTGACTGCAC CACAGGGAGT 840 

GGCTCAGGGC TCCCCTTCCT GGTGCAGAGG ACAGTGGCAC GGCAGGTTGC CTTGGTGGAG 900 

TGTGTGGGAA AAGGCCGCTA TGGCGAAGTG TGGCGGGGCT TGTGGCACGG TOAOAGTGTG 960 

GCCGTCAAGA TCTTCTCCTC GAGGGATGAA CAGTCCTGGT TCCGGGAGAC TGAGATCTAT 1020 

AACACAGTAT TGCTCAGACA CGACAACATC CTAGGCTTCA TCGCCTCAGA CATGACCTCC 1080 

CGCAACTCGA GCACGCAGCT GTGGCTCATC AOGCACTACC ACGAGCACGG CTCCCTCTAC 1140 

GACTTTCTGC AGAGACAGAC GCTGGAGCCC CATCTGGCTC TGAGGCTAGC TGTGTCCGCG 1200 

GCATGCGGCC TGGOGCACCT GCACGTGGAG ATCTTCGGTA CACAGGGCAA ACCAGCGATT 1260 

GCCCACCGCG ACTTCAAGAG CCGCAATGTG CTGGTCAAGA GCAACCTGCA GTGTTGCATC 1320 

GCCGACCTGG GCCTGGCTGT GATGCACTCA CAGGGCAGCG ATTACCTGGA CATCGGCAAC 1380 

AACCCGAGAG TGGGCACCAA GCGGTACATG OCACCOGAGG TGCTGGACGA GCAGATCCGC 1440 

ACGGACTGCT TTGAGTCCTA CAAGTOGACT GACATCTGGG CCTTTGGCCT GGTGCTGTGG 1500 

GAGATTGCCC GCCGGACCAT CGTGAATGGC ATCGTGGAGG ACTATAGACC ACCCTTCTAT 1560 

GATGTGGTGC CCAATGACCC CAGCTTTGAG GACATGAAGA AGGTGGTGTG TGTGGATCAG 1620 

CAGACCCCCA CCATCCCTAA CCGGCTGGCT GCAGACCCGG TCCTCTCAGG CCTAGCTCAG 1680 

ATGATGCGGG AGTGCTGGTA CCCAAACCCC TCTCCCCGAC TCACCGCGCT GCGGATCAAG 1740 

AAGACACTAC AAAAAATTAG CAACAGTCCA GAGAAGCCTA AAGTGATTCA ATAGCCCAQQ la 00 

AGCACCTGAT TCCTTTCTGC CTGCAGGGGG CTGGGGGGGT GGGGGGCAGT GGATGGTGCC I860 

CTATCTGGGT AGAGGTAGTG TGAGTGTGGT GTGTGCTGGG GATGGGCAGC TGCGCCTGCC 1920 
TGCTCGGCCC CCAGCCCACC CAGCCAAAAA TACAGCTGGG CTGAAACCTG 



Seq ID NO; 197 protein sequence; 
Protein Accession #: NP_000011.1 

1 11 21 31 41 SI 

I I I I 1 I 

MTU3SPRKGL LMLLMALVTQ GDPVKPSRGP LVTCTCESPH CKGPTCRGAW CTWI.VREEG 60 

RHPQEHRGCG NLHRELCRGR PTBFVNHYCC DSHLCHHMVS IiVLEATQPPS EQPGTDGQLA 120 

L1LGPVLALL ALVALGVIjGL WHVRRRQEKQ RGLHSBLGBS SLILKASEQG DTMLGDLLDS 180 

DCTTGSGSGL PPLVQRTVAR QVALVECVGK GRYGEVWRGL WHGE8VAVKI FSSRDBQSWF 240 

RETEIYNTVL LRHDNILGFI ASDMTSRNSS TQLWLITHYH EHGSLYDFLQ RQTLEPHIAL 300 

RIAVSAACGIi AHLHVBIFGT QGKPAIAHRD PKSRNVLVKS NLQCCIADLG LAVMHSQGSD 360 

YLDIGMNPRV GTKRYMAPBV LDEQIRTDCF BSYKWTDIWA FGLVLWEIAR RTIVNGIVED 420 

YRPPPYDWP NDPSPEDMKK WCVDQQTPT IPNRLAADPV LSGLAQHMRB CWYPNPSARL 480 
TAIiRIKKTLQ KISHSPEKPK VTQ 



Seq ID NO: 198 DNA sequence 

Nucleic Acid Accession #: NMJ503 199.1 

Coding sequence: 200-2203 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I 1 I 

CGGGGGGATC TTGGCTGTGT GTCTGCGGAT CTGTAGTGGC GGCGGCGGCG GCGGCGGCGG 60 

GGAGGCAGCA GGCGCGGGAG CGGGCGCAGG AGCAGGCGGC GGCGGTGG CO GCGGCGGTTA 120 

GACATGAACG CCGCCTCGGC GCCGGOGGTG CACGGAGAGC CCCTTCTCGC GCGCGGGCGG 180 

TTTGTGTGAT TTTQCTAA AA TG CATCACCA ACAGCGAATG GCTGCCTTAG GGACGGACAA 240 

AGAGCTGAGT GATTTACTGG ATTTCAGTGC GATGTTTTCA CCTCCTGTGA GCAGTGGGAA 300 

AAATGGACCA ACTTCTTTGG CAAGTGGACA TTTTACTGGC TCAAATGTAG AAGACAGAAG 360 

TAGCTCAGGG TCCIGGGGGA ATGGAGGACA TCCAAGCCCG TCCAGGAACT ATGGAGATGG 420 
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GACT CCCTAT GACCACATGA CCAGCAGGGA CCTTGGGTCA CATGACAATC TCTCTCCACC 
TTTTGTCAAT TCCAGAATAC AAAGTAAAAC AGAAAGGGGC TCATACTCAT CTTATGOGAG 
AGAATCAAAC TTACAGGGTT GCCACCAOCA GAGTCTCCTT OGAGGTQACA TGQATATQGG 
CAACCCAGGA ACCCTTTCGC CCACCAAACC TGGTTCCCAO TACTATCAGT ATTCTAQCAA 
5 TAATCCCCGA AGGAGGCCTC TTCACAOTAO TGCCATGOAG OTACAGACAA AGAAAGTTCG 
AAAAOTTCCT CCAGGTTTOC CATCTTCAOT CTATGCTCCA TCAGCAAGCA CTGCOQACTA 
CAATAQGGAC TOGCCAGGCT ATCCTTOCTC CAAACCAGCA ACCAGCACTT TCCCTACCTC 
CTTCTTCATG CAAGATOGCC ATCACAQCAQ TOACCCTTGG AGCTCCTOCA GTGGGATGAA 
TCAQCCTGGC TATGCAGGAA TCTTGGGCAA CTCTTCTCAT ATTCCACAGT CCAGCAGCTA 

10 CTOTAOCCTG CATCCACATQ . AACGTTTGAG CTATCCATCA CACTCCTCAG CAGACATCAA 
TTCCAGTCTT CCTCOGATOT CCACTTTCCA TCSTACTG6T AGAAACCATT ACAGCACCTC 
TTCCTGTACG CCTCCTGCCA ACGGGACAGA CAGTATAATG GCAAATAGAG GAAGCGGGGC 
AGCCGGCAGC TOCCAOACTQ GAGATGCTCT GGGGAAAGCA CTTGCTTCGA TCTATTCTCC 
AGATCACACT AACAACAGCT TTTCATCAAA CCCTTCAACT CCTGTTGGCT CTCCTCCATC 

15 TCTCTCAGCA GGCACAGCTG TTTGGTCTAG AAATGGAGGA CAGGCCTCAT CGTCTCCTAA 
TTATGAAGGA CCCTTACACT CTCTGCAAAG COGAATTGAA GATCGTTTAG AAAGACTGGA 
TGATGCTATT CATGTTCTCC GGAACCATQC AOTGGGCCCA TCCACAGCTA TGCCTGGTGG 
TCATGGGGAC ATGCATGGAA TCATTGGACC TTCTCATAAT GGAGCGATGG GTGGTCTGGG 
CTCAGQGTAT GGAACCGGCC TTCTTTGAGC CAACAGACAT TCACTCATGG TGGGGACCCA 

20 TOGTGAAGAT GGCGTGGCCC TGAGAGGCAG CCATTCTCTT CTGCCAAACC AGGTTCCGGT 
TCCACAGCTT CCTGTCCAGT CTGOGACTTC CCCTGACCTG AACCCACCCC AGGACCCTTA 
CAGAGGCATG CCACCAGGAC TACAGGGGCA GAGTOTCTCC TCTGGCAGCT CTGAGATCAA 
ATCOGATGAC GAGGGTGATG AGAACCTGCA AGACACGAAA TCTTCGGAGG ACAAGAAATT 
AGATGAGGAC AAGAAGGATA TCAAATCAAT TACTAGCAAT AATGACGATG AGGACCTGAC 

25 ACCAGAGCAG AAGGCAGAGC GTGAGAAGGA GCGGAGGATG GCCAACAATG CCCGAGAGCG 
TCTGCGGGTC CGTGACATCA ACGAGGCTTT CAAAGAGCTC GGCCGCATGG TGCAGCTCCA 
CCTCAAGAGT GACAAGCCCC AGACCAAGCT CCTGATCCTC CACCAGGCGG TGGCCGTCAT 
CCTCAGTCTG GAGCAGCAAG TCCGAGAAAG GAATCTGAAT CCGAAAGCTG CGTGTCTGAA 
AAGAAGGGAG GAAGAGAAGG TGTOCTCGGA GCCTCCCCCT CTCTCCTTGG CCGGCCCACA 

30 CCCTGGAATG GGAGACGCAT CGAATCACAT GGGACAGATG 2MAAGGGTC CAAGTTGCCA 
CATTGCTTCA TTAAAACAAO AGACCACTTC CTTAACAGCT GTATTATCTT AAACCCACAT 
AAACACTTCT CCTTAACCCC CATTTTTGTA ATATAAGACA AGTCTGAGTA GTTATGAATC 
GCAGACGCAA GAGGTTTCAG CATTCCCAAT TATCAAAAAA CAGAAAAACA AAAAAAAGAA 
AGAAAAAAGT GCAACTTGAG GGACGACTTT CTTTAACATA TCATTCAGAA TGTGCAAAGC 

35 AGTATGTACA GGCTGAGACA CAGCCCAGAG ACTGAACGGC 

Seq ID NO: 199 Protejfl fl^ff! 
Protein Accession ft: NP_003 190.1 

40 1 11 21 31 41 51 

J I I I I I 

MHHQQRMAAL GTDKELSDLL DFSAMFSPPV SSGKNGPTSL ASGHFTGSNV BDRSSSGSWG 60 

NGGHPSPSRN YGDGTPYDHM TSRDLGSHDN LSPPPVNSRI QSKTBRGSYS SYGRESNLQG 120 

CHQQSLLGGD MDMGNPGTLS PTKPGSQYYQ YSSNNPRRRP LHSSAMEVQT KKVRKVPPGL 18 0 

45 PSSVYAPSAS TADYNRDSPG YPSSKPATST FPSSPFMQDG HRSSDPWSSS SGKNQPGYAG 240 

MLGNSSHIPQ SSSYCSLHPH ERLSYPSHSS ADINSSLPPM STFHRSGTNH YSTSSCTPPA 300 

NGTDSIMANR GSGAAGSSQT GDALGKALAS IYSPDHTNN3 FSSNPSTPVG SPPSLSAGTA 360 

VWSRNGGQAS SSPNYEGPLH SLQSRIEDRL ERLDDAIHVL RNHAVGPSTA MPGGHGDMHG 420 

IIGPSHNGAM GGLGSGYGTG LLSANRHSLM VGTHREDGVA URGSHSLLPN QVPVPQLPVQ 480 

50 SATSPDLNPP QDPYRGMPPG LQGQSVSSGS SEIKSDDEGD ENLQDTKSSE DKKLDDDKKD 540 

IKSITSKNDD BDLTPSQKAE RBKERRMANN ARBRLRVROI NEAFKELGRM VQLHLKSDKP 600 

QTKLLILHQA VAVILSLEQQ VRERNLNPKA ACLKRREEEK VSSEPPPLSL AGPHPGMGDA 660 
SNHMGQM 

55 Seq ID NO: 200 DNA sequence 

Nucleic Acid Accession »: BC00S987 (1-1286), BB888744 (1287-1756) 

Coding sequence: 124-525 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

60 | | | | | | 

GGCAGAAGAG GAAGATTTCT GAAGAGTGCA GCTGCCTGAA CCGAGCCCTG CCGAACAGCT 60 

GAGAATTGCA CTGCAACCAT_£AGTGAGAAC AATAAGAATT CCTTGGAGAG CAGCCTACGG 120 

CAACTAAAAT GCCATTTCAC CTGGAACTTG ATGGAGGGAG AAAACTCCTT GGATGATTTT 180 

GAAGACAAAG TATTTTACCG GACTGAGTTT CAGAATCGTG AATTCAAAGC CACAATGTGC 240 

65 AACCTACTGG CCTATCTAAA GCACCTCAAA GGGCAAAAGG AGGCAGCCCT GGAATGCTTA 300 

CGTAAAGCTG AAGAGTTAAT CCAGCAAGAG CATGCTGACC AGGCAGAAAT CAGAAGTCTG 360 

GTCACCTGGG GAAACTATGC CTGOGTCTAC TATCACATGG GCOGACTCTC AGACGTTCAG 420 

ATTTATGTAG ACAAGGTGAA ACATGTCTGT GAGAAGTTTT CCAGTCCCTA TAGAATTGAG 460 

AGTCCAGAGC TTGACTGTGA GGAAGGGTGG ACACGGTTAA AGTGTCGARG AAACCAAAAT 540 

70 GAAAGAGCGA AGGTGTGCTT TGAGAAGGCT CTGGAAAAGA AGCCAAAGAA CCCAGAATTC 600 

ACCTCTGGAC TGGCAATAGC AAGCTACCGT CTGGACAACT GGCCACCATC TCAGAAOGCC 660 

ATTGACCCTC TGAGGCAAGC CATTCGGCTG AATCCTGACA ACCAGTACCT TAAAGTCCTC 720 

CTGGCTCTGA AGCTTCATAA GATGCGTGAA GAAGGTGAAG AGGAAGGTGA AGGAGAGAAG 780 

TTAGTTGAAG AAGCCTTGGA GAAAGCCCCA GGTGTAACAG ATGTACTTCG CAGTGCAGCC 840 

75 AAGTTTTATC GAAGAAAAGA TGAGCCAGAC aaagcgattg aactgcttaa AAAGGCTTTA 900 

GAATACATAC CAAACAATGC CTACCTGCAT TGCCAAATTG GGTGCTGCTA TAGGGCAAAA 960 



460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
I860 
1920 
I960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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GTCTTCCAAG TAATGAATCT AAGAGAGAAT GGAATGTATG GGAAAAGAAA GTTACTGGAA 1020 

CTAATAGGAC ACGCTOTGGC TCATCTGAAG AAAGCTOATG AGGCCAATGA TAATCTCTTC 1080 

OGTOTCTGTT CCATTCTTQC CAGCCTCCAT GCTCTAGCAG ATCAGTATGA AGAAGCAGAG 1140 

TATTACTTCC AAAAGGAATT CAGTAAAGAG CTTACTCCTG TAGOGAAACA ACTGCTCCAT 1200 

CTGCGGTATG GCAACTTTCA GCTGTACCAA ATGAAGTOTQ AAGACAAGGC CATCCACCAC 1260 

TTTATAGAGG GTGTAAAAAT AAACCAGAAA TCAAGGGAGA AAGAAAAGAT GAAAGACAAA 1320 

CTOCAAAAAA TTGCCAAAAT GOGACTTTCT AAAAATGOAG CAGATTCTGA GGCTTTGCAT 1380 

GTCTTOGCAT TCCTTCAGGA GCTGAATGAA AAAATGCAAC AAGCAGATGA AGACTCTGAG 1440 

AGGGGTTTGG AGTCTGGAAG CCTCATCCCT TCAGCATCAA GCTGGAATGG GGAATGAAGA 1500 

ATAGAGATGT GGTGCCCACT AGGCTACTGC TGAAAGGGAG CTGAAATTCC TCCACAAGTT 1560 

GGTATTCAAA ATATGTAATG ACTGGTATGG CAAAAGATTQ GACTAAGACA CTGGCCATAC 1620 

CACTGGACAG GGTTATGTTA AACCTGAATT GCTOGGTCTT AAAAGAGCCC AAGGAQTTCT 1680 

GGGAGAGGGA CAGATTGGGG GGTCGTCCAG GGCTGCGCTA AATTATTCTC AATGATTTGT 1740 
CTCITTGCGG AACTTC < 



Seq ID NO: 201 Protein sequence: 
Protein Accession #: AAA59191 



1 11 21 31 41 51 

I 1 I I I I 

MSBNNKNSLB SSLRQLKCHF TWNLMEGEN8 LDDPBDKVFY RTEFQNRBFK ATMCULLAYL 60 

KHUCGQNBAA LECLRXAEEL IQQEHADQAE IRSLVTWGNY AWVYYHMGRL SDVQIYVDKV 120 

KHVCEXFSSP YRIESPELDC REGWTRLKCG GNQNBRAKVC FBKALEKKPK NPEFTSGLAI 1B0 

ASYRLDNWPP SONAIDPLRQ AIRLNPDNQY LKVLLALKLH KMREEGEEEG EGEKLVEEAL 240 

EKAPGVTDVL RSAAKFYRRK DBPDKAIELL KKALEYIPNN AYLHCQIGCC YRAKVFQVMN 300 

LRBNGMYGKR KLLELIGHAV AHLKKADEAN DNLFRVCSIL ASLHALADQY EDAEYYFQKE 360 

FSKELTPVAK QLLHLRYGNF QL.YQMKCEDK AIHHFIBGVX INQKSREKEK NKDKLQKIAK 420 
MRLSKNGADS EAIHVXAFLQ ELNEKKQQAD BDSERGLESG SLIPSASSWN GE 



Seq ID NO: 202 DNA sequence 
Nucleic Acid Accession #: NM_003090 

Coding sequence: 57-824 {underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GAATTCCGCG GGAGGCCACG GGCTTTCCAC AGCGCGGGGG AACGGGAGGC TGCAGGATGG 60 

TCAAGCTGAC GGCGGAGCTG ATCGAGCAGG CGGCGCAGTA CACCAACGCG GTGCGCGACC 120 

GGGAQCTGGA CCTCCGGGGG TATAAAATTC CCGTCATTOA AAATCTAGGT GCTACOTTAG 180 

ACCAGTTTGA TGCTATTGAT TTTTCTGACA ATQAGATCAG GAAACTGGAT GGTTTTCCTT 240 

TGTTGAGAAG ACTGAAAACA TTGTTAGTGA ACAACAACAG AATATGCCQT ATAGGTGAGG 300 

GACTTGATCA GGCTCTGCCC TGTCTGACAG AACTCATTCT CACCAATAAT AGTCTCGTGG 360 

AACTGGGTGA TCTGGACCCT CTGGCATCTC TCAAATCGCT GACTTACCTA AGTATCCTAA 420 

GAAATCOGGT AACCAATAAG AAGCATTACA GATTGTATGT GATTTATAAA GTTCCGCAAG 480 

TCAGAGTACT GGATTTCCAG AAAGTGAAAC TAAAAGAGCG TCAGGAAGCA GAGAAAATGT 540 

TCAAGGGCAA ACGGGGTGCA CAGCTTGCAA AGGATATTGC CAQGAGAAGC AAAACTTTTA 600 

ATCCAGGTGC TGGTTTGCCA ACTGACAAAA AQAGAGGTGG GCCATCTCCA GGGGATGTAG 660 

AAGCAATCAA GAATGCCATA GCAAATGCTT CAACTCTGGC TGAAGTGGAG AGGCTGAAGG 720 

GGTTGCTGCA GTCTGGTCAG ATCCCTGGCA GAGAACGCAG ATCAGGGCCC ACTGATGATG 780 

GTGAAGAAGA GATGGAAGAA GACACAGTCA CAAACGGGTC CTGAGCAGTG AGGCAGATGT 840 

ATAATAATAG GCCCTCTTGG AACAAGTCTT GCTTTTCGAA CATGGTATAA TAGCCTTGTT 900 

TGTGTTAGCA AAGTGGAATC TATCAGCATT GTTGAAATGC TTAAGACTGC TGCTGATAAT 960 

TTTGTAATAT AAGTTTTGAA ATCTAAATGT CAATTTTCTA CAAATTATAA AAATAAACTC 1020 
CACTCTCTAT GCTAAAAAAA AAAAAAAGGA ATTC 

Seq ID NO i 203 Protein sequence: 
Protein Accession 8: NP_003081.1 



1 11 21 

I I I 

HVKLTAELIS QAAQYTNAVR DRELDLRGYK 
PLLRRLKTLL VNNNRICRIG EGLDQALPCL 
LRNPVTNKKH YRLYVIYKVP QVRVLDFQKV 
FNPGAGLPTD KKRGGPSPGD VEAIKNAIAN 
DGEEEMEEDT VTNGS 



31 41 SI 

I I I 

IPVIENLGAT IiDQPDAIDPS DNEIRKJjDGF 60 

TELILTNNSL VELGDLDPLA fiLKSLTYLSI 120 

KDKERQEABX MFKGKRGAQL AKDIARRSKT 180 

ASTLABVERL KGLLQSGQIP GRERRSGPTD 240 



Seq ID NO: 204 DNA sequence 

Nucleic Acid Accession #: NM_017643.1 

Coding sequence: 169-1401 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

lit! I I 

AATAGCAATA GCTTTATAGC AGCTCCGGTT ACCTGTTTTA AACATGGAAG GAGAGTCGCT 60 

CCCAGATAGC CCTCACGAGT GOCCCTGGAG CAGGGAGTGG TGGAGCAGAT CTTCCTTGTT 120 

TGGGAGGAGC CTGAGGTGGA CCTCGCGTCC TGAGTCTGGA AGGCACC TAT GG GGACCTQC 180 

TGGGGTGATA TCTCAGAAAA TGTGAGAOTA GAAGTTCCCA ATACAGACTG CAGCCTACCT 240 
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ACCAAAGTCT TCTGGATTGC TGGAATTGTA AAATTAGCAG GTTACAATGC CCTTTTAAGA 300 

TATGAAGGAT TTGAAAATGA CTCTGGTCTO GACTTCTGGT GCAATATATG TGGTTCTGAT 360 

ATOCATCCAG TTGGTTGGTG TGCAGCCAGC GGAAAACCTC TPGTTCCTCC TAGAACTATT 420 

CAGCATAAAT ATACAAACTG GAAAGCTTTT CTAGTGAAAC GACTTACTGG TGCCAAAACA 480 

CTGCCTCCTG ATTTCTCCCA AAAGGTTTCA GAGAGTATGC AOTATCCTTT CAAACCTTGC 540 

ATGAGAGTAG AAGTGGTTGA CAAGAGGCAT TTGTGTCGAA CACGAGTAGC AGTGGTGGAA 600 

AGTGTAATTG GAGGAAGATT AAGACTAGTG TATGAAGAAA GCGAAGATAG AACAGATGAC 660 

TTCTGGTGCC ATATGCACAG CCCATTAATA CATCATATTG GTTGGTCTOG AAGCATAGGT 720 

CATCGATTCA AAAGATCTGA TATTACAAAG AAACAGGATG GACATTTTGA TACACCACCA 780 

CATTTATTTG CTAAGGTAAA AGAAGTAGAC CAGAGTGGGG AATGGTTCAA GGAAGGAATG 840 

AAATTGGAAG CTATAGACCC ATTAAATCTT TCTACAATAT GTGTCGCAAC CATTAGAAAG 900 

GTGCTAGC7G A0GGATTCC7 GATGATTGGG ATCGATGGCT CAGAAGCAGC AGACGGATCI 960 

GACTGGTTCT GTTACCATGC AACCTCTCCT TCTATTTTCC CTGTCGGTTT CTGTGAAATT 1020 

AACATGATTG AACTTACTCC ACCCAGAGGT TACACAAAAC TTCCTTTTAA ATGGTTTGAC 1080 

TACCTCAGGG AAACTGGCTC CATTGCAGCA CCAGTAAAAC TATTTAATAA GGATGTTCCA 1140 

AATCAOGGAT TTCGTGTAGG AATGAAATTA GAAGCAGTAG ATCTCATGGA GCCACGTTTA 1200 

ATATGTGTAG CCACAGTAAC TCGAATTATT CATCGTCTCT TGAGGATACA TTTTGATGGA 1260 

TGGGAAGAAG AGTATGATCA GTGGGTAGAC TGTGAGTCAC CTGACCTCTA TCCTGTAGGG 1320 

TGGTGTCAGT TAACTGGATA TCAACTACAG OCTCCAGCAT CACAGTGTAA GTTGGTATAC 1380 

AGAAAAGGTG TCCTTTTGIA^ AAAATCAGCA ATTCTCCAGA GGACTATCTC ACATAAGTCA 1440 

TCTTATGAGC TCACAGGACA AGAATATACC TATGTCTQAT TGGTTGCCAG GTAAGACATT 1500 

AAGACTCAAC AACAATATCA CAGAATCAGA CCATGTGTCC CATGGCAATG 7GAATCCAAT 1560 

AGTCAATTAC ATAATGACTA TAGAAACACA ACAGTCACCA AATTAAACTA GACTTACTAT 1620 

TTTAGTGAGT TAAAAATTAC ATACTAAAAG TTTATTGGTA GGTAATAAAT GCTTTTGAGT 1680 

AAATAGTGGA AAATGTCTCA TGTTGAGGCT ATGGTTTTGT AGGAACAAGT ACCCTTATTT 1740 

TCAGAGCATC ATGTACTTAA GTATAATGGT CTTGGTAAAG ATAGTTCATA TAAGTTGTAT 1800 

CTAGACAACT GTATCGTCTA AATTGTAAAC AATTATCTAG TACCAATTTT CCCTTTTTAT 1860 

TTTTCAGCAT CAAGAGAAAA CCAATCAGCT TCATCAAAAC AGAAGAAAAA GGCTAAGTCC 1920 

CAGCAATACA AAGGACATAA GAAAAGTGGG TCACCACGTG GTGTTCACAT ACATTTTCTA 1980 

ATTGTTAACT AATTGGAGTC ACAGTATTCT TGGACAGAAA ATGATATATC TTGTGAGAAC 2040 

TGATGATTGT GCATTATGTA TTATGCTTAA AGGTGCAGTA TGCCATAAAA GGCAAACCCT 2100 

TGCAATAATG AGAAACACTG ATATTTTACT AACAGGAGAA ATGATTACCA CAGTATTTAA 2160 

AGTATACGTG GTAAAGAATA GAQTCTGTGA ATGATTCTTG AAATAATATG TAAAACCTAC 2220 

TGAAAGTTAA TCCTTTTTAA AAACTTTATT TAAAAAGAAA AATTAGCAGC CAGGTGCAGT 2280 

GGCTCACGCC TGTAATCCCA GCACTTTAGG AGGCCGAGGC TGGCAGATCA CAAGGTCAGG 2340 

AGATCGAGAC CATCCTGGCT AACACGGTGA AACCCTGTCT CCACCAAAAA TACAAAAAAT 2400 

CTGCCGGGCG TGGTCGCACA OGCCTGAAGT CCCAGCTACT CAGGAGGCTG AGGCAAGAGA 2460 

ATCACTTGAA CCCAGGAGGC AGAGGTTGCA GTGGGCCAAG ATCACGCCAC TACATTCCAG 2520 
CTGGGCAACA CAGCAAGACT CTGTCTCAAA AAAAAAAAAA AAAA 

Seq ID HO: 205 Protein, pequenqe; 
Protein Accession #: NP_060113.1 

1 11 '21 31 41 51 

I I ! I I I 

MGTCWGDISE NVRVEVPNTD CSLPTKVFWI AGIVKLAGYN ALLRYEGFEN DSGLDFWCKI 60 

CGSDIHPVGW CAASGKPLVP PRTIQHKYTN HKAFLVKRLT GAKTLPPDFS QKVSESKQYP 120 

PKPCMRVEW DKRHLCRTRV AWESVIGGR ItRLVYEESED RTDDFWCHMH SPLIHHIGWS 180 

RSIGHRFKHS DITKKQDGHP DTPPHLFAKV KBVDQSGEWF KBGMKLEAID PLNfcSTICVA 240 

TIRKVLADGP LMIGIDGSEA ADGSDWFCYH ATSPSIFPVG FCEINMIELT PPRGYTKLPF 300 

KWFOYXiRETG SIAAPVKLFN KDVPNHGPRV GKKLEAVDLM BPRLICVATV TRIIHRLLRI 360 
HFDGWEEEYD QWVDCESPDL YPVGWCQLTG YQLQPPASQC KLVYRKGVLL 



Seq ID NO: 206 PNA, seqwencfi 
Nucleic Acid Accesaion #: NM_012334 

Coding sequence: 223-6399 (underlined sequences correspond to start end stop codons) 

1 11 21 31 41 51 

I I I I I I 

GAGACAAAGG CTGCCGTCGG GACGGGCGAG TTAGGGACTT GGGTTTGGGC GAACAAAAGG 60 

TGAGAAGGAC AAGAAGGGAC CGGGOGATGG CAGCAGGGGA GCCCCGCGGG CGCGCGTCCT 120 

CGGGAGTGGC GCCGTGACAC GCATGGTTTC CCCGGACCCG CXSGCGGCGCT GACTTCCGCG 180 

AGTCGGAGCG GCACTCGGCO AGTCCGGGAC TGCGCTGGAA CAfiSSGATAA CTTCTTCACC 240 

GAGGGAACAC GGGTCTGGCT GAGAGAAAAT GGCCAGCATT TTCCAAGTAC TGTAAATTCC 300 

7GTGCAGAAG GCATCGTCGT CTTCCGGACA GACTATGGTC AGGTATTCAC TTACAAGCAG 360 

AGCACAATTA CCCACCAGAA GOTGACTGCT ATGCACCCCA CGAACJGAGGA GGGCGTGGAT 420 

GACATGGCGT CCTTGACAGA GCTCCATGGC GGCTCCATCA TGTATAACTT ATTCCAGOGG 480 

TATAAGAGAA ATCAAATATA TACCTACATC GGCTCCATCC TGGCCTCCGT GAACCCCTAC 540 

CAGCCCATCG CCGGGCTOTA CGAGCCTGCC ACCATGGAGC AGTACAGCCG GCGCCACCTG 600 

GGCGAGCTGC CCCCGCACAT CTTCGCCATC GCCAACGAGT QCTACCGCTG CCTGTGGAAG 660 

CGCTACGACA ACCAGTGCAT CCTCATCAGT GGTGAAAGTG GGGCAGGTAA AACCGAAAGC 720 

ACTAAATTGA TCCTCAAGTT TCTGTCAGTC ATCAGTCAAC AGTCTTTGGA ATTGTCCTTA 780 

AAGGAGAAGA CATCCTGTGT TGAACGAGCT ATTCTTGAAA GCAGCCCCAT CATGGAAGCT 840 

TTCGGCAATG CGAAGACCGT GTACAACAAC AACTCTAGTC GCTTTGGGAA GTTTGTTCAG 900 
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CTGAACATCT GTCAGAAAGG AAATATTCAG 
AAAAACCGAG TAGTAAGGCA AAATCCCGGG 
CTGGCAGGGC TGGAACATGA AGAAAGAGAA 
CACTACTTGA ATCAGTCTGG ATGTGTAGAA 
5 AGGGAAGTTA TTAC66CAAT GGACGTGATG 
TCGAGGCTGC TTGCTGGTAT ACTGCATCTT 
GCACAGGTTT CCTTCAAAAC AGCTTTGGGC 
ACACAGCTCA CAGATGCTTP GACCCAGAGA 
AOGCCTCTCA ATGTTCAACA GGCAQTAGAC 

10 GCGTGCTGCT TTGAGTGGGT AATCAAQAAG 
TTCAAOTCTA TTGGCATCCT OOACATCTTT 
GAACAGTTCA ATATAAACTA TGCAAACOAG 
TTTTCTTTAG AACAACTAGA ATATAGCCGG 
ATAGACAATG GAGAATGCCT GOACTTGATT 

15 AATGAAGAAA GCCATTTTCC TCAAGCCACA 
CAOCATOOGA ATAACCACTT TTATGTGAAG 
AAGCACTATG CTGGAGAGGT GCAATATGAT 
ACATTTGGAG ATOACCTTCT CAATTTGCTA 
CTTTTTGAAC ATGTTTCAAG CCGCAACAAC 

20 CGGCGGCCTA CAGTCAGCTC ACAGTTCAAG 
AGCTCCTCTA ATCCTTTCTT TGTTCGCTGT 
CAGTTTGACC AGGCGGTTGT GCTGAACCAG 
AGAATCCGCA AAGCTGGGTA TGCGGTCCGA 
AAAGTGCTGA TGAGGAATCT GGCTCTGCCT 

25 CTGCAGCTCT ATGATGCCTC CAACAGCGAG 
CGAGAATCCT TGGAACAGAA ACTGGAGAAG 
ATGGTGATTC GGGCCCATGT CTTGGGCTTC 
TATTGTGTGG TGATAATACA GAAGAATTAC 
CACCTGAAAA AGGCAGCCAT AGTTTTCCAG 

30 GTTTACAGAC AATTGCTGGC AGAGAAAAGG 
GAAGAAAAGA AGAAACGGGA GGAAGAAGAA 
GAGCTCCGCG CCCAGCAGGA AGAAGAAACG 
AAGAGCCAGA AGGAAGCTGA ACTGACCCGT 
GTGGAAGAGA TCCTCCQTCT GGAGAAAGAA 

35 CAGGAGCTGT CGCTGACCGA GGCTTCCCTG 
CTCCGCAGGC TGGAGGAGGA AGCGTGCAGG 
TTCGACGAGA TCGACGAGTG TGTCCGGAAT 
TTTTCCAGCG AGCTGGCTGA GAGCGCATGC 
CCCTACCCAG AGGAGGAGGT CGATGAGGGC 

40 TCCCCCAACC CCAGCGAGCA CGGCCACTCA 
GATGACTCTT CAGAGGAGGA CCCATACATG 
GCGGACAGCA CGGTGCTGCT CGCCCCATCA 
TCCAGOGGCG AGTCCACCTA CTGCATGCCC 
GGCGACTACG ACTACGACCA GGATGACTAT 

45 GTGACCTTCT CCAACTCCTA OGGCAGCCAG 
ACCTACAACA GCTCGGGTGC CTACCGGTTC 
GATAGTGAAG AGGACTTTGA TTCCAGGTTT 
GACTCTGTGT ACAGCTGTGT CACTCTGCCG 
GGCCTGATGA ACTCTTGGAA ACGCCGCTGG 

50 TTCCGCTCCA AGCAGGAGGC CCTCAAGCAA 
TCCACGCTGT CCAGGAGAAA TTGGAAGAAG 
ATGTACTTTG AAAACGACAG CGAGGAGAAG 
AAAGAGATCA TAGATAACAC CACCAAGGAG 
ACTTTCCACC TGATTGCAGA GTCCCCAGAA 

55 CAGGTCCAOG CGTCCACGGA CCAGGAGATC 
CAGAATGCTG TGGGCACCTT GGATGTGGGG 
CCTGATAGAC CCAACTCGTT TGTGATCATC 
GACACGCCGG AGOAGATGCA CCACTGGATA 
AGAGTGGAGG GCCAGGAATT CATCGTGAGA 

60 CCGAAGATGT CTTCACTGAA ACTGAAGAAA 
GATTACTACA AGAOTTCAGA GAAGAACGCG 
CTCTGCTCTG TCGTCCCCCC AGATGAGAAG 
ACOGTGTACG GGOGCAAGCA CTGTTACCGG 
. CGOTGGTCCA GTGCCATTCA AAACGTGACT 

65 CAGCAGCTGA TTCAAGATAT CAAGGAGAAC 
TACAAGOGGA ACCCGATCCT TCGATACACC 
CTTCCGTATG GGGACATAAA TCTCAACTTG 
GATGAGGCCA TCAAGATATT CAATTCCCTG 
CCAATAATCC AGGGCATCCT ACAGACAGGG 

70 TACTGCCAGC TTATCAAACA GACCAACAAA 
TACAGCTGGC AGATCCTGAC ATGCCTGAGC 
AAGTATCTCA AGTTCCATCT GAAAAGGATA 
AAATACGCTC TCTTCACTTA CGAATCTCTT 
TCCCGAGATG AAATAGAAGC TCTGATCCAC 

75 CATGGCGGCG GCTCCTGCAA GATCACCATC 
GAGAAGCTGA TCCGAGGCCT GGCCATGGAG 



GGCGGGAGAA TTGTAGATTA TTTATTAGAA 960 

GAAAGGAATT ATCACATATT TTATGCACTG 1020 

GAATTTTATT TATCTAOGCC AGAAAACTAC 1080 

GACAAGACAA TCAGTGACCA GGAATCCTTT 1140 

CAGTTCAGCA AGGAGGAAGT TCGGGAAGTG 1200 

GGGAACATAG AATTTATCAC TGCTGGTGGG 1260 

AGATCTGCGG AGTTACTTGG GCTGGACCCA 1320 

TCAATGTTCC TCAGGGGAGA AGAGATCCTC 1380 

AGCAGGGACr CCCTGGCCAT GGCTCTGTAT 1440 

ATCAACAGCA GGATCAAAGG CAATGAGGAC 1500 

GGATTTGAAA ACTTTGAGGT TAATCACTTT 156 0 

AAACTTCAGG AGTACTTCAA CAAGCATATT 1620 

GAAGGATTAG TGTGGGAAGA TATTGACTGG 1680 

GAGAAGAAAC TTGGCCTCCT AGCCCTTATC 1740 

GACAGCACCT TATTGGAGAA GCTACACAGT 1800 

CCCAGAGTTG CAGTTAACAA TTTTGGAGTG 1860 

GTCOGAGGTA TCTTGGAGAA GAACAGAGAT 1920 

AGAGAAAGCC GATTTGACTT TATCTACGAT 1980 

CAGGATACCT TGAAATGTGG AAGCAAACAT 2040 

GACTCACTOC ATTCCTTAAT GGCAACGCTA 2100 

ATCAAGCCAA ACATGCAGAA GATGCCAGAC 2160 

CTGCGGTACT CAGGGATGCT GGAGACTGTG 2220 

AGACCCTTTC AGGACTTTTA CAAAAGGTAT 2280 

GAGGAGGTCC GAGGGAAGTG CACGAGCCTG 2340 

TGGCAGCTQG GGAAGACCAA GGTCTTTCTT 2400 

CGGAGGGAAG AGGAAGTGAG CCAOGCGGCC 2460 

TTAGCACGAA AACAATACAG AAAGGTCCTT 2520 

AGAGCATTCC TTCTGAGGAG GAGATTTTTG 2580 

AAGCAACTCA GAGGTCAGAT TGCTCGGAGA 2640 

GAGCAAGAAG AAAAGAAGAA ACAGGAAGAG 2700 

AGAGAAAGAG AGAGAGAGCG AAGAGAAGCC 2760 

AGGAAGCAGC AAGAACTCGA AGCCTTGCAG 2820 

GAACTGGAGA AACAGAAGGA AAATAAGCAG 2880 

ATCGAGGACC TGCAGCGCAT GAAGGAGCAG 2940 

CAGAAGCTGC AGGAGCGGCG GGACCAGGAG 3000 

GCGGCCCAGG AGTTCCTCGA QTCCCTCAAT 3060 

ATCGAGCGGT CCCTGTCGGT GGGAAGCGAA 3120 

GAGGAGAAGC CCAACTTCAA CTTCAOCCAG 3180 

TTOGAAGCCG ACGACGACGC CTTCAAGGAC 3240 

GACCAGCGAA CAAGTGGCAT CCGGACCAGC 3300 

AACGACACGG TGGTGCCCAC CAGCCCCAGT 3360 

GTGCAGGACT CCGGGAGCCT ACACAACTCC 3420 

CAGAACGCTG GGGACTTGCC CTCCCCAGAC 3480 

GAGGACGGTG CCATCACTTC CGGCAGCAGC 3540 

TGGTCCCCOG ACTACCGCTG CTCTGTGGGG 3600 

AGCTCTGAGG GGGCGCAGTC CTCGTTTGAA 3660 

GATACAGATG ATGAGCTTTC ATACCGGCGT 3720 

TATTTCCACA GCTTTCTGTA ■ CATGAAAGGT 3780 

TGCGTCCTCA AGGATGAAAC CTTCTTGTGG 3840 

GGCTGGCTCC ACAAAAAAGG GGGGGGCTCC 3900 

CGCTGGTTTG TCCTCCGCCA GTCCAAGCTG 3960 

CTCAAGGGCA CCGTAGAAGT GCGAACGGCA 4020 

AATGGGATCG ACATCATTAT GGCCGATAGG 4080 

GATGCCAGCC AGTGGTTCAG CGTGCTGAGT 4140 

CAGGAGATGC ATGATGAGCA GGCAAACCCA 4200 

CTGATTGATT CTGTGTGTGC CTCTGACAGC 4260 

ACGGCCAACC GGGTGCTGCA CTGCAACXSCC 4320 

ACCCTGCTGC AGAGGTCCAA AGGGGACACC 4380 

GGATGGTTGC ACAAAGAGGT GAAGAACAGT 4440 

CGGTGGTTTG TACTCACCCA CAATTCCCTG 4500 

CTCAAACTGG GGACCCTGGT CCTCAACAGC 4560 

ATATTCAAAG AGACAGGCTA CTGGAACGTC 4620 

CTCTACACCA AGCTGCTCAA CGAGGCCACC 4680 

GACACCAAGG CCCCGATCGA CACCCCCACC 4740 

TGCCTGAACT CGGATGTGGT GGAACAGATT 4800 

CATCACCCCT TOCACTCCCC GCTCCTGCCC 4860 

CTCAAAGACA AAGGCTATAC CACCCTTCAG 4920 

CAGCAACTGG AGTCCATGTC TGACCCAATT 4980 

CATGACCTGC GACCTCTGCO GGACGAGCTG 5040 

QTGCCCCACC CCGGCAGTGT GGGCAACCTG 5100 

TGCACCTTCC TGCCGAGTCG AGGGATTCTC 5160 

CGGGAACAGT TTCCAGGAAC CGAGATGGAA 5220 

AAGAAAACCA AATGCCGAGA GTTTGTGCCT 5280 

AGGCAGGAAA TGACATCCAC GGTCTATTGC 5340 

AACTCCCACA CCACTGCTGG GGAGGTGGTG 5400 

GACAGCAGGA ACATGTTTGC TTTGTTTGAA 5460 
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TACAACGGCC ACGTCGACAA AGCCATTGAA AGTCGAACCQ TOGTAGCTGA TGTCTTAGCC 5520 

AAGTTTGAAA AGCTGGCTGC CACATCCGAG GTTGGGGACC TGCCATGGAA ATTC TACTTC 5580 

AAACTTTACT GCTTCCT60A CACAGACAAC GTGCCAAAAG ACAGTGTGGA GTTTGCATTT 5640 

ATQTTTGAAC AGGCCCAOGA AGCGGTTATC CATGGCCACC ATCCAGCCCC GGAAGAAAAC 5700 

5 CTCCAGGTTC TTGCTGCCCT GCGACTCCAO TATCTGCAGG OGGATTATAC TCTGCAOGCT 5760 

GCCATCCCAC CTCTCGAAGA GGTTTATTOC CTGCAGAGAC TCAAGGCCCG CATCAGCCAG 5820 

TCAACCAAAA CCTTCACCCC TTGTGAACGG CTGGAGAAGA GGCGGACGAG CTTCCTAGAG 5880 

GGGACCCTGA GGCGGAGCTT CCGGACAGGA TCCGTQGTCC GGCAGAAGGT OQAGGAGGAG 5940 

CAGATGCTGG ACATGTGGAT TAAGGAAGAA GTCTCCTCTG CTCGAGCCAG TATCATTGAC 6000 

10 AAGTGGAGGA AATTTCAGGG AATGAACCAG GAACAGGCCA TGGCCAAGTA CATGGCCTTG 6060 

ATCAAGGAGT GGCCTGGCTA TGGCTOGAOG CTGTTTGATG TGOAGTGCAA GGAAGGTGGC 6120 

TTCCCTCAGG AACTCTGGTT GGGTGTCAGC GCGGACGCGG TCTCCGTCTA CAAGCGTGGA 6180 

GAGGGAAGAC CACTGGAAGT CTTCCAGTAT GAACACATCC TCTCTTTTOG GGCACCCCTG 6240 

GCGAATACGT ATAAGATCGT GGTCGATGAG AGGGAGCTGC TCTTTGAAAC CAGTGAGGTG 6300 

15 GTGGATSTGG CCAAGCTCAT GAAAGCCTAC ATCAGCATGA TOOTGAAGAA GCGCTACAGC 6360 

ACGACACGCT CCGCCAGCAG CCAGGGCAGC TCCAGGTCAA GGCGGGACAG AGCCCACCTG 6420 

TCTTTGCTAC CTGAACGCAC CACCCTCTGG CCTAGGCTGG CTCCAGTGTG CCATGCCCAG 6480 

CCAAAACAAA CACAGAGCTG CCCAGGCTTT CTGGAAGCTT CTGGTCTGAG GGAGGTGTCT 6540 

CCGAGGATCC TTTTGCCTGC CGCCTTCATT GATCCTGTAT TAAGCTGTCA ACTTTAACAO 6600 

20 TCTGCACAGT TTCCAAAGCT TTACTACTCT TAGAGGACAC ATGCCTTAAA AAAGGAGGGG 6660 

AGGAACCACG CTGCCACCAA AGCAGCCGGA AGTGCCTTAA CTTOTGGAAC CAACACTAAT 6720 

OGACCQTAAC TGTGCTACTO AAGGGAACTG CCTTTCCCCC TTCTGGGGGA GACTTAACAG 6780 

AGCGTGGAAG GGGGGCATTC TCTGTCAATG ATGCACTAAC CTCCCAACCT GATTTCCCCG 6840 

AATCTGAGGG AAG6TGAGGG AGTGGGAAGG GGGATQGAGA GCTOGAGGGG ACAGTGTGTT 6900 

25 TGAGCTGGAG TGCTGCGGGC AGCCTTTCTC ATGGAATGAC ATGAATCAAC nTimTO 6960 

GTTTCATCTT TTAAGTGTAC GTGCTTGCCT GTTCGTGCAT GTGTTCATAA ACTCAACACT 7020 

TTAATCATGG TTTCATGAGC ATTAAAAAGC AAAGGGAAAA AGGATGTGTA ATGGTGTACA 7080 

CAGTCTGTAT ATTTTAATAA TGCAGAGCTA TAGTCTCAAT TGTTACTTTA TAAGGTGGTT 7140 

TTATTAACAA ACCCAAATCC TGGATTTTCC TGTCTTTGCT GTATTTTGAA AAACACGTGT 7200 

30 TGACTCCATT GTTTTACATG TAGCAAAGTC TGCCATCTGT GTCTGCTGTA TTATAAACAG 7260 

ATAAGCAGCC TACAAGATAA CTGTATTTAT AAACCACTCT TCAACAGCTG GCTCCAGTGC 7320 

TGGTTTTAGA ACAAGAATGA AGTCATTTTG GAGTCTTTCA TGTCTAAAAG ATTTAAGTTA 7380 

AAAACAAAGT GTTACTTGGA AGGTTAGCTT CTATCATTCT GGATAGATTA CAGATATAAT 7440 

AACCATGTTG ACTATGGGGG AGAGACGCTO CATTCCAGAA ACGTCTTAAC ACTTGAGTGA 7500 

35 ATCTTCAAAG GACCCTGACA TTAAATGCTG AGGCTTTAAT ACACACATAT TTTATCCCAA 7560 

GTTTATAATG GTGGTCTGAA CAAGGCACCT GTAAATAAAT CAGCATTTAT GACCAGAAGA 7620 

AAAATAATCT GGTCTTGGAC TTTTTATTTT TATATGGAAA AGTTTTAAGG ACTTGGGCCA 7680 

ACTAAGTCTA CCCACACGAA AAAAGAAATT TGCCTTGTCC CTTTGTGTAC AACCATGCAA 7740 
AACTGTTTGT TGGCTCACAG AAGTTCTGAC AATAAAAGAT ACTAGCT 



40 



fieq ID NO: 207 Proftqirj sequence j 
Protein Accession #: NP 036466 



1 11 21 31 41 51 

45 | | | | | J 

MDNFFTEGTR VWLRENGQHF PSTVNSCAEG IWFRTDYGQ VFTYKQSTIT HQKVTAMHPT 60 

NEEGVDDMAS LTELHGGSIM YNLFQRYKRN QIYTYIGSIL ASVNPYQPIA GLYEPATMEQ 120 

YSRRHLGELP PHIPAIANEC YRCLWKRYDN QCILISGBSG AGKTBSTKLI LKFLSVISQQ 180 

SLELSLKEKT SCVERAILES SPIMEAFGNA KTVYNNNSSR FGKFVQLNIC QKGNIQGGRI 240 

50 VDYLLEKNRV VRQNPGERNY HIFYALLAGL EHEEREEFYL STPENYKYLN QSGCVEDKTI 300 

SDQESFREVI TAMDVMQPSK EEVREVSRUj AGILHLGNIE FITAGGAQVS FKTALGRSAE 360 

LLGLDPTQliT daltqrsmfi, rgbeiltpln vqqavdsrds LAMALYACCF EWVIKKIHSR 420 

IKGNEDFKSI GILDIFGFEN PHVNHFEQFN INYANEKLQE YFNKHIPSLE QLEYSREGLV 480 

WEDIDWIDNG ECLDLIBKKL GUALINEES HPPQATDSTL LEKLHSQHAN NHFYVKPRVA 540 

55 VNNFGVKHYA GEVQYDVRGI LEKNRDTFRD DLLNLLRESR FDFIYDLFEH VSSRNNQDTL 600 

KCGSKHRRPT VSSQFKDSLH SLMATLSSSN PFFVRCIKPN MQKMPDQFDQ AWLNQLRYS 660 

GMLETVRIRK AGYAVRRPFQ DFYKRYKVLM RNIALPEDVR 0KCT8LLQLY DASNSBKQLG 720 

KTKVFLRESL EQKLSKRREE EVSHAAMVIR AHVLGPIARK QYRKVLYCW IIQKNYRAFL 780 

LRRRFLHLKK AAIVFQKQLR GQIARRVYRQ LLAEKREQBB KKKQEBEEKK KREEEERERE 640 

60 RERREAELRA QQEEBTRKQQ BLBALQKSQK EAELTRELEK QKEHKQVEEI LRLBKBIEDL 900 

QRMKEQQBLS LTEASLQKLQ ERRDQELRRL EEEACRAAQE FLB9LNFDEI DBCVRNIERS 960 

LSVGSBFSSE LAESACEEKP NFNFSQPYPE EEVDBGFEAD DDAFKDSPNP SEHGHSDQRT 1020 

SGIRTSDDSS EEDPYMNDTV VPTSPSADST VUAPSVQDS GSLHNSSSGE STYCMPQNAG 1080 

DLPSPDGDYD YDQDDYEDGA ITSGSSVTPS NSYGSQWSPD YRCSVGTYNS SGAYRFSSEG 1140 

65 AQSSFEDSBB DFOSRFDTDD BLSYRRD3VY SCVTLPYFHS FLYKKGGL*1N SWKRRWCVLK 1200 

DETFLWPRSK QEALKQGWLH KKGGGSSTLS RRNWKKRWPV LRQSKLMYFE NDSEEKLKGT 1260 

VEVRTAKBII DNTTKENGID IIMADRTFHL IAESPBDASQ WFSVLSQVHA STDQBIQEMH 1320 

DEQANPQNAV GTIjDVGLIDS VCASDSPDRP NSFVIITANR VLHCNADTPE EMHHVflTLLQ 1380 

RSKGDTRVEG QEFTVRGWLH KSVKKSPKM8 SLKLKKKWFV LTHNSLDYYK SSEKNAKKLG 1440 

70 TLVLNSLCSV VPPDEKIFKE TGYWNVTVYG RKHCYRLYTK LLNRATRWSS AIQNVTDTKA 1500 

PIDTPTQQLI QDIKENCLNS DWEQIYKRN PILRYTHHPL HSPLLPLPYG DINLNLLKDK 1560 

GYTTLQDEAI KIFNSLQQLE SMSDPIPIIQ GILQTGHDLR PLRDELYCQL IKQTNKVPHP 1620 

GSVGNLYSWQ ILTCLSCTFL PSRGIUCYLK FHLKRIREQF PGTEMEKYAL FTYBSUOCTK 1680 

CREFVPSRDB IEALIKRQEM TSTVYCHGGG SCKITINSHT TAGBWEKLI RGLAMEDSRN 1740 

75 MFAIiFEYNGH VDKAIESRTV VADVLAKPEK IAATSBVGDL PWKFYFKLYC FLDTDNVPKD 1800 

SVEFAFMFEQ AHEAVTHGHH PAPEENLQVL AALRLQYLQG DYTLHAAIPP LEBVYSLQRL 1860 
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KARISQSTKT FTPCBRLEKR RTSFLEGTLR RSFRTGSWR QXVRBEQKLD MWIKEEVSSA 1920 

RASIIDKWRK FQGMNQEQAM AKYMALIKKW PGYGSTLFDV KCKEGGPPQB LWLGVSADAV 1980 

SVYKRGEGRP LBVFQYEHIIi SFGAPliANTY K1WDERELL FBTSEWDVA KLMKAYISMI 2040 
VKKRYSTTRS ASSQGSSR 

Seq ID NO: 308 DNA sequence 

Nucleic Acid Accession #: XM_059761.1 

Coding sequence] 124-535 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CGAAGATCTA TCCAAAATCA AGAAGCCTTT GATTTAGATQ TTGCTQTAAA AQAAAATAAA 60 

GATGATCTCA ATCATGTG6A TTTGAATGTG TGTACAAGCT TTTCOGGCCC GGGTAGGAGT 120 

GGCATGGCTC TTATGGAAOT TAACCTATTA AGTGGCTTTA T0OTGCCTTC AGAAOCAATT 180 

TCTCTGAGCG AGACAGTGAA GAAAGTGGAA TATGATCATG GAAAACTCAA CCTCTATTTA 240 

OATTCTGTAA ATGAAACCCA GTTTTaTGTT AATATTCCTQ CTGTGAGAAA CTTTAAACTT 300 

TCAAATACCC AAGATGCTTC AQTGTCCATA GTOOATTACT ATGAGCCAAO GAGACAOGCG 360 

GTGAGAAGTT ACAACTCTOA AGTGAAGCTQ TOCTCCTOTG ACCTTTOCAQ TGATQTCCAG 420 

GGCTGCCOTC CTTGTGAGGA TCGAGCTTCA GOCTCCCATC ATCACTCTTC AGTCATTTTT 480 

ATTTTCTOTT TCAAGCTTCT OTACTTTATO GAA CTTTO GC TOTGATTTAT TTTTAAAGGA 540 

CTCTOTGTAA CACTAACATT TCCAG TAQTC ACATGTGATT GTTTTOTTTT CGTAOAAGAA 600 

TACTGCTTCT ATTTTGAAAA AAGAGTTTTT TTTCTTTCTA TGGGGTTCCA GGGAT9GTGT 660 

ACAACAGGTC CTAQCATGTA TAGCTGCATA GATTTCTTCA CCTGATCTTT GTGTGGAAGA 720 

TCAGAATGAA TGCAGTTGTG TGTCTATATT TTCCCCTCTC AAAATCTTTT AGAATTTTTT 780 
TGGAGGTGTT TGTTTTCTCC AGAATAAAGG TATTACTTTA G 

Seq ID NO: 209 Protein sequence: 
Protein Accession ftt XP_0 59761.1 

1 11 21 31 41 51 

III 111 

MALMEVNLLS GFMVPSEAIS LSBTVKKVEY DHGKLNLYLD SVNETQPCVN IPAVRNFKVS 60 
NTQDASVSIV DYYEPRRQAV RSYNSBVKLS SCDLCSDVQG CRPCBDGASG SHHHSSVIFI 120 
PCFKLLYFKB LWL 



8eq ID NO: 210 

Nucleic Acid Accession ft: NM_015472 
Coding sequence: 258-1460 (underlined sequences correspond to start and stop codons) 



1 
I 

GACACACTCC 
AGCTCACATC 
AGAAGACATG 
GACTCTTATC 
TGGTGATTTT 
AGCAAGTGAT 
TCATGAATCC 
AGCCTGATTC 
GOCCTCGACT 
AGCTGGGCAC 
AGCAGTCCTA 
CGGCCACTGG 
CTAGGAAGGC 
CACCAGTGCC 
ACCAQCAQCA 
CCGCAGGGCT 
TGCGOCTTCA 
TGAGGCAGGA 
TTCAGGCTGC 
GCTCAGATCC 
GCCTGGGQTT 
ATGAGATGGA 
CCCGTTTCCC 
AATCTGAAGA 
CCTTTCTAAC 
TTCCTGGGCC 
GCCTCCATGA 
ATTGAGAGTA 



11 
I 

TCTACAACAC 
CTGGCGACTC 
TACATTCAGT 
TTGGCATCTG 
CCAGAAGATG 
CCACGTCACG 
GAAGCCTAGC 
GGGCTCGCAC 
GGCTGGGGGT 
CGGCGCGGGT 
CGACGTGACC 
CCAGAGGTAC 
GATGAATCAG 
TCAGAGGTCC 
GATGGCCCCC 
CATGAG7ATG 
GAGAATCCAG 
AGCTGCCCTC 
TGTCAACCCA 
TTTCCTCAAT 
AGGGTGCTAC 
TACAGGAGAA 
TGATTTCCTT 
CCTGATCCCC 
CTGGCTGTAA 
TCTTGGAAAA 
CTCGTGCTCC 
ACTTAAGTTA 



21 

I 

CAGAGACTCC 
TCAAGAGAGA 
ATCTATTTTG 
CTTCCTGGTG 
AATCCGGCCT 
CAGGACCTAG 
TCGTGGCGGA 
TCGCGCCAGT 
GCCCAGCATG 
GCTGCGGGTA 
GACGAGCTGC 
TTCCTCAATC 
CCTCTGAATC 
ATGGCAGTAT 
AGTACCCTGA 
CCCAATGCGC 
ATGGAGAGAG 
TGTCGACAGC 
CCCACGATGA 
GGAGGGCCAT 
AGTGTCCCCA 
AACGCAGGAC 
GACTGTCTTC 
CTCTTCAATG 
TCACTACCAT 
AGTGATGGAG 
CTCCTTTTTA 
AACATAAATA 



31 
I 

CAAACACAAG 
AACCTCAGAG 
GCATTTTCCC 
GATCTGAACT 
CGGCGCCCCC 
ACACAGACCT 
AGAAGATCCT 
CCAGCACCGA 
TCCGCTCGCA 
GCCCCGCGCA 
CACTGCCCCC 
ACATAGAAAA 
ATATGAACCT 
CCCAGCCAAA 



TGACCACTCA 
AAAGGATTCG 
TCCCCATGGA 
CCCCAGACAT 
ATCATTCGAG 
CAACTCOGGA 
AAACACCCAT 
CAGGAACAAA 
ATGTAGAGTC 
TGTAACTTGG 
CAGAGCAAGT 
TGTTGCCAGT 
AATATTCTAT 



41 

I 

GCCTTATATT 
TGACTAAAAT 
CAATACATCT 
GACCCATAAG 
TCCGCTCCCG 
CGAAGCCCTC 
GCCGGAGTCT 
CTCGTCGGGC 
CTCGTCGCCC 
GCAGCACGCG 
GGGCTGGGAG 
AATCACCACA 
CCACCCTGCC 
TCTCGTOATG 
CCACCCCACT 
GCAGCAGCAG 
AATGCGCCAA 
AGCTOAGACT 
GAGATCCATC 
GGAGCAGAGC 
GGACTTCCTC 
GAACATCAAT 
CGTTGACTTA 
TGCTCTGAAC 
ATGTAGCCAT 
CTGCAGGTGC 
TTAATCATTG 
TTTCATTTTC 



51 
I 

GACTCATTTC 
CTCCATAATG 
CTGCTCATCT 
CCACGCTTAC 
CCGCCTGGGC 
TTCAACTCTG 
TTCTTTAAGG 
GGCCACCCGG 
GCGTCCCTGC 
CACCTCCGCC 
ATGACCTTCA 
IGGCAAGACC 
GTCAGTTCCA 
AATCACCAAC 
CAGAACCCAC 



GAGGAGCTCA 
CTTGCCCCAfl 
ACTAATAATA 
ACTGACAGTG 
AGCAATGTGG 
CCCCAACAGA 
GGAACTTTGG 
AAAAGTGAGC 
GACCTTACAT 
ACCACTTCCC 
CCTGGTTTra 



Seq ID NO: 211 Protein sequence: 
Protein Accession « j NP_0S62B7.1 

1 11 21 31 

I I 1 I 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



41 

I 



51 
I 
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MNPASAPPPL PPPGQQVIHV TQDLDTDLEA LFNSVKNPKP SSWRKKJLPB SFFKEPDSGS €0 

HSRQSSTDSS GGHPGPRLAO GAQHVRSHSS PASLQLGTGA GAAGSPAQQH AHLRQQSYDV 120 

TDBLPLPPGW EMTFTATGQR .YPLNHISKIT TWQDPRKAMN QPLNKMNLHP AVSSTPVPQR 180 

SMAVSQPNLV MNHQHQQQMA PSTLSQQNHP TQNPPAGLMS KPNALTTQQQ QQQKLRLQRI 240 

QHERERZRMR QEELMRQEAA LCRQLPMRAE HAPVQAAVN PPTMTPDMRS ITNNSSDPPL 300 

HGGPYHSREQ STDSGLGLGC YSVPTTPEDF LSNVDEMDTG ENAGQTPMNI NPQQTRPPDP 360 
LDCLPGTUVD LOTLESBDLI PLFNDVESAL HKSEPPLTWL 



Seq ID NO: 212 pNA sequence 
Nucleic Acid Accession ftj KM_018174 

Coding sequence: 176-2194 (underlined sequences correspond to start and stop cod ens) 



CATCTCCCCC AACCTGGGGG TCGTGTTCTT CAACGCCTOC GAGGCCGCGT CGCGGCTGGC 60 

GOGOGOOQAG GATGAGGCGQ AGCTGGGGCT QAOCCTCCTG GCGCAGCTGO GCATCACGCC 120 

TCTGCCACTC AGCCGCGOCC CCGTGCCAGC CAAACCCACC GTGCTCTTCG AQAAGATQQG 180 

CGTGGGCOGG CTGGACATGT ATGTGCTGCA CCCGCCCTCC GCOQGOGCOa AGCGCACCCT 240 

GGCCTCTGTG TGCGCCCTGC TGGTGTOGCA CCCCGCCGGC CCOGGCGAfiA AGGTGOTOCO 300 

COTGCTGTTC CCCGGTTGCA CCCCGCCCGC CTCCCTCCTQ GACGGCCTGG TCCGCCTGCA 360 

GCACTTGAGG TTCCTGCGAG AGCCCGTGGT GACGCCCCAO GACCTGGAGG GGCCGGGGCG 420 

AGCCGAGAGC AAAQAGAGOG TGGGCTCCCG GQACAGCTOQ AAGAGAOAOG OCCTCCTGGC 480 

CACCCACCCT AGACCTGGCC AGGAGCGCCC TGGGGTGGCC CGCAAGGAGC CAGCACGGGC 540 

TGAGGCCCCA CGCAAGACTG AGAAAGAAGC CAAGACCCCC CGGGAGTTGA AGAAAOACCC 600 

CAAACCGAGT GTCTCCCGGA CCCAGCCOCO GGAGGTGCGC CGGGCAGCCT CTTCTGTGCC 660 

CAACCTCAAG AAGACGAATG CCCAGGCGGC ACCCAAGCCC CGCAAAGCGC CCAGCACGTC 720 

CCACTCTGGC TTCCCGCCGG TGGCAAATGG ACCCCGCAOC CCGCCCAGCC TCCGATGTGG 780 

AGAAGCCAGC COCCCCAGTG CAGCCTGCGG CTCTCCGGCC TCCCAGCTGG TGGCCACOCC 840 

CAGCCTGGAG CTGGGGCCGA TCCCAGCCGG GGAGGAGAAG GCACTGGAGC TGCCTTTGGC 900 

CGCCAGCTCA ATCCCAAGGC CAOGCACACC CTCCCCTGAG TCCCACCGOA GCCCCGCAGA 960 

GGGCAGCGAG CGGCTGTCGC TGAGCCCACT GCGGGGCGGG GAGGCCGGGC CAGACGCCTC 1020 

ACCCACAGTG ACCACACCCA CGGTGACCAC GCCCTCACTA CCCGCAGAGG TGGGCTCCCC 1080 

GCACTCOACC GAGGTGGAOG AGTCCCTGTC GGTGTCCTTT GAGCAGGTGC TGCCGCCATC 1140 

CGCCCCCACC AGTGAGGCTG GGCTGAGCCT CCCGCTSCGT GGCCCCCGGO CGOGGCGCTC 1200 

GGCTTCCCCA CACGATGTGG ACCTGTGCCT GGTGTCACCC TGTGAATTTO AOCATCGCAA 1260 

GGCGGTGCCA ATGGCACCGG CACCTGCGTC CCCCGGCAQC TOGAATGACA GCAGTGCCOG 1320 

GTCACAGGAA CGGGCAGGTG GGCTGGGGGC CGAGGAGACG CCACCCACAT CGGTCAGCGA 1380 

GTCCCTGCCC ACCCTGTCTG ACTCGGATCC CGTGCCCCTG GCCCCCGOTG CGGCAGACTC 1440 

AGACGAAGAC ACAGAGGGCT TTGGAGTCCC TCGCCACGAC CCXTTGCCTG ACCCCCTCAA 1500 

GGTCCCCCCA CCACTGCCTG ACCCATCCAG CATCTGCATG GTGGACCCCG AGATGCTGCC 1560 

CCCCAAGACA GCACGGCAAA CGGAGAACGT CAGCCGCACC CGGAAGCCCC TGGCCCGCCC 1620 

CAACTCACGC GCTGCCGCCC CCAAAGCCAC TCCAGTGOCT GCTGCCAAAA CCAAGGGGCT 1680 

TGCTGGTGGG GACCGTGCCA GCCGACCACT CAGTGCCOGG AGTGAGCCCA GTGAGAAGGG 1740 

AGGCCGGGCA CCCCTGTCCA GAAAGTCCTC AACCCCCAAG ACTGCCACTC GAGGCCCGTC 1800 

GGGGTCAGCC AGCAGCCGGC CCGGGGTGTC AOCCACCCCA CCCAAGTCCC CGGTCTACCT 1860 

GGACCTGGCC TACCTGCCCA GCGGGAGCAG CGCCCACCTG GTQGATGAGG AGTTCTTCCA 1920 

GCGCGTGCGC GCGCTCTGCT ACOTCATCAG TGGCCAGGAC CAGCGCAAGG AGGAAGGCAT 1980 

GCGGGCCGTC CTGGACGCGC TACTGGCCAO CAAGCAGCAT TGGGACCGTG ACCTGGAGGT 204 0 

GACCCTGATC CCCACTTTCG ACTCGGTGGC CATGCATACG TGGTACGCAG AGACGCACOC 2100 

CCGGCACCAG GCGCTGGGCA TCACGGTGTT GGGCAGCAAC GGCATGGTGT CCATGCAGQA 2160 

TGACGCCTTC CCGGCCTGCA AGGTGGAGTT CIAGCCCCAT OGCOGACACG CCCCCCACTC 2220 
AGCCCAGCCC GCCTGTCCCT AGATTCAGCC ACATCAGAAA TAAACTGTGA CTACACTTG 

Seq ID NO: 213 Protein sequeqeg; 
Protein Accession #: NPJJ60644.1 

MGVGRLDMYV LHPPSAGAER TIASVCALLV WHPAGPGBKV VRVLFPGCTP PACLLDGLVR 60 

LQKLRPLREP WTPQDLEGP GRAESKESVG SRDSSKREGL LATHPRPGQE RPGVARKEPA 120 

RAEAPRKTEK EAKTPRELKX DPKPSVSRTQ PRBVRRAASS VPNUOCTNAQ AAPKPRKAPS 180 

TSHSGPPPVA NGPRSPPSLR CGEASPPSAA GGSPASQLVA TPSLELGPIP AGEEKALELP 240 

LAASSIPRPR TPSPESHRSP AEGSBRI>SLS PLRGGBAGPD ASPTVTTPTV TTPSLPAEVO 300 

SPHSTEVDES LSVSPEQVLP PSAPTSBAGL SLPLRGPRAR RSASPHDVDL CLVSPCEPEH 360 

RKAVPMAPAP ASPGSSNDSS ARSQBRAGGL GABETPPTSV SESLPTLSDS DPVPLAPGAA 420 

DSDEDTEGPG VPRHDPLPDP LKVPPPLPDP SSICMVDPEM LPPKTARQTE NVSRTRKPIA 480 

RPNSRAAAPK ATPVAAAKTK GLAGGDRASR* PLSARSEPSE KGGRAPLSRK SSTPKTATRG 540 

PSGSASSRPG VfiATPPKSPV YLDLAYLPSG SSAHLVDEBP FQRVHALCYV ISGQDQRKEE 600 

GMRAVLDALL ASKQHWDRDL QVTLIPTPDS VAMHTWYAET HARHGALGIT VW3SNGMVSM 660 
QDDAFPACKV EP 



Seq ID NO: 214 PNA. frequence 

Nucleic Acid Accession #: NM_002019.1 

Coding sequence: 250-4266 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

III III 

GCGGACACTC CTCTCGGCTC CTCCCCGGCA GCGGCGGCGG CTCGGAGCGG QCTCCGQGGC 60 

TCGGGTGCAG CGGCCAGCGG GCCTGGCGGC GAGGATTACC CGGGGAAGTG OTTGTCTCCT 120 
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GGCTGGAGCC GCGAGACGGG CGCTCAGGGC 
GACTCTGGCG GCCGGGTCGT TGGCCGGGGG 
GCQCTCAC Cft TG3TCAGCTA CTGGGACACC 
CTGCTTCTCA CAGGATCTAG TTCAGGTTCA 
5 GGCACCCAGC ACATCATGCA AGCAGGCCAG 
GCCCATAAAT GGTCTTTGCC TGAAATGGTG 
AAATCTGCCT GTGGAAGAAA TGGCAAACAA 
CAAGCAAACC ACACTGGCTT CTACAGCTGC 
AAGGAAACAG AATCTGCAAT CTATATATTT 

10 ATGTACAGTG AAATCCCCGA AATTATACAC 
TGCOGGGTTA CGTCACCTAA CATCACTGTT 
ATCCCTGATG GAAAACGCAT AATCTGGGAC 
ACGTACAAAG AAATAGGGCT TCTGACCTGT 
ACAAACTATC TCACACATOG ACAAACCAAT 

15 CGCCCAGTCA AATTACTTAO AOOCCATACT 
TTGAACAOGA GAGTTCAAAT GACCTGGAGT 
OTAAGGCGAC GAATTGACCA AAGCAATTCC 
ATTGACAAAA TQCAGAACAA AGACAAAGGA 
TCATTCAAAT CTGTTAACAC CTCAGTGCAT 

20 CATCGAAAAC AGCAGGTGCT TGAAACCGTA 
AAAGTGAAGO CATTTCCCTC GCCGGAAGTT 
GAGAAATCTG CTCGCTATTT GACTGGTGGC 
GAGGATGCAG GGAATTATAC AATCTTGCTG 
CTCACTGCCA CTCTAATTGT CAATGTGAAA 

25 TTTCCAOACC CGGCTCTCTA CCCACTGGGC 
GGTATCCCTC AACCTACAAT CAAGTGGTTC 
GCAAGGTGTG ACTTTTGTTC CAATAATGAA 
ATGGGAAACA GAATTGAGAG CATCACTCAG 
ATGGCTAGCA CCTTGOTTGT GGCTGACTCT 

30 TCCAATAAAG TTGGGACTGT GGGAAGAAAC 
GGGTTTCATG TTAACTTGGA AAAAATGCCG 
ACAGTTAACA AGTTCTTATA CAGAGACGTT 
AGAACAATGC ACTACAGTAT TAGCAAGCAA 
ACTCTTAATC TTACCATCAT GAATGTTTCC 

35 GCCAGGAATQ TATACACAGG GGAAGAAATC 
CAGGAAGCAC CATACCTCCT GCGAAACCTC 
ACCACTTTAG ACTGTCATGC TAATGGTGTC 
AACCACAAAA TACAACAAGA GCCTGGAATT 
ATTGAAAGAG TCACAGAAGA GGATGAAGGT 

40 GGCTCTGTGG AAAGTTCAGC ATACCTCACT 
GAGCTGATCA CTCTAACATG CACCTGTGTG 
CTCCTTATCC GAAAAATGAA AAGGTCTTCT 
ATAATGGACC CAGATGAAGT TC CTTTOG AT 
AGCAAGTGGG AGTTTGCCCG GGAGAGACTT 

45 TTTGGAAAAG TGGTTCAAGC ATCAGCATTT 
GTGGCTGTGA AAATGCTGAA AGAGGGGGCC 
GAGCTAAAAA TCTTGACCCA CATTGGCCAC 
TGCACCAAGC AAGGAGGGCC TCTGATGGTG 
TCCAACTACC TCAAGAGCAA ACGTGACTTA 

50 ATGGAGCCTA AGAAAGAAAA AATGGAGCCA 
GATAGCGTCA CCAGGAGCGA AAGCTTTGCG 
AGTGATGTTG AGGAAGAGGA GGATTCTGAC 
GATCTGATTT CTTACAGTTT TCAAGTGGCC 
TGCATTCATC GGGACCTGGC AGCGAGAAAC 

55 ATTTGTGATT TTGGCCTTGC CCGGGATATT 
GATACTCGAC TTCCTCTGAA ATGGATGGCT 
ACCAAGAGCG ACGTGTGGTC TTACGGAGTA 
TCTCCATACC CAGGAGTACA AATGGATGAG 
AGGATOAGAG CTCCTGAGTA CTCTACTCCT 

60 CACAGAGACC CAAAAGAAAG GCCAAGATTT 
CTTCAAGCAA ATGTACAACA GGATGGTAAA 
GGAAATAGTG GGTTTACATA CTCAACTCCT 
ATTTCAGCTC CGAAGTTTAA TTCAGGAAGC 
AAGTTCATGA GCCTGGAAAQ AATCAAAACC 

65 ATGTTTGATG ACTACCAGGG OGACAGCAGC 
TTCACCTGGA CTGACAGCAA ACCCAAGGCC 
AAAAGTAAGG AGTCGGGGCT GTCTGATGTC 
GGGCACGTCA GCGAAGGCAA GCGCAGGTTC 
ATCGCOTGCT GCTCCCCGCC CCCAGACTAC 

70 ATCTAGAGrr TGACACGAAG CCTTATTTCT 
AACTAGCTTT TGCCAGTATT ATGCATATAT 
CAGCTGCTTT TTGTGATTTT TTTAATAGTG 
TCCAGATAGA GAAATAGTGA CAAGTGAAGA 
TGTTAGAGAA ATCCTTCCTA AACCCAATGA 

75 GCACGCAGGA CCAGTTTGAT TGAGGAGCTG 
CTGGGCCAGC CCTGCAGCCC AAAACCCAGG 



GGGGGGCOGG CGGOGGCGAA OGAGAGGACG 180 

AGCGCGGGCA COGGGOGAGC AGGCOGOGTC 240 

GGGGTCCTGC TGTGCGCGCT GCTCAGCTGT 300 

AAATTAAAAG ATCCTGAACT GAGTTTAAAA 360 

ACACTGCATC TCCAATGCAG GGGGGAAGCA 420 

AGTAAGGAAA GCGAAAGGCT GAGGATAACT 480 

TTCTGCAGTA CTTTAACCTT GAACACAGCT 540 

AAATATCTAG CTGTACCTAC TTCAAAGAAG 600 

ATTAGTGATA CAGGTAGACC TTTCGTAGAG 660 

ATGACTGAAG GAAGGGAGCT CGTCATTCCC 720 

AjCTTTAAAAA AGTTTCCACT TGACACTTTQ 780 

AGTAGAAAGG OCTTCATCAT ATCAAATGCA 840 

GAAGCAACAG TCAATGGGCA TTTGTATAAG 900 

ACAATCATAG ATGTCCAAAT AAGCACACCA 960 

CTTGTCCTCA ATTGTACTGC TACCACTCCC 1020 

TACCCTGATG AAAAAAATAA GAGAGCTTCC 1080 

CATGCCAACA TATTCTACAG TGTTCTTACT 1140 

CTTTATACTT GTCGTGTAAG GAGTGGACCA 1200 

ATATATGATA AAGCATTCAT CACTGTGAAA 1260 

GCTGGCAAGC GGTCTTACCG GCTCTCTATG 1320 

GTATGGTTAA AAGATGGGTT ACCTGCGACT 1380 

TACTCGTTAA TTATCAAGGA CGTAACTGAA 1440 

AGCATAAAAC AGTCAAATGT GTTTAAAAAC 1500 

CCCCAGATTT ACGAAAAGGC CGTGTCATCG 1560 

AGCAGACAAA TCCTGACTTG TACCGCATAT 1620 

TGGCACCCCT GTAACCATAA TCATTCCGAA 1680 

GAGTCCTTTA TCCTGGATGC TGACAGCAAC 1740 

CGCATGGCAA TAATAGAAGG AAAGAATAAG 1800 

AGAATTTCTG GAATCTACAT TTGCATAGCT 1860 

ATAAGCTTTT ATATCACAGA TGTGCCAAAT 1920 

ACGGAAGGAG AGGACCTGAA ACTGTCTTGC 1980 

ACTTGGATTT TACTGCGGAC AGTTAATAAC 2040 

AAAATGGCCA TCACTAAGGA GCACTCCATC 2100 

CTGCAAGATT CAGGCACCTA TGCCTGCAGA 2160 

CTCCAGAAGA AAGAAATTAC AATCAGAGAT 2220 

AGTGATCACA CAGTGGCCAT CAGCAGTTCC 2280 

CCOGAGCCTC AGATCACTTG GTTTAAAAAC 2340 

ATTTTAGGAC CAGGAAGCAG CACGCTQTTT 2400 

GTCTATCACT GCAAAGCCAC CAACCAGAAG 2460 

GTTCAAGGAA CCTCGGACAA GTCTAATCTG 2520 

GCTGCGACTC TCTTCTGGCT CCTATTAACC 2580 

TCTGAAATAA AGACTGACTA CCTATCAATT 2640 

GAGCAGTGTG AGCGGCTCCC TTATGATGCC 2700 

AAACTGGGCA AATCACTTGG AAGAGGGGCT 2760 

GGCATTAAGA AATCACCTAC GTGCCGGACT 2820 

ACGGCCAGCG AGTACAAAGC TCTGATGACT 2880 

CATCTGAACG TGGTTAACCT GCTGGGAGCC 2940 

ATTGTTGAAT ACTGCAAATA TGGAAATCTC 3000 

TTTTTTCTCA ACAAGGATGC AGCACTACAC 3060 

GGCCTGGAAC AAGGCAAGAA ACCAAGACTA 3120 

AGCTCCGGCT TTCAGGAAGA TAAAAGTCTG 3180 

GGTTTCTACA AGGAGCCCAT CACTATGGAA 3240 

AGAGGCATGG AGTTCCTGTC TTCCAGAAAG 3300 

ATTCTTTTAT CTGAGAACAA CGTGGTGAAG 3360 

TATAAGAACC CCGATTATGT GAGAAAAGGA 3420 

CCCGAATCTA TCTTTGACAA AATCTACAGC 3480 

TTGCTGTGGG AAATCTTCTC CTTAGGTGGG 3540 

GACTTTTGCA GTCGCCTGAG GGAAGQCATO 3600 

GAAATCTATC AGATCATGCT GGACTGCTGG 3660 

GCAGAACTTG TGGAAAAACT AGGTGATTTG 3720 

GACTACATCC CAATCAATGC CATACTGACA 3780 

GCCTTCTCTG AGGACTTCTT CAAGGAAAGT 3840 

TCTGATGATG TCAGATATGT AAATGCTTTC 3900 

TTTGAAGAAC TTTTACCGAA TGCCACCTCC 3960 

ACTCTGTTGG CCTCTCCCAT GCTGAAGCGC 4020 

TCGCTCAAGA TTGACTTGAG AGTAACCAGT 4080 

AGCAGGCCCA GTTTCTGCCA TTCCAGCTGT 4140 

ACCTACGACC ACGCTGAGCT GGAAAGGAAA 4200 

AACTCGGTGG TCCTGTACTC CACCCCACCC 4260 

AGAAGCACAT GTGTATTTAT ACCCCCAGGA 4320 

AAGTTTACAC CTTTATCTTT CCATGGGAGC 4380 

CTTTTTTTTT TTGACTAACA AGAATGTAAC 4440 

ACACTACTGC TAAATCCTCA TGTTACTCAG 4500 

CTTCCCTGCT CCAACCCCCG CCACCTCAGG 4560 

CACTGATCAC CCAATGCATC ACGTACCCCA 4620 

GCAACAAGCC CGTTAGCCCC AGGGGATCAC 4660 
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TGGCTGGCCT GAGCAACATC TCGGGAOTCC TCTAGCAGGC CTAAGACATG TGAGGAGGAA 4740 

AAGGAAAAAA AGCAAAAAGC AA66GAGAAA AGAGAAACCG GGAGAAGGCA TGAGAAAOAA 4800 

TTTGAGACGC ACCATGTGGG CAOOGAGGGG GACGGGGCTC AGCAATGCCA TTTCAGTGGC 4660 

TTCCCAOCTC TGACCCTTCT ACATTTGAGG GCCCAGCCAG GAGCAGATGG ACAGCGATGA 4920 

5 GGGGACATTT TCTGGATTCT GGGAGGCAAG AAAAGGACAA ATATCTTTTT TGGAACTAAA 4980 

GCAAATTTTA GACCTTTACC TATGGAAGTG OTTCTAT0TC CATTCTCATT COTGOCATOT 5040 

TTTGATTTGT AGCACTGAGG OTGGCACTCA ACTCTGAGCC CATACTTTTG GCTCCTCTAG 5100 

TAAGATGCAC TGAAAACTTA GCCAGAGTTA GGTTQTCTCC AGGCCATGAT GGCCTTACAC 5160 

TGAAAATGTC ACATTCTATT TTGGGTATTA ATATATAGTC CAOACACTTA ACTCAATTTC 5220 

10 TTGOTATTAT TCTGTTTTGC ACAGTTAOTT GTGAAAGAAA GCTGAOAAGA ATGAAAATGC S280 

AGTCCTGAGG AOAOTTTTCT CCATATCAAA ACGAQGGCTQ ATGGAGGAAA AAGGTCAATA 5340 

AOOTCAAGGG AAGACCCOQT CTCTATACCA ACCAAACCAA TTCACCAACA CAGTTGGOAC 5400 

CCAAAACACA GGAAGTCAOT CACGTTTCCT TTTCATTTAA TGGGGATTCC ACTATCTCAC 5460 

ACTAATCTGA AAGGATGTOO AAGAGCATTA GCTGGCGCAT ATTAAGCACT TTAAGCTCCT 5520 

15 TGAGTAAAAA GGTGQTATGT AATTTATQCA AGGTATTTCT CCAGTTGGGA CTCAGGATAT 5580 

TAGTTAATGA GCCATCACTA GAAQAAAAGC CCATTTTCAA CTGCTTTGAA ACTTOCCTGG 5640 

GGTCTGAGCA TGATGGGAAT AGGGAGACAO GGTAGGAAAG GGCGCCTACT CTTCAGGGTC 5700 

TAAAGATCAA GTGGGCCTTQ GATCGCTAAG CTGGCTCTGT TTGATGCTAT TTATGCAAGT 5760 

TAGOGTCTAT GTATTTAGGA TGCOCCTACT CTTCAOGGTC TAAAGATCAA GTGGGCCTTG 5820 

20 GATCGCTAAG CTGGCTCTGT TTGATGCTAT TTATGCAAGT TAGOGTCTAT GTATTTAGGA 5880 

TGTCTGCACC TTCTGCAGCC AOTCAGAAGC TGGAGAGGCA ACAGTGGATT GCTGCTTCTT 5940 

GGGGAGAAGA GTATGCTTCC TTTTATCCAT GTAATTTAAC TGTAGAACCT GAGCTCTAAG 6000 

TAACCGAAGA ATGTATGCCT CTGTTCTTAT GTGCCACATC CTTGTTTAAA GGCTCTCTGT 6060 

ATGAAGAGAT GGGACCGTCA TCAGCACATT CCCTAGTGAG CCTACTGGCT CCTGGCAGCG 6120 

25 GCTTTTGTQG AAGACTCACT AGCCAGAAGA GAGGAGTGGG ACAGTCCTCT CCACCAAGAT 6180 

CTAAATCCAA ACAAAAGCAG GCTAGAGCCA GAAGAGAGGA CAAATCTTTG TTGTTCCTCT 6240 

TCTTTACACA TACGCAAACC ACCTGTGACA GCTGGCAATT TTATAAATCA GGTAACTGGA 6300 

AGGAGGTTAA ACTCAGAAAA AAGAAGACCT CAGTCAATTC TCTACTTTTT TTTTTTTTTT 6360 

TCCAAATCAG ATAATAGCCC AGCAAATAGT GATAACAAAT AAAACCTTAG CTGTTCATGT 6420 

30 CTTGATTTCA ATAATTAATT CTTAATCATT AAGAGACCAT AATAAATACT CCTTTTCAAG 6480 

AGAAAAGCAA AACCATTAGA ATTGTTACTC AGCTCCTTCA AACTCAGGTT TGTAGCATAC 6540 

ATGAGTCCAT CCATCAGTCA AAGAATGGTT CCATCTGGAG TCTTAATGTA GAAAGAAAAA 6600 

TGGAGACTTG TAATAATGAG CTAGTTACAA AGTGCTTGTT CATTAAAATA GCACTGAAAA 6660 

TTGAAACATG AATTAACTGA TAATATTCCA ATCATTTGCC ATTTATGACA AAAATGOTTG 6720 

35 GCACTAACAA AGAACGAGCA CTTCCTTTCA GAGTTTCTGA GATAATGTAC GTGGAACAGT 6780 

CTGGQTGGAA TGGGGCTGAA ACCATGTGCA AGTCTGTGTC TTGTCAGTCC AAGAAGTGAC 6840 

ACCGAGATGT TAATTTTAGG GACCCGTGCC TTGTTTCCTA GCCCACAAGA ATGCAAACAT 6900 

CAAACAGATA CTCGCTAGCC TCATTTAAAT TGATTAAAGG AGGAGTOCAT CTTTGGCCGA 6960 

CAGTGGTGTA ACTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TOTGGQTGTG 7020 

40 GGTGTATGTG TGTTTTGTGC ATAACTATTT AAGGAAACTG GAATTTTAAA GTTACTTTTA 70 80 

TACAAACCAA GAATATATGC TACAGATATA AGACAGACAT GGTTTGGTCC TATATTTCTA 7140 

GTCATGATQA ATGTATTTTG TATACCATCT TCATATAATA TACTTAAAAA TATTTCTTAA 7200 

TTGGGATTTG TAATCGTACC AACTTAATTG ATAAACTTGG CAACTGCTTT TATGTTCTGT 7260 

CTCCTTCCAT AAATTTTTCA AAATACTAAT TCAACAAAGA AAAAGCTCTT TTTTTTCCTA 7320 

45 AAATAAACTC AAATTTATCC TTGTTTAGAG CAGAGAAAAA TTAAGAAAAA CTTTGAAATG 7380 

GTCTCAAAAA ATTGCTAAAT ATTTTCAATG GAAAACTAAA TGTTAGTTTA GCTGATTGTA 7440 

TGGGGTTTTC GAACCTTTCA CTTTTTQTTT GTTTTACCTA TTTCACAACT GTGTAAATTG 7500 

CCAATAATTC CTGTCCATGA AAATGCAAAT TATCCAGTGT AGATATATTT GACCATCACC 7560 

CTATGGATAT TGGCTAGTTT TGCCTTTATT AAGCAAATTC ATTTCAGCCT GAATGTCTGC 7620 

50 CTATATATTC TCTGCTCTTT GTATTCTCCT TTGAACCCGT TAAAACATCC TGTGGCACTC 

Seq ID NO; 215 Protein frequence; 
Protein Accession #: NP_002010.1 

55 

21 31 41 51 

III! 

TGSSSGSKLK DPELSLKGTQ HIMQAGQTLH LQCRGBAAHK 60 

CGRNGKQFCfl TLTLNTAQAN HTOFYSCKYL AVPTSKKKBT 120 

EIPEIIHMTE GRBLVIPCRV TSPH1TVTLK KFPU3TLIPD 180 

EIGLLTCBAT VNGHLYKTNY LTHRQTWTII DVQISTPRPV 240 

RVQMTWSYPD EKNKRASVRR R1DQSNSHAN IFYSVLTIDK 300 

SVNT3VHIVD KAPITVKHRK QQVLSTVAGK RSYRLSMKVK 360 

ARYLTRGYSL IIKDVTBEDA ONYTILLSIK QSNVFKNLTA 420 

PALYPLGSRQ ILTCTAYGIP QPTIKWFWHP CNHNHSBARC 460 

RIESITQRMA IIEGKNKKAS TLWADSRIS GIYICIASNK 540 

VHLEKMPTEG EDUOSCTVH KFLYRDVTWI LiIiRTVNNRTM 600 

LTIMNVSLQD SGTYACRARN VYTGBEILQK KEITIRDQEA 660 

DCHANGVPEP QITWKNHHK IQQEPGIILG PGSSTLPIER 720 

KSSAYLTVQG TSDKSNLBLI TLTCTCVAAT LFWLLLTLLI 780 

PDBVPLDEQC ERLPYDASKW EPARERLKLG KBLGRGAFGK 840 

KMLKEGATAS BYKALMTHLK ILTHIGHHLN WNIiLGACTK 900 

LKSKRDLPFL NKDAALHMEP KKEKMBPGLE QGKKPRLDSV 960 

EBEEDSDGPY KEPITMEDLI SYSPQVARGM EFLSSRKCIH 1020 

FGLARDIYKN PDYVRKGDTR LPLKWMAPES IFDKIYSTKS 1080 

PGVQMDEDFC SRLRBGMRHR APBYSTPEIY QIMLDCWHRD 1140 



1 11 

I I 
MVSYWDTGVL LCALI^CLLL 
WSLPEMVSKE SBRLSITKSA 

60 ESAIYTPISD TGRPFVBMYS 
GKRIIWDSRK GPIISNATYK 
KLLRGHTLVL NCTATTPLNT 
KQNKDKGLYT CRVRSGPSPK 
AFPSPEWWL KDGLPATEKfi 

65 TLIVNVKPQI YEKAVSSFPD 
DFCSNNEESF ILDADSNMGN 
VOTVGRNISP YITDVPHGPH 
HYSISKQKMA ITKEHSITLN 
PYLLRNIiSOH TVAISSSTTL 

70 VTEEDEGVYH CKATKQKGSV 
RKMKRSSSBI KTDYLSIIMD 
WQASAFGIK KSPTCRTVAV 
QGGPLMVIVE YCKYGNLSNY 
TSSBSPASSG PQSDK8LSDV 

75 RDLAARNILL SENNWKICD 
DVWSYGVLUf BIFSLGGSPY 
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PKERPRFAEli VEKLGDLLQA NVQQDGKDYI PINAILTGNS GFTYSTPAFS EDPFKE5ISA 1200 

PKFNSGSSDD VRYVNAFKFM SLERIKTFEB LLPNATSMFD DYQGDSSTLL A8PMLKRFTW 1260 

TDSKPKASLK IDLRVTSKSK ESGLSDVSRP SPCHSSCGHV SSGKRRFTYD HAELBRKXAC 1320 
CSPPPDYNSV VLY9TPPI 



Seq ID NO» 216 DNA sequence 

Nucleic Acid Accession #» NMJJ24689 

Coding sequence: 76-624 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CTCTTTOGCC AAOCCCTGCC TCTOTACAOC CTCGAGTGGA CAGCCAGAGG CTOCAGCTGG 60 

AGCCCAGAGC CCAAGATOQA GCCCCAGCTG GGGCCTGAGG CTGCOOCCCT CCOCCCTOOC 120 

TGGCTGGCCC TGCTGCTGTQ GGTCTCAGCC CTGAGCTGTT CTTTCTCCTT GCCAGCTTCT 180 

TCCCTTTCTT CTCTGGTOCC CCAAGTCAGA ACCAGCTACA ATTTTGOAAG GACTTTCCTC 240 

GGTCTTGATA AATGCAATGC CTGCATCGGG ACATCTATTT GCAAGAAGTT CTTTAAAGAA 300 

GAAATAAGAT CTGACAACTG GCTGGCTTCC CACCTTCGAC TGCCTCCCGA TTCCTTGCTT 360 

TCTTATCCTG CAAATTACTC AGATGATTCC AAAATCTGGC GCCCTGTGGA OATCTTTAGA 420 

CTGGTCAGCA AATATCAAAA OGAGATCTCA GACAGGAAAA TCTGTOCCTC TGCATCAGCC 480 

CCAAAGACCT GCAGCATTOA GCGTGTCCTG CGGAAAACAG AOAGGTTCCA GAAATGGCTG S40 

CAGGCCAAGC GCCTCACGCC GGACCTGGTG CAGGACTGTC ACCAGGGCCA GAGAGAACTA 600 

AAGTTCCTGT GTATGCTGAG ATAAC ACCAG TOAAAAAGCC TOGCATGGAG CCCAGCACTG 660 

AGAACTTCCA GAAAGTGTTA GCCTTCTCCC AACTGTGTTA TACCAACCAC ATTTTCAAAT 720 

AGTAATCATT AAAGAGGCTT CTGCATCAAA CCTTCACATG CAGCTCCCAT GCCACCCTCC 780 

AGAATTCACC AACACACAGG CCCACCAGCA ACAGGCTACC TTTGCACAAT ATTCTCTGAT B40 

GACAACTCCA AAGCCCCGGC TCTTTCCACC ACACTGTGGT CCCCTAGATG GGGCTGTTGC 900 

TGAGOCCACC CCAATCCAGA TGTGATCCCC CTGTGATCTA CTTCTGGCAA GATTCTCAGT 960 

CTGGACAGGT CTTCCCTATG AGATAGAACC TGATAAGGAG CTAGGGCAAT TCTGACAACA 1020 

TTACCAAAGG CCCACATAAC TTCTAAATTT TGGTCTGGTC TGAAGGAAAA CCTGTTCTCG 1080 

CCCTAGTGAT GGATGAACTC TCTTATCTCT GGCTTCTAGA GGGAAAAAAA AAGCATACCT 1140 

CTTTTACTTT TTAAGTACCT CCATCAGAGT CATQAAATCA CCTGTCAAGA CTATCTATCT 1200 

TTTATGTTTC CATTCTGGTA AGAACTCTTT AAATGAGGAG ACTGCTGATT GCTGGTGATG 1260 

TTTTTTGAGC AAACACTCGG GGGTATGGAT GAAAGCCAAT CGCAGGTCAA ATGACTCCTT 1320 

GGGGAAGCTA CTTCTCCTCT ATTCAGATTT CACTAAAATC TTCCAAGATO AAAGCAAATC 1380 

TAGATTTCGG TCTTCATTGC TOTCCATTPT TGTAATGAAC GAOTGTTTTT CCTTTAGCTA 1440 

GTGTATCAGG CAGGGTTCTA CCAGAGAAAC AGAACCAGTA GGAGATACAT ATACATGTCC 1500 

AGATTTATTT CAAAGAATTG ATTTACATGA TTGTGGGGAT TGGCAAGTCC AAAATCCATA 1560 

TGGTAGGCCT GCAATCTGTA AACCTTTGGG CAGGAGCTGA TGCTGTAGTT TGCAGATAGA 1620 

ATTCCTTGTT CCTTAAAAAA ATCTGTTTTT GTTCTTAAOG GCTTTGAATG ATTGGATCAG 1680 

GCCCACCCAG ATTACCTAGA TAATCTCTTT TACTTAAAGT AAACTGATTG TAGGTGCTAA 1740 

TCACATCTAT GAAATGCCTT CACAGCAACA CCTAGATTAG CATTCAATTG AATAACTGGG 1800 

GAATACAGCC TAGCCAAGTT GACACATAAA ATTAACCATC ACAGCAACAT GCCTGCTAAA 1860 

TTTTATCGAC CGTCTTCAGA CTGTTAAGGA TTGTGGTAGA GAACTGTGAC AGCCACTCTC 1920 

AGCATCACCC TGAACCAAAG GCCCCTATCA AGTAACAATA TAGCCAAGCA AAATTCCAGT 1980 

CAATAGAGAC ATTGACTGGT TGGCTGGCTT CCCAAGGGAT AGCACCAGAC AAQAAATGCA 2040 

AGGATGAGGA AACCAGGCAC GGGAGAGGGA GGGGCAACAG AGGTCCAGGG TTTGGTTATC 21O0 

TTTTTATTTT TCACTGGGAG GTGGTAAGTT AGCCCTGTTG CCCATGTATG CAGATGGGAG 2160 

AAGTGATTTA GAAACTCCAA AGCAATTGGT AATCCCCAAA ATGGGTGTAT CTGGTTTGAA 2220 

ATGAAACCTT ATTTTATTGO AAATGGTTGG TTTCCCAATT CTGTTTGCCA TTGGCCAATA 2280 

TAATTGTGGG TTTGCACATG GCCAGCACAT GCCAAACAGA AGTAGACAAA GGTCTCACTC 2340 

TGTAAGTGGG ACCTTGGGGA GGAGCTGCCT CCATCATAAA GGGAGGGGTT AGTAAAAATG 2400 

GTCTCTTAAG CCTGTTCCTG CTACAGTTAT AGAGGTTGCT CAGAACCTTC TCAGCAAATA 2460 
TAGCAGTTAT CTATTGTTGT GTATTAAACC ATTTCAACAC AT 



Seq ID NO: 217 Protein se quence: 
Protein Accession #: NP_078965.1 

1 11 21 31 41 51 

| || 111 

MEPQDGPRAA AIiRPGWLALL LWVSALSCSF SLPASSLSSL VPQVRTSYNF GRTPIGLDKC 60 
NACIGTSICK KFFKBBIRSD NWIASHLGLP PDSLLSYPAN YSDDSKIMRP VEIFRLVSKY 120 
QNEISDRKIC ASASAPKTCS IBRVLRKTBR FQKWLQAKRL TPDLVQDCHQ GQRELKFLCM 180 
LR 



Seq ID NO: 218 pflft sequence, 

Nucleic Acid Accession Jh AF075 027.1 

Coding sequence: 3-269 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

III III 

QATTAATTAA GTGCTTTAAA CGGTCTTGGT AAATATTCCG CGGGAGCTGG GGAGGACCGT 60 

TGGGATGGCT GTAGCTTGAG TTGAATTTTA ACTGTCCTCA TTCTGGGTTT TGTCGCTCTG 120 

CTTTCTGTGC CAAGGTGCTG TOTTACGGGA GAGAGTGACT GGAAAGTAAC AAAGCTGAAT 180 

CTTTCTCCCT GGAGTAAGGC CGAAGACTGG ATTACTACAC GCCTAGACGT GACACTACAC 240 

CCATAGATCT CATGCATCAT TAATGCCA TA TGACATTGCC ATTTTCTTTC TCAGTTCACG 300 

QACAAAAGTG GTGGGTTTTC ATTGTCTTCA CTGATTGTCA ATGCATTAAT AAAGAAGATG 360 
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TQTGGT 

Seq 10 NO: 219 Protein sequence! 
Protein Accession 8: AF075027 



1 11 21 31 41 51 

I i I I I 1 

ERKWQCHMAL MMHEIYGCSV TSRRWIQSS ALLQGSRP8P VTFQSLSPVT QHLGTESRAT 
KPRMRTVKIQ LKLQPSQRSS PAPABYLPRP FKALN 

Seq ID HO: 220 DNA geouepee 
Kucleic Acid Accession «: AL133411.8 
Coding sequence: 1-1395 (underlined sequences 



correspond to start and atop codons) 



1 
I 

AJJGGGCAAGG 
AAATQGGATC 
AACAOOCAAC 
ACATCCAGAA" 
AAAAAGGACA 
AACCAAACAC 
CAGGGAGGGG 
CTTGAACATG 
TGGGCCCAGG 
ATCTGGQTGG 
CTACCTTTAA 
TATGCTCCTG 
ACAACACAAA 
AATTTTGCTC 



GTGCCTGCAT 
QATAAAGATC 
AATCAGCCAG 
CTCCTCCTCr 
CTGAGTTATA 
TACTTTCATC 
ACATTTTTGG 
AAGATAATGA 
TCGGTTACCC 



11 
I 

ACTTCATGAC 
TAATCAAATT 
CTACAGAATO 
TCTATAAGGA 
TGOATGAAGC 
CACATGTTCT 
AACATCACAC 
CTGGGAGGAA 
AACAATCCAT 
CACAAACTCC 
ATOAAOATOT 
CTAATGAGAA 
ATCCAAATGG 
TAAAAAACGA 
CAACTACTAO 
TTTGGACAAT 
AATTATTTCA 
ATCTAGAGGA 
TTGTGGTCCT 
AAAGTTQTGA 
CATCAGAAGG 
GTACCACTTC 
OGGATATCAT 
G GTGA 



21 
I 

TAAAACACTA 
AAAGAGCTTC 



ACTTAAACAA 
TGGAAACCGT 
CACTCATAAG 
ACTGGGGCCT 
ATTACAATTC. 
AATACAAAAG 
TGGTGAATCT 
TAATAAGCAG 
AAATGGCAAT 
CACTGAGTCT 
TAAAACTGTC 
CGAACCCTCT 
GTTAGCTAAA 
CCCAATTCCA 
TCTGAAGATC 
CTTGGCATTC 
GAGTCAGTAC 
TOTTTCAGAT 
TTCAGATATG 
TTCCATAGGC 



31 
I 

AAAGCAATGG 
CGCACAGCAA 
TTTGCAATGT 
TTTTACAAGA 
CATTCTCAGA 
TGGGAGTTGA 
GTCAGAAGCC 
ATCCATGGGC 
AAATATGCAT 
ATCTCCAGTT 
GAAGAAAAGA 
TATTATAAAG 
GAAATATCTG 
AATGCAACTA 
CATAAAAATA 
GCTATAAATG 
GAGTCTGATG 
AAAATAATGC 
TGTAGTGCTA 
TCTGTCAACC 
ACATCCTTTT 
AGAAGATCAG 
TCAGATAATG 



41 

I 

CAACAAAAGC 
AAGAAACTAT 
ATCCATCTGA 
AAAAACCAAA 
AAACTAACAC 
ACAATGAGAA 
CCTCTGGCCT 
TGTTTACCCT 
TATGGATTGG 
CACCAOCATT 
ATGAAGATCA 
ATATAAAACA 
TGAGAGCCAC 
CATATGAAAA 
TTCAAAGATC 
GAACAGCAGT 
TGAATGCTAC 
TGGGAATCTC 
CACTGTACAA 
CAGAGCTGGC 
CCAAGAGTGC 
GCACAAGAAC 
AGATGCATGA 



CAAAATTGAC 
TATCAGAGTG 



CAACGCCATC 
AGGAACAGAA 
CACATGGACA 



TGAAAATGAA 
AACCAAGCAG 
GCCTAATGTG 
TACTCCCAAT 
ATATGTGTTC 
AACTGACCTG 
ATCCACCATT 
AACCCCAAAC 
GGTCATGGAT 
ACAGGGAGAA 
GTTGATGACC 
ACTGAGGCAT 
CACGATGTCT 
AGAGAGCAGC 
ATCAGAATCT 
AAACGATGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



Seq ID NOj 221 . 
Protein Accession fh AL133411.8 



MGKDFMTKTXi 
TSRIYKELKQ 
QGGBHHTLGP 
IWVAQTPGES 
TTQNPNGTES 
VPAFWTMLAK 
LLLFWLLAF 
TFLGTT83DM 



11 
I 

KAMATKAKID 
PYKKKPNNAI 
VRSPSGLLAG 
ISSSPALPNV 
8ISVRATTDL 
AINGTAWMD 
CSATLYKLRH 
RRSGTRTSBS 



21 
I 

KWDLIKLKSP 
KKDMDEAGNR 
LEHAGRKLQF 
LPLNBDVNKQ 
HPALKNDKTV 
DKDQLFHPIP 
LSYKSCESQY 
KUfTOIISIG 



31 


41 


SI 




1 

RTAKBTIIRV 


1 

NRQPTEWEKN 


1 

FAHYPSDKGIj 


60 


HSQKTNTGTE 


NQTPHVLTHK 


WBLNNENTWT 


120 


IHGLFTLENE 


WAQEQSIIQK 


kyalwigtkq 


ieo 


EEKNEDHTPN 


YAPANKKNGN 


YYKDIKQYVF 


240 


NATTYEKSTI 


BEETTTSEPS 


HKNIQRSTPN 


300 


BSDVNATQGE 


NQPDIiEDliKI 


KIMLGISLMT 


360 


SVNPELATMS 


YFHPSEGVSD 


TSFSKSAESS 


420 


8DNKMHENDE 


SVTR 







Seq ID NO: 222 
Nucleic Acid Accession 0: AL050295.1 

Coding sequence: 237-2073 (underlined sequences correspond to start and stop codons) 



GAAGGGGACA 
AGCATTTGAA 
TGCCCTCGAG 
TTTTAGAAGC 
AATCCCCAAG 
CACTGAACTG 
CTGGTGAAGA 
AATAGACTGT 
CCTACTTGAA 
TTTTGAGCAT 
GCGAGACAGG 
GTGAGGTCTT 



11 
I 

GAAGGCAGTT 
GTCTGGTCTT 
GTACACCTCA 
CTGAAAACTC 
GAGAACCACT 
GAATTACGAG 
GGCACTGAGG 
TAATATTGAG 
CAGCCTCAGT 
AAATGTGACA 
TTATGGGTGG 
CCTCCCAGGG 



21 
I 

CACCTCTGCT 
GTGAAACCCC 
CCTGAGAGGG 
CAGAAGAGAA 
TTGTGCCTCA 
TCTACTATTC 
CAAAAACGAG 
ATCAGTTTTG 

tttccaattc 
acagtctgca 

CCTCGGGAAA 
CACCATTGCA 



31 
I 

CCCGACAGCC 
ACCCTCCTCT 
TTTTOGGCAG 
AGGCCAACCA 
TGTTTATTGT 
ATCCTTTGAG 
CCGTTGCCAC 
AAAATOCATC 
ATGGGAATAA 
GACCTGCTGG 
GGTGTCTTCA 
GTTGCCTTAA 



41 

1 

TGGGAACCCG 
GGCTGTGTGA 
ATCAGCAGTA 
ACTCAAACTT 
GATTTATTCT 
TCTTCATGAA 
AAAAAOTCCT 
CTTCCTGGAT 
CACTGACCAA 
AAATGAAATC 
CAATCTCATT 
AGAACTGCCT 



51 
I 

CAAGAGCCCC 
TTGAATGGGA 
AGGTGTTAAA 
GAAGACA3SA 
TCCAAAGCTG 
CATGAACCAG 
ACGGCTGAAG 
CCTATCAAAG 
ATTACTGACA 
TGGTGCTCCT 
TGTCAAGAGC 
CCCAATGOAC 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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10 



15 



20 



25 



CTTTTTGCCT 
TTCAAGAAGA 
AAACAGCGTT 
GGTTCAAGTC 
AGTTAATACA 
ACTACAACTC 
TCATCTTTGA 
TGTCTTQOCO 
ACACCGCACT 
CAGOTGATGC 
AGAAGAAAAT 
GCGACAACAA 
TAGAATGGAA 
CTAGCTGCAQ 
GAACAACAGT 
ACATAAAAGT 
TTTCTQAOGQ 
TTTATTGGAA 
ATCTTGATGG 
CCTATCACTQ 
TTCACCCGCT 
GCAGTGGTTC 
TCCATATGGQ 



GCTTCAGGAA 
CCTCATGAAC 
CCGGAAGGGT 
TGOAAGTOTQ 
TAAAGCCAAT 
CTTTCAAGCA 
AGGGGACACA 
CTATGAAGAA 
TTTCAACAAC 
AGGTGAATAT 
AOATOTTATQ 
TCCTGTATCT 
GCAGGAAGGA 
CAGATACACC 
CATCTACACT 
GACATTCATC 
ACAAAACTTT 
CACTTCTGCT 
AGCAGAATCA 
CATATTTAGA 
GCCTCTAAAO 
CCATCACATC 
TTCCTCATCC 



GATGTTACCC 
ACTTCCTCOG 
TAOGGAATTT 
GTTGTGACAT 
GAACAAGTTQ 
GTTACTATCA 
GTCAGTCTGG 
CAGCMZTTGG 
ATGACTTOGG 
GTTTGCAAAC 
CCCATCCAAA 
TTGAACTGCT 
AAAATAAATA 
CTCAAGGCTG 
TGTGAGTTCA 
TCTGTGGCCA 
TCTATAAAAT 
GGAATTAAAA 
GTACTGACAG 
TATAAGAATT 
CTGAACATCA 
AAGTGCTGCA 
CTTCCTGCTG 



TGAACATGAQ 
CCCTCTATAG 
TACCAGGCTT 
ATGAAGTCAA 
TACAGAGCCT 
ATGAAAGCAA 
TGTGTGAAAA 
AAATCCAGAA 
TGTCCAAGCT 
TGATATTAGA 
TTTTGGCAAA 
GCAGTCAGGG 
TTCCAGGAAC 
ATGGAACCCA 
TCAGTGCCTA 
ATCTAACAAT 
GCATCAGTGA 
TATACCAAAG 
TCAAGACCTC 
CATACAGTAT 
TGATTGATCC 
TAGAGGAGGA 
JAAAAAAAAA 



AGTCAGACTA 
GTOCTACAAG 
CAAGGGOGTG 
GACTACACCA 
CAATCAGACC 
TTTCTTTOTC 
GGAAGTTTTQ 
CAOCAGCAGA 
CACCATCCAC 
CATTTTTQAA 
TGAAGAAATG 
TAATGTTAAT 
CCCTGAGACA 
GTGCCCAAGC 
TGGAGCCAGA 
AACCCOGGAC 
TGTQAGTAAC 
ATTTTATACC 
GACCAGGGAQ 
TGCAACCAAA 
TTTQGAAOCT 
TGGAGACTAC 



AATGTAGGCT 
ACCGACTTGG 
ACTGTGACAG 
CCATCACTTG 
TACAAAATGG 
ACACCAGAAA 
TCCTCCAATG 
TTCTCGATTT 
AACATCACTC 
TATGAGTGCA 
AAGGTGATGT 
TGGAGCAAAG 
GACATAGATT 
GOGTCGTCTG 
GQCAGTGCAA 
CCAATTTCTG 
TATQATOAGO 



TOGAATGGAA 
GACGTCATTG 
ACTGTTTCAT 
AAAGTTACTT 
A 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1680 
1740 
1600 
1860 
1920 
I960 
2040 



Seq ID NO: 223 Protein sequence; 
Protein Accession ft; CAB43394.1 



30 



35 



40 



i 
I 

MKSPRRTTLC 
BBYTVNIBIS 
SCETGYQWPR 
GFQSDLMNTS 
LKLIHKANEQ 
HVSWRYBEQQ 
CKKKIDVHPI 
DSSCSRYTLK 
SVSEGQNFSI 
GTYHCIPRYK 
TFHMGSS8LP 



11 
I 

L*IFIVIYSSK 
FKNASFLDPI 
ERCLHNLICQ 
SALYRSYKTD 
WQSLNQTYK 
LEIQUSSRP8 
QILANBEMKV 



21 

I 

AALNWNYEST 
KAYLNSLSFP 
BRDVFLPGHH 
XiBTAFRKGYG 
MDYNSFQAVT 
IYTALFNNMT 



KCISDVSNYD 
NSYSIATKDV 
AVKKKKKK 



SGTTVIYTCE 
EVYWNTSAGI 
IVKPLPLKLN 



31 
I 

IHPLSLHEHE 
IHGNNTDQIT 
CSCLKELPPN 
ILPQFKOVTV 
INESNFFVTP 
SVSKLTIHNI 
CCSQGNVNWS 
PISAYGARGS 
KIYQRFYTTR 
IMIDPLEATV 



41 

I 

PAGEEALRQK 
DILSINVTTV 
GPFCLLQEDV 
TGFKSGSVW 
BIIFBGDTVS 
TPGDAGBYVC 
KVEHKQBGKI 
ANIKVTFISV 
RYLDGAESVL 
SCSGSHHIKC 



51 
I 

RAVATKSPTA 
CRPAGNBIWC 
TLNMRVRLNV 
TYBVKTTPPS 
LVCEKBVLSS 
KtilLDIPEYB 
HIPGTPBTDI 
ANLTITPOPI 
TVKTSTREWN 
CIEEDGDYKV 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



45 



50 



55 



60 



65 



70 



75 



Seq ID HO: 224 pNA sequence , 

Nucleic Acid Accession #i NM_007268 

Coding sequence: 46-1245 (underlined 



sequences correspond to start and stop codons) 



GGTAGCAGGA 
GGCCTGCTAC 
CCAGAGAGTG 
CTGCAAGGCT 
ACCATCTTTC 
CTGCATGTGA 
GATGACCGGA 
GTGAGAGATA 
ACAACTGOCA 
CAGGCTGSGG 
GAACCCATCA 
TCAGGCTCCT 
GTGAAGTTTG 
ACCATGACAT 
GACATGGATG 
GCCATCATCC 
CTCTGTCGGA 
GAGGCCAACG 
GATGAGCCAA 
OAGTACCAGA 
CTGGATTATG 
CCAGGATCTG 
GCTACCTCTC 
TCACTGGCTT 
GCTCTGGGCC 
GGGAAGATGC 
GCATCTTGCC 
TATCCAGGAT 
ACAGGCCAGG 



11 

I 

GGCTGGAAGA 
TCCTGGGGCA 
TAACAGGACC 
ACACCCAAGT 
TAOGTGACTC 
GCCACAAGGT 
GCCACTACAC 
AGATTACTGA 
GCGGTTATGG 
GTTCTCCTCC 
AAGTAGCAAC 
ATTTCTGCAC 
TGGTCAAAGA 
ACCCCTTGAA 
GCTACCTTGG 
TCATCATCTC 
AOACATCCCA 
ACTCTGGAGA 
CTTCCCAGAA 
TCATCGCCCA 
AGTTTCTCGC 
CTGACATAAT 
TTCCTGGATA 
TGCCCTGGAA 
CTTCTAGTAT 
CGATAGCACT 
ACCAGAAGAC 
CATTTCTCTT 
GTTCAGTTCT 



21 

I 

AAGGACAGAA 
CCTAACAGTG 
TTGGAAAGGG 
CTTGGTGAAG 
TTCTGGAGAC 



31 



GTGTGAAGTC 
GCTCCGTGTC 
CTTCACQGTO 
CATCAGTTAT 
CCTAAGTACC 
TGCCAAGGGC 
CTCCTCAAAG 
AGCAACATCT 
AGAGACCAGT 
CTTGTGCTGT 
ACAAGAGCAT 
AACCATGAGG 
TCTOGGCAAC 
GATCAATGGC 
CACTGAGGGC 
TGCCTAGTCA 
GCCCAAAGTG 
TTTGCCAGAT 
CTCTGCCGGG 
AGGACTTGGT 
CCGAGGGAGG 
TCTTCAGGGC 
GCTCCTCCAC 



GTAGCTCTGG 
GACACTTATG 
GATGTGAATC 
TGGCTGGTAC 
CATATCCAGC 
GTATCCCTCC 
ACCTGGCAGA 
CAGAAACTCT 
CCCCAGGGAA 
ATTTGGTATA 
TTACTCTTCA 
CAGGTTGGCT 
CTACTCAAGA 
ACAGTGAAGC 
GCTGGGCCAG 
ATGGTGGTTT 
GTCTACGAAO 
GTGGCCATCT 
AACTACTCTG 
AACTAOGCCC 
AAAAGTGTCT 
GTCCTTGCCT 
TCCGCCTACC 
GCATCTCAAG 
GGCTTCTGGT 
CATCATGCCT 
CTCAGCTCTG 
CAGACAGCTT 
TATAAGTCTA 



41 

I 

CTGTGATGGQ 
GCCGTCCCAT 
TTCCCTGCAC 
AACGTGQCTC 
AGGCAAAGTA 
AATTGAGCAC 
CTCCTGATGG 
CTGTCTCCAA 
TGAGGATTAO 
AGCAACAGAC 
AOCCTGCGGT 
CTGAGCAGCA 
CCAAGACTGA 
AGTCCTGGGA 
GAAAGAGCCT 
TTACCATGGC 
CAGCCAGGGC 
TCGCAAGTGG 
ATGAGCCCTO 
GCCTGCTGGA 
QTTAAAAATO 
TCTGCATGGC 
AACACTGGAG 
TAAGCCAGCT 
ACTCCTCTCT 
ACAGACACTA 
CCAGCTCAGA 
TTAATTGAAA 
ATOTTCTGAC 



51 
I 

GATCTTACTG 
CCTQGAAGTG 
CTATGACCCC 
AGACCCTGTC 
CCAGGGCCGC 
CCTGGAGATG 
CAACCAAGTC 
GCCCACAGTG 
CCTTCAATGC 
TAATAACCAG 
GATAGCCGAC 
CAGCGACATT 
GGCACCTACA 
CTGGACCACT 
GCCTGTCTTT 
CTATATCATG 
ACATGCCAGA 
CTQCTCCAGT 
CATAGGACAG 
CACAGTTCCT 
CCCCATTAGG 
CTTCTTCCCT 
COGCTGGGAG 
GCTGGATTTG 
AAATACCAGA 
TTCAACTTTG 
GGACCAGCTA 
TTGTTATTTC 
TCTCTCCTGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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TGCTCAATAA ATATCTAATC ATAACAGCAA AAAAAAAAAA AAAAAAA 

fieq ID MO: 22S prp£qjn, sequence: 
Protein Accession #: NP_009199.1 

1 11 21 31 41 51 

I I I I I I 

MGIIiLGLLLL GHLTVDTYGR PILEVPESVT GPWKGDVNLP CTYDPLQGYT QVLVKWLVQR €0 
GSDPVTIFLR OSSGDHZQQA KYQGRLHVSH KVPGDVSLQL STLBMDDRSH YTCBVTWQTP 120 
DGNQWROKI TBLRVQKLSV SKPTVTTGSG YGFTVPQGMR ISLQCQARGS PPISYIWYKQ 160 
QTNNQEPIKV ATLSTLLFKP AVIADSGSYF CTAKOQVGSE QHSDIVKYW KDSSKLLKTK 240 
TEAPTTMTYP LKATSTVKQ8 WDWTTDMDGY LGETSAGPGK SLPVFAIILI I8LCCMWPT 300 
MAYIMLCRKT SQQBHVYEAA RAHAREANDS GBTNRVAIFA SGCSSDBPTS QHLGNNYSDE 360 
PCIGQEYQII AQINGNYARL LDTVPLDYEF LATEGKSVC 



Seq ID NO: 226 pNA, sequence 

Nucleic Acid Accession ft> XM_6432l 

Coding sequence: 1-2079 (underlined sequences correspond to start and stop codons) 



ATQG TCGCCA 
CCAGGACCAC 
GAGCCTGGTA 
GGGGTTGGGA 
ATCOGTCAGG 
CCATACGAAG 
CTCACAAATG 
CATAATCAAT 
TTCCCCTCCA 
CCTGAAGTTA 
TCCCTTTGTG 
GAAAGCAGAA 
CGCAAGTCAC 
GTTCACACCA 
CAGCCAATAC 
GGTGATCTTG 
CGGAGGAGCC 
CTCAAGAGGT 
CCCACATTCA 
TCCCTGCTCC 
GGGGGG05CC 
AGGCCAGGGA 
TCCTCTTTAG 
TGCTTAGAAG 
GGTCCAGACA 
CAGCACCAGA 
AATGAATATG 
TATTCCCCAA 
GTCTCCCTTT 
GTCGTAGCAG 
CAGAAGCAGC 
CGCAAGTTCC 
CCCCCGGCCG 
TGCTCCTGCT 
TTCATTCCAC 



11 
I 

QTTCCGATCA 
GCCTCOGCTC 
CTTACAGGGA 
TCACGGGAAA 
AGGATGCCTT 
CTACCTTGCA 
GCTACCTGCC 
ATCCTAATGG 
CAGCCACCAC 
AACTAAAAAT 
GAGACCTTTT 
AAGAAAAGAG 
ACAAAATCCC 
TATCAGAAAA 
TATCTTCTGT 
TGTGGTCCAA 
ATCACTGTTC 
CTTTCATGGT 
AGGGAACTGC 
TTGGTCATTG 
GCTGGGTGTG 
AGGAACCTCT 
CCACCTCTCA 
ACTATGCAGG 
GCAGGACTGG 
GATATCTCCT 
ATTGCAAGCT 
CCCACATCCT 
CCAAGTTCCT 
TACGTAGGAG 
CCTGTCCTGC 
ACGTGATGGC 
CGCCCCGCAA 
CCCAGGATOT 
ATCCCTGGGT 



21 
I 

AGACAGAGCC 
TGCCCAGAGG 
GGGTGGTGGA 
CACAGTTCAA 
TGATAACAAA 
GCAAAGCTTT 
ATCAATCAGC 
AAATTCTAAA 
TTCGGTACCA 
AACCAAAACT 
AAATGAAGTA 



CAAATTAGAA 
ACCAAGGGAA 
TCCAACAACA 
GGTGACGGTC 
CAGCTGCCTG 
TTCTTCCTTG 
CCAGATGGGC 
GGAAGGAACA 
GCAGCATCAG 
GAGACTCAGT 
GGGCTGCTGG 
GGGCCGCCAT 
AAGCCTTAGA 
GCAAGGCCTC 
AGAGACGAGA 
TCAGTCTGAA 
CAAACGCAAA 
ATCCAATATG 
CAAGTACACG 
TCAGAAGAGG 
GGTGGCTGAC 
GTATCTGACT 
GCCCTTCGGC 



31 
I 

CCGTATCTTC 
CCAAAAGCAG 
GCCATCGTCC 
CAACCACCTC 
ATTGACATTG 
CAATACTCAC 
ATGTATGAAA 
CAGAAGACCA 
CAAACTGTGA 
ATCCAGAATG 
CAGGCAAGTG 
AAAAAGCATG 
CCAGAGGAAC 
GATCCAGTAC 
GAAGTGTCCA 
ACACCCTGTT 
GAGATCTTGG 
AAGTTCCTCA 
TGGTCACCTA 
GACCAGATGT 
AAGCCTCAGA 
TTCCTACGAT 
TGTTCGCCCC 
TTGACACTCA 
GCTGTCGGCA 
CTAGGTGGGT 
GAAGCGGCGT 
AGTGCCCCTA 
GCAAACAGCC 
CCTGGCACAA 
CCTGCCTGCC 
GGCCTGTCAG 
AGACGCCAGC 
GGAGTTTCTG 
TCCTCCTAG 



41 
I 

CAGGGACACT 
CCCAACAAGA 
TCACGTATGC 
AACTCACTGA 
CTGAAGATGG 
CTACAACAGA 
TTCAAACCAA 
CATTAAATTC 
TTCCAAAGAA 
GCAGGGAATT 
AGCACACGAA 
ACTCATCAAG 
AAAATAGACC 
TAAAAGAGGA 
CTGGTGTTAA 
GGGTGCCCCG 
TGCTGGTGCC 
CCTCCACGGG 
TGGCCTCCAC 
CATCCAGGGG 
TCCGCATCTC 
GTGAAGTGGA 
CAGACCACGT 
GAGCCCAGGA 
AGAGATACTG 
TCTTGGAAGA 
CCTCAACTCC 
ACCACTACTT 
ATTTCCTGCA 
GGGGGTGGGG 
ACGCACAATG 
GAAGATGTAG 
AGCTGCCGGG 
GATTAAAGGC 



51 
I 

AGACAAGATG 
GCCCGGCATT 
GCTGGGGATC 
CTCCGCCAGC 
TGGCCAGACA 
TCTTCCTCCA 
ATACCAGTCG 
TAGAAAACCC 
GAQTGGCTCA 
GTTCAAGTCT 
GTCAAAGCAT 
ATCTGAAGAG 
AAATGAGAGG 
AGCCCCAGTT 
GTTTCAAGTT 
CCTGCGAGGA 
AGCCCTCAGC 
CAAACAGAAG 
GACCAATGTC 
OCCGGAATTT 
CATCTGCCAC 
GAGAAGAATC 
CTGTGAGAAA 
AGCCTTTCTT 
CAGGAACAGC 
AAGGAATGCC 
AAGAATCCCG 
T CCCTACC AC 
CCTGTGTGCA 
TGGCCACAAA 
GGAGACATTC 
GGGGCAGGAG 
GGCTCCGGGC 
CAGTCGTGGC 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



Seq ID NO: 227 frot;ej.n. sequence: 
Protein Accession XP_064321.1 

1 11 21 31 41 SI 

I I till 

MVA3SDQDRA PYLPGTLDKM PGPRLRSAQR PKAAQQBPGI BPGTYREGGG AXVLTYALGI 60 

GVGITGNTVQ QPPQLTDSAS IRQBDAFDNK IDIABDGGQT PYBATLQQSF QYSFTTDLPP 120 

LTNGYLP9I8 MYEIQTKYQS HNQYPNGNSK QKTTLNSRKP PPSTATTSVP QTVIPKKSGS 180 

PEVKLKITKT IQNGRELFKS SLCGDLLNEV QASEHTKSKH BSRKEKRKRP KKHDSSRSBE 240 

RKSHKIPKLE PEEQNRPNBR VHTISEKPRE DPVLKEEAPV QPILSSVPTT EVSTGVKFQV 300 

GDLVWSKVTV TPCWVPRLRG RRSHHCSSCL EILVLVPALS LKRSFMVSSL KELTS TGKQK 360 

PTFKGTAQHG WSPMA9TTNV SLLLGHWEGT DQHSSRGPEP GGRRWVWQKQ KPQIRISICH 420 

RPGKBPLRLS FLRCEVERRI SSLATSQGCW CSPPDHVCEK CLBDYAGRRH LTLRAQBAFL 480 

GPDSRTGSLR AVGKRYCRNS QHQRYLLQGL LGGFLBERNA NEYDCKLETR BAASSTPRIP 540 

YSPTHILQSE 8APNHYFPYH VSLSKFLKRK ANSHFLHLCA WAVRRRSNM PGTRGHGGHK 600 

QKQPCPAKYT PACHAQWETF RKFHVMAQKR GLSGRCRGQQ PPAAPRKVAD RRQQLPGAPG 660 
CSCSQDVYLT GVSGLKASRG PIPHPWVPFG SS 



285 
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Seq ID HO: 226 j 
Nucleic Acid Accession #: NMJ>06033 

Coding sequence i 253-1752 (underlined sequences correspond to start and stop codons) 



1 
I 

AGCAGCGAGT 
CCGTTGACAC 
CCTTCTCTOC 
GCTGGAAACA 
TGGCGGGGCA 
TTTGCTOCGO 
AAACCCAAAG 
AAGGACCCAO 
TGCAGTTTCA 
ATCTTTOAAA 
AATGTAGTTG 
AATACCAGGG 
GATTTTTCTC 
TATGCAGGCA 
CCCATGTTTG 
GATQTCCTCC 
GGCCACATTG 
GTCTTGGGAT 
GCCGTCCACC 
TGCACTGACT 
AATAGCATTG 
AAAACCCGGG 
AGTTACAAGA 
GCAGATTCCC 
ACCTTCCTGG 
GAGGGGGCCT 
CCCCGCAACC 
CAGCGGAAAC 
GAGCTCTGGT 
GTGGAGCTTC 
TATCCAAGCC 
AGCCTTGACC 
GGAGGGGACT 
TCTGTCCACA 
GCCTCTCGTG 
GAGGAACGCT 
GAGCCTCAGT 

TCTTAGCCAT 
TATTTTTCTG 
ATGTGGTAAT 
TAGACCCTCT 
CTGGGGGAGG 
CTTCTTAGTT 
GTGTTTGATC 
GCCGGTTGCC 
ATGACATATG 
TCTAAGATTT 
CTTTTCTTGT 
ACACAATATC 
GTAGGGGGAG 
TCAGCCACAT 
TTCCTGGAGG 
AATATGCCCC 
ATCCTAAGCC 
TTATGGCTGA 
TGTACTTAGG 
TTAAACTTCT 
GTGTGCTTCT 
TATCCCTTAT 
TTTGGAATGT 
AAAATTCATT 
TATTTTAAAT 
CATTTTTTGT 
TTTTTGGATT 
TCTGTTTTTA 



11 
I 

CCTTGCCTCC 

TCGCTCccra 

CTCCAGTCCC 
CCAAGAGGTG 
OGATQAGCAA 
GGAGCCCCGT 
CTACACAGAC 
AGCATGAAGG 
ACATGACAGC 
ACTGGCTGCA 
TGGTTGACTQ 
TGGTGGGACA 
TCGGGAATGT 
ACTTCGTGAA 
AAGGGGCCGA 
ACACCTACAC 
ACATCTACCC 
CAATTGCATA 
TCTTTGTTGA 
CCAATCGCTT 
GCTACAATGC 
CAGGCATGCC 
ACATGGQAQA 
AGACTCTGCC 
TCTACACCGA 
CTCAGTCTTG 
CCGGACGGGA 
TGACATTTTG 
TTCGCAAGTG 
C CTGAG GGTQ 
CATGGAGGAA 
CTGGAGCACT 
GCGCTGCTAT 
CCTCCAQAGC 
CACACTGGAT 
GGCTCCGAAG 
GAGAAGTCCT 
GGCCCTAGCT 
TCCGTCCTGC 
TTCATTTTTT 
GGACATATTA 
TTCTGTTTGG 
CTATAGGATA 
ATTATGTGCC 
ACTAGCAGAA 
AGATATAACT 
TAATACACAT 
AATACAGTGC 
TGAACTTCTT 
TGAGACACTT 
AAGGGCAACT 
GCCTAGACTT 
CAAAGTCTAT 
AGACGTGAGA 
CTTTTATTAA 
GATTCGGGAG 
TTTTCTAAGG 
GAAGACATAA 
CTTGTTTCTG 
CCAAAATGCT 
TTGCATATAC 
GATGTGTCAG 
AQTTGTGTAC 
CTTGTTGGTG 
AGGGTTGCTC 
CTAATGGAAG 



21 
I 

CGGCGGCTCA 
CCACCGCCCG 
CCAGCCCCTG 
GTTTTTGTTT 
CTCCGTTCCT 
ACCTTTTGGT 
TGAGGTCAAA 
ATGCTACCTC 
TAAAACCTTT 
CAAACTCQTG 
GCTCCCCCTG 
CAGCATTGCC 
CCACTTGATC 
AGGAACGGTG 
CATCCACAAG 
GCGTTCCTTC 
CAATGGGOGT 
TGGAACAATC 
CTCTCTGGTG 
CAAAAAGGGG 
CAAGAAAATG 
TTTCAGAGTT 
AATTGAGCCC 
ACTGGAAATA 
GGAGGACTTG 
GTACAACCTG 
GCTGAATATC 
TACAGAAGAC 
TCGGGATGGC 
CCCGGGCAAG 
AGTTACTGCT 
GGGAACAACT 
AGCTCTTGCT 
ACCAAGTCCA 
TGGTTTCTCA 
AGGCCCTGTG 
TCCGACAGGA 
GTTOGGGTTC 
TCCCCAGCTC 
AATTGAGCAA 
CTGAGCCTCT 
ATGGTGTATG 
TAAGCATTAG 
ACCTTCTTAG 
TAGCAAGCAG 
GCTTTGGAGC 
CTGTGTACAC 
TTTTTTTCCT 



31 
I 

GGACGAGGGC 
GGCTCCGTGC 
GCCGAGAGAA 
TTTAAAACTT 
CTGCTCTGTT 
CCAGAGGGAC 
CCATCTGTGA 
TCCGTCGGCC 
TTCATCATTC 



41 
I 

AGATCTCGTT 
GGOCAAGTTT 
GGGTCTTACC 

ciu r m-Ti u 

TCTGGAGCCT 
GGCTGGAAGA 
GGTTTAACCT 
ACAGCCAGCC 
AOGGATGGAC 



GCCCACCAGC 
AGGATCCTCG 



ACACTTTTCA 
ATTATTATCC 
ATATACTAGT 
CTCTGAAACT 
CAAACAAGGA 
TGTATAACCA 
GAAGTGTGAC 
ACATTGTTTT 
CCAOTTOAGT 
TGATTGCTTT 
TGGAACCAGA 
ATAATGAGAT 
TTACACCTTA 
ATQAAGCATG 
CTCAAAAAGT 
AACCCATATT 
CTTTGCA 



GGCCGAATCA 
AGGCTCTCTC 
GGCTTGAGCA 
GACTTCCAGC 
ACAGAGGTGG 
AATCAGGACA 
ATCTGTCTGA 
AGGAACAAGA 
TACCATTATC 
ACCTTTTACG 
GTGGAGOGGA 
GGAGACCTCT 
TGGAAGGAGT 
AGGCGCATCC 
CCTGAGAACA 
TGGAGGATGA 
TCTTGCCAGC 
GAGGACCCAC 
GOTCTCCTGT 
OCCTCTCTTG 
GATTTGTGTG 
GTTGCTGGGC 
TAGAAGGCTG 
GCTGACTCAT 
TCATGGGTTG 
ACTCTCTGAA 
ATGTCTATTG 
CCATTTGGAA 
TGTATATGCA 
GGACCCTGAO 
TTATTATGTG 
AGTATCATTC 
AAATCTCTTC 
AGAAACCGGC 
CTTTGAAATA 
CCAATTCTAG 
AAAGATTTGT 
CTATTTTACA 
TAGTGGTGCA 
CCATGAAGAC 

L Tmrrm ' 

GGAGAACATC 
ACCAAGCAGG 
AATCTGTATC 
CTTATTTCAA 
CTAGCCAAAG 
AGTGTTTCAA 
ATTTTGGGAA 
TCCACATAGC 
GTTTGTGGTA 
TTTGGATTTT 
ATTGGCTGTA 



TTTACACGGA 
ACTGGCTGCA 
TCGGAGCGCA 
CAGGTTTGGA 
CGGACGATGC 
TTGGTATTCA 
CAGGCTGTGG 
TAAAATQTGA 
AOCCGAGTTT 
GCTGCCGCAA 
GGAACAGCAA 
AGATGAAAAT 
TCACCCTTTA 
TCGAGCAGAA 
TGAAGATCCA 
TTCGCAGCTA 
GGGTGAAGTC 
CCAGCATATC 
AAAACGAAAC 
AAGGCAGCAA 
CCAATGGAAG 
GATGGCTGGG 
AATAGCTCTA 
TAAGCAGCTG 
GAGCCTGTAC 
TCAGCTGCTC 
GTCAGGATGG 
CACTGACCAT 
GCACACATCA 
AACACTTAAA 
CCCAGTGGAG 
TGGGGAAAGG 
GCTTTAAGTG 
CCACCTCCCC 
ATGCTGGGGC 
TGTTTAGAGA 
ACCTGCCAGA 
TTTTACTTTA 
ATCTTGATTT 
GTATGCATTG 
AAACTGAGGC 
GCCAGGGAGA 
TTTTGCAGCC 
TATATAGAGC 
TGTGCCAACG 
AGAGGAAGAA 
GTGCCAAAGT 
GATATGTTCT 
CGAAGCTTGT 
ATTTTAGATT 
TAGGACCCGA 
CTGAGGGTAA 
ACTTATGTGA 
GGAGCATTTC 
CATCCTGGTC 
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I 

CTGGGGCAAG 
TCATTTTCCA 
GGCCGGGATT 
GGAGGGGGTG 
CTGCTATTGC 
TAAGCTCCAC 
CCGCACCTCC 
CTTAGAAGAC 
GATGAGCGGT 
GAAAGACGCC 
TGCGGTCAAT 
GGAGAAGGAC 
CGTGGCCGGG 
TCCTGCOGGG 
AGATTTTGTG 
GATGCCTGTG 
ACTCAAOGAT 
GCATGAGCGA 
TGCCTTCCAG 
GAACCGTTGT 
AATGTACCTA 
CCATGTCTTC 
TGGCACTAAT 
TGCCACCAAC 
GCTCACCTGG 
CCTGTCTCAA 
TGGGGAAACC 
CCCAGGCCGG 
CAGTCCCACT 
GACTTCCTGC 
GATTCTTCTC 
ACTCCTCGCa 
ACTCCAAACC 
GGTGCCTGGG 
TCTGCCTGAC 
AGCCTGCTTT 
CAGGCCTGGT 
ACTGCTTACG 
TTGGCTTTCC 
ATTAATTAGA 
TTGGGATTTC 
CACCTGGGGC 
GTTTCTATTT 
TATGAGTGAC 
CAGAATGATG 
GATAGAAGTT 
CAGAGCTGGT 
ATACCAGTGC 
GAATTAATAC 
CCTAATTAGA 
TTAGTGAGGT 
GGACTCAGAT 
AGTTCCCACC 
CATCCATAAA 
GTTGGACTTT 
TGATTTTCTT 
TGTATCACTG 
CAAGCCAATT 
ACAGGTTGAG 
ATTTTCAGAT 
GCCTAAACAC 
TTTTATACGA 
OGGGTTTTCC 
GGATTTTGGA 
ACTTCTGACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 



Seq ID NO: 229 frgtefa sequence; 



286 



WO 02/079492 



PCT/US02/04915 



Protein Accession ft: NPJJ06024.1 



1 11 21 31 41 51 

I I I I I I 

KSNSVPLLCF WSLCYCFAAG SPVPFQPEGR LEDKLHKPKA TQTEVKPSVR FNLRTSKDPB 60 

HBQCYLSVGH SQPLEDC3PN MTAKTFPIIH GNTHSOIPEH K1HKLVSALH TRBKDANVW 120 

VDWLPIAHQL YTDAVNNTRV VQHSIARMLD WLQKKDDFSU GHVHLIGVSL OAHVAOYA02J 180 

PVKGTVGRIT GLDPAGPKFB OAOIHKRIiSP DDADPVDVLH TYTRSFGIiSI OIQMPVGHID 240 

IYPNGGDFQP GCGWDVUGS 1AYGTITBW KCEHERAVHL FVDSLVNQDK PSPAPQCTDS 300 

HRPKKGICLS CRKNRCK3IQ YNAKKMRNKR N3KKYLKTRA GMPFRVYHYQ MKIHVF8YKN 360 

HGKIBPTPyV TLYOTSA0SQ TLPLKIVBRI EQNATWTFLV YTEEDLGDLL KIQLTWEGAS 420 

QSWYNMOCBP RSYLSQPRNP GRELN1RRIR VKSGBTQRKIi TFCTEDPENT SISPGRBLWF 460 
RKCROGWRMK NBTSPTVELP 
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It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 
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WHATISCLAIMEPIS; 

1 1 . A method of detecting an angiogenesis-associated transcript in a cell in 

2 a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridized to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1 -8. 

1 2. The method of claim 1 , wherein the biological sample is a tissue 

2 sample. 

1 3. The method of claim 1 , wherein the biological sample comprises 

2 isolated nucleic acids. 

1 4. The method of claim 3, wherein the nucleic acids are mRNA. 

1 5. The method of claim 3, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 6. The method of claim 1 , wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1 -8 . 

1 7. The method of claim 1 , wherein the polynucleotide is labeled. 

1 8. The method of claim 7, wherein the label is a fluorescent label. 

1 9. The method of claim 1, wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 10. The method of claim 1, wherein the patient is undergoing a therapeutic 

2 regimen to treat a disease associated with angiongenesis. 

1 11. The method of claim 1, wherein the patient is suspected of having 

2 cancer. 

1 1 2, An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1 -8, 

1 13. The nucleic acid molecule of claim 1 2, which is labeled. 

1 1 4. The nucleic acid of claim 1 3, wherein the label is a fluorescent label 
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1 1 5. An expression vector comprising the nucleic acid of claim 1 2. 

1 1 6. A host cell comprising the expression vector of claim 1 5. 

1 1 7. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1 -8 

1 1 8. An antibody that specifically binds a polypeptide of claim 1 7. 

.1 19. The antibody of claim 1 8, further conjugated or fused to an effector 

2 component. 

1 20. The antibody of claim 1 9, wherein the effector component is a 

2 fluorescent label. 

1 21. Hie antibody of claim 1 9, wherein the effector component is a 

2 radioisotope. 

1 22. The antibody of claim 19, which is an antibody fragment. 

1 23 . The antibody of claim 1 9, which is a humanized antibody 

1 24. A method of detecting a cell undergoing angiogenesis in a biological 

2 sample from a patient, the method comprising contacting the biological sample with an 

3 antibody of claim 1 8. 

1 25 . The method of claim 24, wherein the antibody is further conjugated or 

2 fused to an effector component. 

1 26. The method of claim 25, wherein the effector component is a 

2 fluorescent label. 

1 27. The method of detecting antibodies specific to angiogenesis in a 

2 patient, the method comprising contacting a biological sample from the patient with a 

3 polypeptide which is encoded by a nucleotide sequence of Tables 1-8. 
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